Advanced Micro Devices • Posted 3 days ago
Senior
Hybrid • Santa Clara, CA
Computer and Electronic Product Manufacturing

As a Senior Member of Technical Staff (SMTS), you will be at the heart of AMD's AI strategy, tackling one of the most exciting challenges in the industry: training and running AI systems that make AI itself more efficient on GPUs on the fly, an effort that could dramatically alter the trajectory of AI progress. This is a high-impact, hands-on role in which your work will directly define the software that powers the future of AI.

  • Architect and Drive the AI Software Stack: Establish best practices and optimize performance from the lowest-level GPU kernels to large-scale distributed systems, shaping the foundational software for AMD hardware.
  • Accelerate Foundation Models: Directly accelerate cutting-edge applications such as foundation models (LLMs) and autonomous AI agents, ensuring AMD is the platform of choice for the most demanding workloads.
  • Innovate Across Hardware and Software: Contribute to the entire co-design lifecycle, from influencing future GPU architectures to developing groundbreaking software for new accelerators and collaborating with the broader AI community.
  • Extensive professional software development experience in performance-critical environments.
  • Long-term hands-on experience in GPU programming (HIP/CUDA) and in optimizing deep learning kernels and operators.
  • A fundamental understanding of GPU architecture and memory hierarchy, used to diagnose and resolve complex performance bottlenecks.
  • Expert-level proficiency in modern C++ and object-oriented design.
  • Deep experience using GPU profiling and performance analysis tools (e.g., AMD ROCm Profiler, NVIDIA Nsight) to diagnose and resolve complex bottlenecks in distributed, multi-GPU systems.
  • Deep knowledge of transformer architectures, attention mechanisms, and modern AI systems (Generative AI, Agentic AI).
  • Hands-on experience optimizing the post-training and inference pipelines of Large Language Models (LLMs).
  • Strong technical ownership, communication, and problem-solving skills with a track record of delivering complex technical solutions.
  • Experience with, or deep expertise in, the AMD ROCm/HIP ecosystem.
  • Experience with, and interest in, code generation and/or self-improving LLMs.