Principal AI Performance Engineer - GPU Kernels

Advanced Micro Devices, Inc · Amsterdam, OH
16h · Hybrid

About The Position

AMD is looking for a senior software engineer to lead the development of high-performance GPU kernels for frontier AI workloads, including large language models, mixture-of-experts architectures, and diffusion models. You will be a technical leader within a core team of incredibly talented industry specialists, driving architecture decisions and mentoring engineers while working with the very latest hardware and software technology.

THE PERSON: The ideal candidate is passionate about software engineering and has the technical leadership skills to drive complex cross-functional issues to resolution. They can influence technical direction, communicate effectively, and collaborate well with teams across AMD and with external partners.

Requirements

  • Deep understanding of modern GPU architectures and micro-architectural performance characteristics
  • 7+ years of GPU software development experience using HIP, CUDA, or OpenCL
  • 8+ years of system-level programming experience in C++ and/or Python (C++17 or later preferred)
  • Proven track record of leading technical projects from design through delivery
  • Experience with GPU profiling, debugging, benchmarking, and performance analysis tools
  • Background in high-performance computing (HPC) or other performance-critical systems
  • Familiarity with modern ML frameworks such as PyTorch, vLLM and SGLang
  • Strong written and verbal English communication skills
  • Bachelor's or Master's degree in Computer Science, Computer Engineering, Electrical Engineering, or equivalent

Nice To Haves

  • Experience working directly with the hardware ISA is a big plus
  • Experience with tile-based programming models and frameworks (e.g., Triton/Gluon, CUTLASS, CK)
  • Experience with GPU compiler toolchains (e.g., LLVM) and intermediate representations (e.g., MLIR, LLVM IR, Triton IR) is a strong plus
  • Experience mentoring engineers and driving technical excellence within a team
  • A PhD is a plus

Responsibilities

  • Lead the identification and implementation of improvements to machine-learning kernels for AMD GPUs, with a focus on performance and power efficiency
  • Define technical strategy and best practices for kernel development within the team
  • Stay informed about software and hardware trends, particularly in GPU architecture and ML algorithms, and translate insights into actionable roadmap items
  • Improve development workflows and CI infrastructure to enable faster and more reliable delivery
  • Architect and develop new, innovative GPU and ML technologies
  • Debug and resolve complex issues, and research alternative, more efficient implementations
  • Mentor junior and mid-level engineers, raising the technical bar across the team
  • Build and maintain strong technical relationships with internal teams and external partners, influencing upstream roadmaps and priorities

Benefits

  • AMD benefits at a glance.