Senior Compiler Engineer - AI

NVIDIARedmond, WA
1d

About The Position

NVIDIA's invention of the GPU 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI — the next era of computing — with the GPU acting as the brain of computers, robots, and self-driving cars that can perceive and understand the world. Today, we are increasingly known as “the AI computing company”. NVIDIA is hiring world class Software Engineers with AI Compiler experience for its Deep Learning Compiler Technology team. Academic and commercial groups around the world are using GPUs to power a revolution in deep learning, enabling breakthroughs in problems from image classification to speech recognition to natural language processing and artificial intelligence. Join the team which is developing advanced technology for the software which will be used by the entire deep learning community. What you’ll be doing: In this role, you will be leading by hands-on example doing technology development on problems of kernel generation and optimizations for computational graphs for next generation NVIDIA GPUs. The goal of this work will be to advance the state of the art in compilation problems of DL graphs for current and future NVIDIA GPUs and transfer this tech to products. The problems of interest will be drawn from computational graphs as encountered in inference and training workloads. Develop, both, online and offline techniques for use in the production compiler NVIDIA is developing. You will work with experts across software, hardware, and research divisions to co-design the next generation chips. As part of your role you will also be responsible for technology transfer to production groups.

Requirements

  • Masters or PhD in Computer Science, Computer Engineering, or related field or equivalent experience
  • 8+ years of relevant work or research experience in kernel generation, mega kernels, compiler optimizations, synthesis, LLM inference and computer architecture.
  • Be able to work independently, define project goals and scope, and lead your own development efforts
  • Excellent programming and software design skills, including debugging, performance analysis, and test design.
  • Strong communication skills are required along with the ability to work in a dynamic product-oriented team.
  • Experience with the following technologies is a huge plus: OpenAI Triton language and compiler; Deep learning models and algorithms; Tile based IR and domain specific language; Auto-tuning; Deep learning framework design

Nice To Haves

  • Knowledge of CPU and/or GPU architecture.
  • CUDA or OpenCL programming experience desired but not required
  • Experience in mentoring early career engineers and interns is a bonus.

Responsibilities

  • Leading by hands-on example doing technology development on problems of kernel generation and optimizations for computational graphs for next generation NVIDIA GPUs.
  • Advance the state of the art in compilation problems of DL graphs for current and future NVIDIA GPUs and transfer this tech to products.
  • Develop, both, online and offline techniques for use in the production compiler NVIDIA is developing.
  • Work with experts across software, hardware, and research divisions to co-design the next generation chips.
  • Responsible for technology transfer to production groups.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service