AI Software Archtiect Intern

d-MatrixSanta Clara, CA
1dHybrid

About The Position

At d-Matrix, we are focused on unleashing the potential of generative AI to power the transformation of technology. We are at the forefront of software and hardware innovation, pushing the boundaries of what is possible. Our culture is one of respect and collaboration. We value humility and believe in direct communication. Our team is inclusive, and our differing perspectives allow for better solutions. We are seeking individuals passionate about tackling challenges and are driven by execution. Ready to come find your playground? Together, we can help shape the endless possibilities of AI. d-Matrix is seeking outstanding computer architects to help accelerate AI application performance at the intersection of both hardware and software, with particular focus on fundamental hardware technologies (such as matrix/vector processing, memory hierarchy, ISA abstraction) and emerging workloads (such as generative inference etc.). Our acceleration philosophy cuts through the system ranging from efficient tensor cores, storage, and data movements along with co-design of dataflow, and collective communication techniques.

Requirements

  • MS with 3+ years of experience or PhD candidates with prior industry internship experience
  • Solid grasp through academic or industry experience in multiple of the relevant areas – computer architecture, hardware software codesign, performance modeling, ML fundamentals (particularly DNNs)
  • Hands-on experience with SoC design, bus protocols, memory interfaces, network-on-chip topologies, I/O and multi-device scalable systems is preferred
  • Programming fluency in C/C++ or Python, familiarity with common ML frameworks (e.g. PyTorch) is preferred
  • Self-motivated team player with strong sense of collaboration and initiative

Nice To Haves

  • Research background with publication record in top-tier architecture, or machine learning venues is a huge plus (such as ISCA, MICRO, ASPLOS, HPCA, ICLR, NeurIPS, etc.)

Responsibilities

  • As a member of the architecture team, you will contribute to features that power the next generation of inference accelerators in datacenters
  • This role requires to keep up the latest research in ML architecture and algorithms space, and collaborate with different partner teams including design, verification, and software
  • Your day-to-day work will include (1) mapping machine learning algorithms into functional, performance specifications of key hardware units (2) proposing new features to enable or accelerate these algorithms, (3) studying the benefits of proposed features with system models (analytical, cycle-level) and generating software references
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service