Research Engineer - Language Model Pre-Training

ZyphraSan Francisco, CA
9dOnsite

About The Position

As a Research Engineer - Language Model Pre-Training, you'll shape our language model roadmap through end-to-end pretraining development. You will work extremely closely with our pretraining team, who will integrate your insights into our next-generation models.

Requirements

  • Strong engineering aptitude for rapidly implementing reliable and robust systems
  • Can rapidly learn new fields and are excited to implement new ideas
  • Excellent communication and collaboration skills, and can work effectively on both research and engineering implementation at scale
  • High proficiency with PyTorch and Python.
  • Strong ability to dive into large pre-existing codebases and rapidly get up to speed

Nice To Haves

  • Deep expertise and intuition for solving machine learning problems and training models
  • Experience with training on large-scale (multi-node) GPU clusters
  • Deep understanding of model training pipelines – including model/data parallelism, distributed optimizers, etc.
  • Strong grasp of proper experimental methodology for running rigorous ablations and other hypothesis testing
  • Understanding of large-scale, highly parallel data processing pipelines
  • Published machine learning research in well-respected venues is a plus
  • Postgraduate degree in a scientific subject (Computer Science, EE/EECS, Math, Physics)

Responsibilities

  • Large-scale training runs and model parallelization
  • Performance optimization of our pretraining stack
  • Dataset collection, processing, and evaluation
  • Architecture and methodology research, including optimizer ablations

Benefits

  • Comprehensive medical, dental, vision, and FSA plans
  • Competitive compensation and 401(k) plan
  • Relocation and immigration support on a case-by-case basis
  • In-office snacks and meals provided
  • Unlimited PTO and company holidays
  • In-person team in San Francisco with a collaborative, high-energy environment
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service