Senior / Principal Research Engineer, Synthetic Data

Lila SciencesSan Francisco, MA
3d$224,000 - $336,000

About The Position

As a Sr./Principal Research Engineer focused on Synthetic Data, you’ll contribute to the vision, roadmap, and delivery of our synthetic data program—from asset generation and simulation to ML training integration and measurable model gains. You will be working with the Research Engineering team in designing, generating, and implementing artificial datasets to train, test, and improve Lila’s platform and help us reach our goals.

Requirements

  • 6+ years in applied ML/ML systems with 3+ years leading industry initiatives; track record of advanced algorithms and frameworks designed specifically for generating large-scale synthetic data.
  • 8+ years working with modern ML workflows: Python, PyTorch, dataset tooling, training loops, and evaluation frameworks; comfortable profiling and optimizing GPU-heavy pipelines.

Nice To Haves

  • Track record of building synthetic data sets from source data to measurably augment model performance in targeted domains.
  • Experience instruction fine tuning and hillclimbing therein.
  • Building product requirements and feedback into sythentic data generation pipeline at scale
  • Knowledge of quantization/distillation, routing/mixture-of-experts, and cost optimization at scale
  • Experience working in compliance-heavy environments (HIPAA, PCI, FedRAMP) and on-prem/VPC deployments

Responsibilities

  • Help to define the synthetic data strategy and multi-quarter roadmap
  • Develop evaluation frameworks that tie synthetic interventions to real model performance
  • Establish standards for asset quality, diversity, documentation, and reproducibility; contribute to a healthy review culture.

Benefits

  • We offer competitive compensation including bonus potential and generous early equity.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service