Labelboxposted 1 day ago
$250,000 - $300,000/Yr
Full-time • Senior
San Francisco, CA

About the position

As an Applied Research Engineer at Labelbox, you’ll sit at the junction of advanced AI research and real product impact, with a focus on the data that makes modern agents work—browser interactions, SWE/code traces, GUI sessions, and multi-turn workflows. You’ll drive the data landscape required to advance capable, adaptable agents and help shape Labelbox’s strategy for collecting, synthesizing, and evaluating it. You will possess expertise in LLM agents and planning/execution loops, plus creativity in tackling problems across data design, interaction, and measurement. You’ll publish meaningful results, collaborate with customer researchers in frontier AI labs, and turn prototypes into reliable, scalable features.

Responsibilities

  • Create frameworks and tools to construct, train, benchmark and evaluate autonomous agent capabilities.
  • Design agent-focused data programs using supervised fine-tuning (SFT) and reinforcement learning (RL) methodologies.
  • Develop data pipelines from diverse sources like code repositories, web browsers, and computer systems.
  • Implement and adapt popular open-source agent libraries and benchmarks with proprietary datasets and models.
  • Engage with research teams in frontier AI labs and the wider AI community to understand evolving agent data needs for frontier models and share best practices.
  • Collaborate closely with frontier AI lab customers to understand requirements and guide model development.
  • Publish research findings in academic journals, conferences, and blog posts.

Requirements

  • Ph.D. or Master's degree in Computer Science, Machine Learning, AI, or related field.
  • At least 3 years of experience addressing sophisticated ML problems with successful delivery to customers.
  • Experience building and training autonomous agents—tool use, structured outputs, multi-step planning—across browsers/GUI, codebases, and databases using SFT and RL.
  • Constructed and evaluated agentic benchmarks (e.g. SWE-bench, WebArena, τ-bench, OSWorld) and reliability/efficiency suites (e.g. WABER).
  • Adept at interpreting research literature and quickly turning new ideas into prototypes.
  • Deep understanding of frontier models (autoregressive, diffusion), post-training (SFT, RLVR, RLAIF, RLHF, et al.), and their human data requirements.
  • Proficient in Python, data science libraries and deep learning frameworks (e.g., PyTorch, JAX, TensorFlow).
  • Strong analytical and problem-solving abilities in ambiguous situations.
  • Excellent communication skills.
  • Track record of publications in top-tier AI/ML venues (e.g., ACL, EMNLP, NAACL, NeurIPS, ICML, ICLR, etc.).

Benefits

  • Annual base salary range of $250,000—$300,000 USD.
  • Hybrid work model with 2 days per week in office.
  • Career advancement opportunities directly tied to your impact.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service