AI Trainer: Code Generation

Embedding VC • Palo Alto, CA

About The Position

We are building a focused group of engineers to improve how large language models reason through real-world code. This initiative centers on evaluating and refining multi-step reasoning trajectories derived from real GitHub repositories, with the goal of producing higher-quality, more reliable code generation outputs. This is a long-term project that requires strong engineering judgment rather than surface-level labeling. Contributors will work directly with complex code paths and reasoning flows across multiple platforms.

Requirements

Be proficient in at least two mainstream programming languages, such as Python, C++, Java, TypeScript, or JavaScript
Have real-world development experience in areas such as backend systems, frontend applications, algorithms, testing, or infrastructure
Be comfortable reading and reasoning through large GitHub repositories
Have strong written communication skills

Nice To Haves

Experience contributing to high-visibility or high-star GitHub repositories is a strong plus.

Responsibilities

Analyze and refine multi-step code reasoning trajectories generated from real production repositories. This work includes:
Reviewing model-generated reasoning sequences
Identifying logical inconsistencies or weak reasoning steps
Improving trajectory structure to produce stronger, production-grade outputs
Evaluating reasoning quality across different programming environments
