We are building a focused group of engineers to improve how large language models reason through real world code. This initiative centers on evaluating and refining multi step reasoning trajectories derived from real GitHub repositories, with the goal of producing higher quality, more reliable code generation outputs. This is a long term project requiring strong engineering judgment rather than surface level labeling. Contributors will work directly with complex code paths and reasoning flows across multiple platforms.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Education Level
No Education Listed