As one of our first dedicated product engineers, you will build the core simulation platform: Create a realistic, containerized environment where AI agents perform software engineering and other professional tasks. You will design realistic evaluation scenarios by independently identifying meaningful software engineering tasks, determining clear grading criteria, and selecting the appropriate tools and workflows that allow AI agents to reliably complete tasks. Additionally, you will develop a consumer software product that will serve as a foundation for our simulation platform, where we will task AI agents with building and modifying this product. This will require full stack skills from writing a backend in Python and a frontend with web technologies like React, Typescript, HTML and CSS. You will also establish engineering standards by setting up automated testing, practical CI/CD pipelines, effective monitoring, and clear processes to ensure high-quality code and quick issue resolution. Finally, you will influence technical direction by directly shaping product development, making key architectural decisions, and contributing to growing our engineering team at an early-stage startup.