Design, develop, and optimize reinforcement learning algorithms for real-time control and locomotion of humanoid robots. Integrate learned policies into real-world robot platforms with hardware-in-the-loop validation. Collaborate with mechanical, perception, and embedded systems teams to ensure tight integration between hardware and software. Apply advanced techniques such as curriculum learning, domain randomization, and sim2real transfer to improve policy generalization. Analyze and optimize control performance with a focus on robustness, energy efficiency, and adaptability. Contribute to the continuous development of our in-house RL training pipelines and tooling.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Mid Level