As a Data Engineer - Multimodal Systems, you will be a core contributor to creating, collecting, and improving Zyphra’s datasets and data pipelines across a variety of modalities. Your work will intersect with almost every team at Zyphra. You will be involved in collecting large-scale datasets and implementing and optimizing highly parallel data pipelines. You’ll Work Across: Large-scale data collection across a variety of modalities (text, audio, image) Designing and working with highly efficient, parallelized data processing pipelines across modalities Designing and running rigorous experimental ablations to demonstrate the impact of new data improvements
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Mid Level