Join Evolv as Senior Data Infrastructure Engineer in the Machine Learning & Sensors organization, responsible for building and operating the scalable, secure, and reliable data pipelines that power our AI/ML research and production systems. In this role, you will own the end‑to‑end data lifecycle—from collection on thousands to millions of edge devices, through cloud ingestion and processing, into a centralized data factory enabling model training, evaluation, and continuous improvement. Data is the backbone of our mission to deliver best‑in‑class AI‑based weapon detection systems. You will ensure that data flows seamlessly across geographies, devices, and cloud systems while meeting strict requirements for quality, privacy, security, and scale. This role is ideal for someone who thrives at the intersection of distributed systems, cloud pipelines, and ML‑driven data needs. Success in the Role: What performance outcomes will you work toward in the first 6–12 months? In the first 30 days: Develop a deep understanding of existing edge‑to‑cloud data pipelines and deployment environments. Review current data ingestion flows, governance policies, and cloud infrastructure. Assess pain points in data reliability, quality, and operational scalability. Build relationships with AI/ML, data science, field operations, and cloud engineering teams. Design and prototype data processing pipelines (both cloud and edge) Within the first three months: Design and implement improvements to core ingestion, validation, and processing pipelines. Deploy scalable data pipeline with AWS‑based components (S3, EC2, Lambda, Glue, Step Functions, SageMaker integrations). Introduce automated validation workflows to detect corruption, missing metadata, or malformed data. Design and implement automated model evaluation, model training and model improvement pipeline to speed up experiments Partner with field operations to improve data reliability, observability, and coverage across deployments. By the end of the first year: Own the entire lifecycle of mission‑critical data pipelines supporting AI/ML research and production. Architect next‑generation edge‑to‑cloud data systems that scale across millions of devices. Define and enforce data governance frameworks including retention, access control, privacy, and lineage. Enable ML teams to rapidly experiment through high‑quality, discoverable, versioned datasets.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Mid Level