THE OPPORTUNITY We are Amplifier, and we have built the world’s first Large Acoustic Model (LAM), a foundation model that uses human voice to detect health conditions. This is sci-fi becoming reality. We have raised significant capital from top-tier investors to turn this technology into a massive new category in healthcare. We are looking for a heavy-hitter to join our engineering core. We don't need a manager; we need a high-level individual contributor who wants to spend 90% of their time building and shipping. THE REALITY Let’s be clear about what we are signing up for. We are entering a phase of hyper-growth. We are pushing ourselves—and this technology—further than most would consider reasonable. We are doing this because we believe the outcome (saving lives at scale) is worth the intensity required to get there. We work in person in San Francisco. We believe that the hardest problems are solved at a whiteboard, not over a Zoom call. We want the energy, the speed, and the camaraderie that comes from being in the arena together. We move fast. The feedback loop is immediate, and the standards are high. You will deploy code on Tuesday that is processing patient data on Wednesday. We have fun. We are a small, tight-knit crew on an adventure. We work hard because we love the game, not because we have to. THE MISSION You will report to the Head of AI and act as the engine room for our model deployment. While the research team builds the models, you build the machine that makes them run. Your primary focus is Scale, Reliability, and Latency of our Acoustic Model. You will own the serving infrastructure that allows us to process millions of voice biomarkers without breaking the bank (or the server). The Challenge: Inference Optimization: Taking a massive transformer model and making it scream. You will work with TensorRT, ONNX, and quantization techniques to squeeze every ounce of performance out of our GPUs. Pipeline Architecture: Building the CI/CD pipelines for ML. You ensure that when Research commits a new model weights file, it seamlessly passes through testing and lands in production without downtime. Cluster Management: You will manage our Kubernetes clusters and GPU resources. You treat compute efficiency as a personal scorecard.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Senior
Education Level
No Education Listed