Senior Machine Learning Engineer

Cloudflare
San Francisco, CA

About The Position

We are looking for an experienced Machine Learning Engineer to join our Bengaluru team and help scale the next generation of our AI/ML models. In this role, you will build and operate the pipelines behind the AI-driven applications, agents, and chatbots that power teams across the company, including go-to-market, engineering, and product. Because the products we build are initiated and owned by the team, you will be involved end-to-end, from shaping requirements and designing systems to implementation, deployment, and long-term ownership. You will focus on building scalable, reliable services and application backends, partnering closely with data and full-stack engineers to deliver new features and operate the pipelines and platforms behind our products. Our stack spans multiple languages and technologies, including Python, React, and TypeScript, and continues to evolve. Many of our applications also incorporate strong AI components, giving you opportunities to work with modern AI infrastructure and tools such as vector databases, MCP-based services, and Workers AI.

Requirements

  • M.S. or Ph.D. in Computer Science, Statistics, Mathematics, or another quantitative field.
  • 5+ years of ML engineering experience, with proven industry experience in a large-scale environment (petabyte-scale data and globally distributed teams).
  • Strong experience in scientific computing using Python with scikit-learn and PyTorch or TensorFlow.
  • Strong experience working with Docker & Kubernetes to build and deploy applications and systems.
  • Experience working with ML platform tools (Airflow, Argo Workflows, Argo CD) preferred.
  • Experience working with Data Scientists to deploy machine learning applications and systems for training, inference, and observability.
  • Experience with full-stack web technologies and languages (FastAPI, Streamlit, JavaScript/TypeScript, Cloudflare Workers, etc.) preferred, along with the ability to quickly learn and contribute across a multi-language stack.
  • Experience with Terraform and Google Cloud Platform (or an equivalent public cloud).
  • Experience working with CI/CD systems, version control (Git, Bitbucket, etc.) and DevOps tools.
  • Experience with databases such as BigQuery, Postgres, and SQLite, and with ETL/ELT practices.
  • Strong cross-functional collaboration experience with data engineering and data analyst teams within the function.
  • Proficiency in large language models and the frameworks necessary for implementing GenAI applications, such as chatbots and related use cases.
  • Strong communication and presentation skills, tailored to different audiences within the company.
  • Demonstrated ability to design scalable, reliable, and observable systems, with experience influencing architecture and improving platform foundations.
  • Strong problem-solving and communication skills, with a track record of working effectively with cross-functional partners and driving projects through ambiguous problem spaces.
  • Experience mentoring engineers, leading by example in code and design, and contributing to a culture of technical excellence.

Responsibilities

  • Deploy, manage & support ML Applications & Services on Kubernetes
  • Understand the MLOps landscape (tooling, tech stack, source systems, etc.) and introduce new tools and solutions for ML & AI initiatives.
  • Partner and align with Data Scientists, Data Engineers and internal teams to deliver ML solutions in a globally distributed environment.
  • Lead efficiency improvements that shorten the lead time from model training to deployment
  • Understand business and product strategy and the high-level roadmap, and align analysis efforts to support them with data insights and help achieve strategic goals.
  • Leverage Cloudflare products and services for AI & ML initiatives and applications
  • Use software engineering best practices to publish model scores/insights/learnings at scale within the company.