This leadership role defines and executes the strategy, operating model, and organizational development for a high-impact reliability engineering function. You will lead the teams that own infrastructure, site reliability, database engineering, observability, and incident management, ensuring the platform scales securely and efficiently. The role requires balancing feature velocity with system stability through SLOs, error budgets, and operational excellence frameworks. You will partner closely with executives to guide risk trade-offs, optimize cloud infrastructure costs, and enable AI-native workloads. This is a strategic, hands-on position in which you will build and mentor high-performing teams while shaping the future reliability posture of a rapidly evolving platform. Success in this role demands deep technical expertise, executive presence, and the ability to turn reliability into a business accelerator.
Job Type
Full-time
Career Level
Executive
Education Level
No Education Listed