The Site Reliability Engineering Lead is a senior, hands-on technical leader within the Wholesale Production Support Operations organization. This teammate is accountable for elevating the reliability, resiliency, and operational excellence of critical enterprise platforms across hybrid cloud and onprem environments. Acting as both a handson SRE expert and a crossdomain influencer, the SRE Lead drives systemic improvements in observability, automation, AIOps adoption, fault tolerance, and incident management. The role partners closely with Application Development, Infrastructure, Production Support, Platform Delivery, Architecture, Cybersecurity, Risk, and Business technology teams to uplift operational practices and deliver stable, predictable, and scalable services. This position also plays a pivotal role in building and maturing the SRE Center for Enablement (C4E) by contributing standards, repeatable patterns, runbooks, playbooks, and coaching that amplify reliability practices across the enterprise. The SRE Lead delivers measurable impact through deep expertise in distributed systems, modern operational tooling, cloud-native reliability patterns, and enterprise-scale incident/problem management.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Mid Level
Education Level
No Education Listed