Senior Manager, Site Reliability Engineering

Peloton
New York, NY
Hybrid

About The Position

At Peloton, we provide a seamless experience for our members. To achieve that, our internal engines—Finance, HR, Supply Chain, and Legal—must run with the same precision as our world-class fitness content. As the Senior Manager of SRE for Internal Systems, you will lead the team responsible for the "Order-to-Cash," "Procure-to-Pay," and "Record-to-Report" lifecycles. You aren't just managing infrastructure; you are the architect of business continuity. You will lead a team of high-performing SREs to ensure our global SaaS ecosystem (NetSuite, Coupa, Workday) and underlying network infrastructure are resilient, observable, and ready to scale.

Requirements

  • 8+ years in SRE, DevOps, or Production Engineering, with 2+ years of direct people management experience
  • Deep understanding of Order-to-Cash or Procure-to-Pay cycles. You can translate a "database lag" into its specific impact on warehouse shipping or financial reconciliation
  • Experience managing enterprise SaaS ecosystems (NetSuite, SAP, Workday, Salesforce)
  • Solid grasp of Networking (SD-WAN, VPNs), Identity (IAM), and Endpoint Management
  • Proficiency with Datadog, Splunk, New Relic, or Prometheus
  • Proven ability to communicate technical risk to non-technical stakeholders (CFO, General Counsel, Head of People)

Responsibilities

  • Lead, mentor, and grow a team of SREs. Conduct 1:1s, define career growth paths, and foster a culture of high accountability and psychological safety
  • Transition from reactive support to proactive engineering. Align the team’s quarterly goals with broader Finance and Supply Chain digital transformation initiatives
  • Architect observability across complex business paths (e.g., ensuring a customer order flows from e-commerce through supply chain into the financial ledger)
  • Partner with business owners to define and track Service Level Objectives (SLOs) and Error Budgets for critical SaaS integrations
  • Own the Major Incident Response process for corporate systems. Ensure "War Rooms" are efficient and result in actionable improvements
  • Lead the Root Cause Analysis (RCA) process, ensuring a culture of continuous learning and systematic "toil" reduction
  • Oversee the reliability of API-driven connections and identity management (Okta/Azure AD) across our tech stack
  • Champion "Infrastructure as Code" (IaC) to automate manual hand-offs between business systems using Python, Go, or Terraform

Benefits

  • Medical, dental and vision insurance
  • Generous paid time off policy
  • Short-term and long-term disability
  • Access to mental health services
  • 401(k), tuition reimbursement and student loan paydown plans
  • Employee Stock Purchase Plan
  • Fertility and adoption support and up to 18 weeks of paid parental leave
  • Child care and family care discounts
  • Free access to Peloton Digital App and apparel and product discounts
  • Commuter benefits and Citi Bike discount
  • Pet insurance and so much more!