Site Reliability Engineering Manager II

FlywireBoston, MA
18hRemote

About The Position

The Opportunity We, at Flywire, are looking for an experienced Manager II, Site Reliability Engineering to join our team. In this role, you’ll help drive reliability, automation and performance within our cloud-based infrastructure. At Flywire, the SRE team is responsible for the lifecycle of production systems. Our team is embedded within Software Engineering teams enabling and empowering them to achieve full speed on shipping reliable and operable systems. They also work at a global scale driving initiatives to achieve production excellence.

Requirements

  • 5 years of experience within the SRE space
  • 2-5 years of leading or managing and developing SRE teams
  • Comfortable with the idea of being or becoming a generalizing specialist as we are aiming to build a multidisciplinary and balanced team based on "t-shaped" individuals.
  • Experience with at least one programming language is required as software engineering is an important part of our work and we actively use and support many different platforms and languages
  • Proficient with testing techniques such TDD or BDD will be highly valued
  • Familiarity with the container ecosystem, cloud infrastructure, build systems and CI/CD tools is key for being successful at this role
  • Comfortable taking ownership of complex systems challenges and help uncover opportunities for improvement
  • Strong communication and collaboration skills, and most importantly, empathy as we enable, empower and encourage our fellow colleagues

Responsibilities

  • Coordinate and support daily activities for SREs on the team and partner with their managers to determine approach for managing daily tasks
  • Track success on the team based on established goals and objectives
  • Work on issues of limited scope with the ability to find and execute solutions to routine problems
  • Become embedded within an Engineering team helping them navigate production excellence and advocate for best practices
  • Mentor team members and drive initiatives
  • Drive a design for a feature while understanding system-wide and architectural concerns
  • Understand the basic day-to-day tasks traits of a production environment and participate in on-call support
  • Engage and collaborate with other disciplines within the design, deployment, operation and optimization of services
  • Debug production issues across services and levels of the stack as well as practice incident response and blameless postmortems
  • Identifies opportunities both in processes and tools to improve the overall productivity of the team
  • Identify great talent and excite them to join our team
  • Provide estimations, track progress and manage risk as well as team members' time
  • Participate in an on-call shift along with other disciplines to respond to incidents
  • Become involved in tech communities and add contributions to enhance them
  • Lean into our business domain and needs as well as our company vision, mission and strategy to deliver on our short and long term goals

Benefits

  • Competitive compensation, including Restricted Stock Units
  • Employee Stock Purchase Plan (ESPP)
  • Flying Start - Our immersive Global Induction Program (Meet our Execs & Global Teams)
  • Work with brilliant people that will keep you on your toes, learn more about their journeys by checking out #InsideFlywire on social media
  • Dynamic & Global Team (we have been collaborating virtually for years!)
  • Wellbeing Programs (Mental Health, Wellness, Yoga/Pilates/HIIT Classes) with Global FlyMates
  • Competitive time off including FlyBetter Days to volunteer in your community and Digital Disconnect Days!
  • Great Talent & Development Programs (Managers Taking Flight – for new or aspiring managers!)
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service