Staff Site Reliability Engineer

Pismo, Austin, TX
Hybrid

About The Position

We are seeking a highly skilled and experienced Staff Site Reliability Engineer to join our DevOps squad. This role will focus on leading technical initiatives, optimizing CI/CD pipelines, automating infrastructure provisioning, and ensuring platform resilience. The ideal candidate will have a strong background in CI/CD, Infrastructure as Code (IaC), and cloud technologies, as well as the ability to mentor engineers and contribute to the overall stability and scalability of our platform.

Requirements

  • 5 or more years of relevant work experience with a Bachelor's degree, or at least 2 years of work experience with an advanced degree (e.g., Master's, MBA, JD, MD), or 0 years of work experience with a PhD.
  • Proficiency in CI/CD tools such as Argo and Codefresh.
  • Expertise in Infrastructure as Code (IaC) tools like Terraform.
  • Strong knowledge of Docker and Kubernetes for containerization and orchestration.
  • Proficiency in monitoring and observability tools like Grafana, Grafana Loki, Honeycomb, OpenTelemetry, and Prometheus.
  • Proficiency in AWS cloud services.
  • Hands-on experience with automating deployment processes and integrating CI/CD pipelines.
  • Experience supporting and participating in on-call rotations.
  • Strong problem-solving skills, especially under pressure.
  • Ability to mentor and provide constructive feedback to engineers.
  • Effective communication and collaboration skills.

Nice To Haves

  • 6 or more years of work experience with a Bachelor's degree, or 4 or more years of relevant experience with an advanced degree (e.g., Master's, MBA, JD, MD), or up to 3 years of relevant experience with a PhD.
  • Experience with service mesh technologies such as Istio.
  • Experience with programming languages such as Golang, Java, or Groovy.

Responsibilities

  • Technical Leadership: Lead the implementation and optimization of CI/CD pipelines.
  • Develop and maintain Infrastructure as Code (IaC) scripts to automate infrastructure provisioning and management.
  • Identify and implement automation opportunities to improve efficiency and reduce manual effort.
  • Ensure best practices in CI/CD and IaC to promote consistency, repeatability, and compliance.
  • Platform Resilience: Maintain CI/CD resilience by avoiding unplanned or uncommunicated changes.
  • Serve as an example of diligence and reliability to the team.
  • Technical Contributions: Make high-impact technical contributions recognized by the team and organization.
  • Write effective post-mortem documentation for internal and external stakeholders.
  • Mentorship: Mentor and provide constructive feedback to engineers across the company.
  • Review pull requests and source code, focusing on improving CI/CD and automation practices.
  • Consultation and Problem-Solving: Serve as a consultant for engineers from different squads.
  • Solve complex and unfamiliar problems under pressure.
  • Technology Trends and POCs: Stay up-to-date with the latest technology trends in CI/CD and automation.
  • Lead and execute Proof of Concepts (POCs) to introduce new technologies to the team.

Benefits

  • Medical
  • Dental
  • Vision
  • 401(k)
  • FSA/HSA
  • Life Insurance
  • Paid Time Off
  • Wellness Program