Sr. Site Reliability Engineer

ShipperHQAustin, TX
3dHybrid

About The Position

We’re seeking a Senior Site Reliability Engineer to join our fast-paced Engineering team. The ideal candidate is a self-starter with a strong engineering mindset who can take ownership of our cloud systems, reference architectures, deployment processes, and landing zones. This role requires deep experience with AWS, DevOps practices, and automation to support and improve our complex cloud environment.

Requirements

  • 5+ years of experience in Software Engineering, Site Reliability Engineering, DevOps, or similar roles
  • Strong experience with AWS, particularly in multi-region or multi-tenant environments
  • Background in software development with a solid understanding of modern engineering practices
  • Experience managing infrastructure using Infrastructure as Code, preferably Terraform
  • Hands-on experience with CI/CD tools such as GitLab
  • Experience working with containerized environments and modernizing applications
  • Familiarity with observability, monitoring, and incident response practices
  • Experience working with databases, enterprise systems, or eCommerce platforms
  • Proficiency in administering Linux/Unix systems
  • Strong understanding of Agile methodologies and test-driven development practices
  • Collaborative mindset with a proactive, solutions-oriented approach
  • AI fluency: You leverage a diverse AI toolkit to accelerate your output. You don't just "prompt and pray"; you have a strong grasp of AI governance, including how to protect proprietary data and how to critically audit AI-generated results for accuracy and compliance with company standards

Responsibilities

  • Design, implement, and maintain Infrastructure as Code (IaC) for cloud systems
  • Build and provision solutions that improve cloud operations and support engineering and QA teams
  • Develop, maintain, and optimize CI/CD pipelines for deployments across multiple environments
  • Collaborate with security teams to maintain a strong security posture
  • Configure and maintain highly available systems in AWS
  • Build and maintain observability, monitoring, and logging systems
  • Support software engineers and collaborate on infrastructure needs
  • Guide teams on best practices for infrastructure, deployment, and reliability
  • Implement monitoring and logging standards for infrastructure and applications
  • Develop tools and services that reduce operational dependencies on the DevOps team

Benefits

  • Collaborate with a motivated team, directly tying your results to organizational success
  • 22 days of PTO plus public holidays
  • 401k Match
  • Medical, Dental, and Vision Insurance
  • Maternity and Paternity Leave
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service