About The Position

This position is posted by Jobgether on behalf of a partner company. We are currently looking for a Senior SRE DevOps Engineer. In this role, you will have the opportunity to contribute to the development of a satellite communication platform that enables vital voice calls and messaging when traditional connectivity fails. Your expertise will play a crucial part in ensuring reliability and enhancing operational efficiency across our cloud infrastructure. You'll collaborate with a dedicated team to implement innovative solutions that address the complexities of real-time applications and bridging mobile devices with satellite hardware. This is a remote position that offers flexibility and a chance to make a significant impact in the field of cloud and DevOps engineering.

Requirements

  • 7+ years of experience in SRE/DevOps/Platform Engineering with a strong software background
  • Proficient in at least one backend language (TypeScript/Node.js, Python, or Go)
  • Deep expertise in AWS technologies including ECS, EKS, and RDS
  • Strong experience with IaC tools like Terraform or CloudFormation
  • Proven track record in CI/CD pipeline design for both on-prem and cloud environments
  • Experience in container orchestration with Docker and Kubernetes
  • Solid understanding of network security and incident response
  • Experience implementing SLI/SLO frameworks and reduction strategies
  • Operations knowledge for PostgreSQL, Redis, and message queues
  • Strong understanding of distributed systems patterns

Responsibilities

  • Implement SLI/SLO frameworks with error budgets to drive reliability decisions
  • Design release strategies including blue/green deployments and version tracking
  • Lead incident response and develop automated runbooks to reduce MTTR
  • Develop tooling and automation frameworks in TypeScript/Python for enhanced productivity
  • Write services focused on reliability, such as health checkers and auto-remediation controllers
  • Maintain production AWS infrastructure using IaC with a focus on microservices orchestration
  • Establish CI/CD pipelines for backend services and mobile apps
  • Define and enforce security policies across the infrastructure
  • Build observability features with OpenTelemetry and distributed tracing
  • Manage database configurations including PostgreSQL and Redis

Benefits

  • Build critical communication infrastructure for remote areas
  • A role merging engineering and operations with significant ownership
  • Technically challenging environment across cloud, IoT, and satellite systems
  • Full ownership of infrastructure with direct impact on reliability
  • Competitive compensation and flexible remote work options
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service