Senior Site Reliability Engineer

AppOmniSan Francisco, CA
2d$180,000 - $200,000Hybrid

About The Position

AppOmni prevents SaaS data breaches by delivering end-to-end SaaS security. Our platform gives security teams clear visibility into posture, access, third-party connections, AI-related activity, and with built-in discovery to identify unsanctioned SaaS and Shadow AI tools. Backed by continuous monitoring and real-time threat detection, AppOmni helps enterprises identify and resolve risks early, keeping their SaaS applications secure. Recognized as a Frost Radar™ 2025 Leader and Great Place To Work ®, AppOmni continues to set the standard for innovation and customer value in SaaS security. The largest and fastest-growing global enterprises across industries trust AppOmni to secure their SaaS applications. We are looking for someone who is open to collaborative hybrid work in either the Bay Area, CA or Denver, CO. As the Senior Site Reliability Engineer (SRE), you will ensure our systems and infrastructure's reliability, scalability, and performance. Key duties include monitoring system availability, implementing automation for deployment and maintenance tasks, and proactively identifying areas for optimization. You will also collaborate with the development team to establish and refine service-level objectives and drive incident response and postmortem analysis to minimize service disruptions.

Requirements

  • Excellent technical and non-technical communication skills
  • Prior Experience as an SRE or related discipline responsible for maintaining high availability of a cloud based application, troubleshooting performance bottlenecks, configuring monitoring and alerting, and conducting incident response in a blameless environment
  • A knack for reducing manual toil tasks with automation and systematic thinking
  • Prior experience working with CI/CD tools and processes, pipelines-as-code (GitHub Actions, CircleCI)
  • At least 5+ years of hands-on experience with Python or Golang
  • A solid background in configuration management and infrastructure-as-code (Terraform)
  • Solid experience in monitoring/observability systems (Grafana, Prometheus, etc.)
  • Demonstrated knowledge with Container orchestration (Kubernetes/GKE)
  • Experience managing Kubernetes platforms and resources, and using Kubernetes deployment tool and patterns (Helm, GitOps, Knative)

Nice To Haves

  • Experience in FedRAMP or similar secure environments
  • Expertise working within highly controlled environments containing sensitive information.
  • Experience designing and maintaining CI/CD pipelines using commercial solutions
  • Experience working on and within GCP and/or AWS

Responsibilities

  • monitoring system availability
  • implementing automation for deployment and maintenance tasks
  • proactively identifying areas for optimization
  • collaborate with the development team to establish and refine service-level objectives
  • drive incident response and postmortem analysis to minimize service disruptions

Benefits

  • Generous PTO, company and floating holidays, parental and family leave, health insurance (medical, dental, vision with HSA option), EAP, company-provided life insurance, AD&D, STD/LTD, supplemental life insurance options, 401(k) with Roth, and a monthly wellness benefit reimbursement.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service