Senior Site Reliability Engineer

AppOmni•San Francisco, CA

2d•$180,000 - $200,000•Hybrid

About The Position

AppOmni prevents SaaS data breaches by delivering end-to-end SaaS security. Our platform gives security teams clear visibility into posture, access, third-party connections, AI-related activity, and with built-in discovery to identify unsanctioned SaaS and Shadow AI tools. Backed by continuous monitoring and real-time threat detection, AppOmni helps enterprises identify and resolve risks early, keeping their SaaS applications secure. Recognized as a Frost Radar™ 2025 Leader and Great Place To Work ®, AppOmni continues to set the standard for innovation and customer value in SaaS security. The largest and fastest-growing global enterprises across industries trust AppOmni to secure their SaaS applications. We are looking for someone who is open to collaborative hybrid work in either the Bay Area, CA or Denver, CO. As the Senior Site Reliability Engineer (SRE), you will ensure our systems and infrastructure's reliability, scalability, and performance. Key duties include monitoring system availability, implementing automation for deployment and maintenance tasks, and proactively identifying areas for optimization. You will also collaborate with the development team to establish and refine service-level objectives and drive incident response and postmortem analysis to minimize service disruptions.

Requirements

Excellent technical and non-technical communication skills
Prior Experience as an SRE or related discipline responsible for maintaining high availability of a cloud based application, troubleshooting performance bottlenecks, configuring monitoring and alerting, and conducting incident response in a blameless environment
A knack for reducing manual toil tasks with automation and systematic thinking
Prior experience working with CI/CD tools and processes, pipelines-as-code (GitHub Actions, CircleCI)
At least 5+ years of hands-on experience with Python or Golang
A solid background in configuration management and infrastructure-as-code (Terraform)
Solid experience in monitoring/observability systems (Grafana, Prometheus, etc.)
Demonstrated knowledge with Container orchestration (Kubernetes/GKE)
Experience managing Kubernetes platforms and resources, and using Kubernetes deployment tool and patterns (Helm, GitOps, Knative)

Nice To Haves

Experience in FedRAMP or similar secure environments
Expertise working within highly controlled environments containing sensitive information.
Experience designing and maintaining CI/CD pipelines using commercial solutions
Experience working on and within GCP and/or AWS

Responsibilities

monitoring system availability
implementing automation for deployment and maintenance tasks
proactively identifying areas for optimization
collaborate with the development team to establish and refine service-level objectives
drive incident response and postmortem analysis to minimize service disruptions

Benefits

Generous PTO, company and floating holidays, parental and family leave, health insurance (medical, dental, vision with HSA option), EAP, company-provided life insurance, AD&D, STD/LTD, supplemental life insurance options, 401(k) with Roth, and a monthly wellness benefit reimbursement.

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume