This role specializes in modern, cloud-native environments, bridging the gap between high-scale software development and advanced system observability. You will be a key architect of our "Observability-as-Code" strategy, ensuring that monitoring, alerting, and asset management are baked into the development process rather than treated as an afterthought. How you’ll make an impact Observability as Code (OaC): Utilize Grafana Cloud and Splunk Cloud to build deep visibility into system health. Manage these platforms using Configuration-as-Code (e.g., Terraform, Grafana Grizzly, or Splunk Monitoring-as-Code) to ensure environment parity and version-controlled dashboards. Incident Response & Asset Intelligence: Integrate xMatters for automated incident routing and communication. Utilize Axonius to maintain a comprehensive, real-time inventory of cyber assets and ensure security compliance across the tech stack. Documentation & Mentorship: Create and maintain high-fidelity technical documentation and runbooks to empower future engineers and managers to resolve issues independently.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Mid Level
Education Level
No Education Listed
Number of Employees
1,001-5,000 employees