Sr Engineer Observability

OptimumPlano, TX
1d

About The Position

The Observability Engineer designs and maintains the platforms that provide deep visibility into systems, networks, and applications. This role enables proactive detection, faster incident response, and data-driven operational excellence through metrics, logs, traces, and events.

Requirements

  • 5+ years of experience in observability, SRE, or platform engineering
  • Strong experience with monitoring and observability tools
  • Experience instrumenting applications and infrastructure
  • Strong understanding of distributed systems and failure modes
  • Ability to translate operational needs into actionable signals
  • Experience working within an Agile or SAFE Agile work model

Nice To Haves

  • Experience with OpenTelemetry
  • Exposure to large-scale network or infrastructure environments
  • Experience supporting NOC, SRE, or incident response teams
  • Familiarity with observability-driven automation or AIOps

Responsibilities

  • Design and operate observability platforms (metrics, logs, traces, events)
  • Build instrumentation standards and onboarding patterns
  • Implement monitoring, alerting, and dashboards for critical systems
  • Partner with engineering and operations teams on observability best practices
  • Optimize signal quality and reduce alert noise
  • Support incident response, post-incident analysis, and reporting
  • Enable observability data for AI Ops and automation use cases
  • Maintain platform reliability, scalability, and cost efficiency
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service