About The Position

Axos Bank is seeking an experienced Manager to oversee Observability & Performance Engineering. You will primarily be responsible for leading the strategy, implementation, and ongoing optimization of enterprise observability capabilities across applications, networks, and infrastructure. This role is a hands on technical leader and people manager with deep expertise in Cisco AppDynamics, ThousandEyes, and Splunk, ensuring end to end visibility, rapid root cause analysis, and improved digital experience for customers and internal users. This leader partners closely with Business Stakeholders, Application Development, SRE, Infrastructure, Network, and Security teams to drive proactive monitoring, reduce mean time to detect (MTTD) and mean time to resolve (MTTR), and support highly available, scalable, and resilient platforms. This role also oversees the “Release Management” team. The Release Management team, managed by an Operations Manager reporting to this role, ensures compliance with documented policies and controls, relative to change management software releases.

Requirements

  • Bachelor’s degree in Computer Science, Engineering, or a related field (or equivalent experience).
  • 7+ years of experience in application performance monitoring, observability, or production operations.
  • 3+ years of experience managing or leading technical teams.
  • Deep, hands‑on expertise with: Cisco AppDynamics (Required) (APM, Business Transactions, Analytics, RUM, Synthetics)
  • ThousandEyes (Required) (Network path visibility, SaaS monitoring, endpoint agents, synthetic tests)
  • Splunk (Preferred) (Search, dashboards, alerts, log and metric ingestion)
  • Experience supporting mission‑critical, customer‑facing applications in regulated or high‑availability environments.

Nice To Haves

  • Experience in financial services, banking, or other highly regulated industries.
  • Familiarity with SRE practices, SLIs/SLOs, and error budgets.
  • Experience integrating observability tools with CI/CD pipelines and incident management platforms.
  • Certifications in AppDynamics, ThousandEyes, Splunk, cloud platforms, or ITIL.
  • Strong executive communication and ability to translate technical insights into business impact.

Responsibilities

  • Observability Strategy & Leadership Own and evolve the enterprise observability strategy across application performance, network experience, and log/metric analytics.
  • Define standards, best practices, and operating models for AppDynamics, ThousandEyes, and Splunk usage across teams.
  • Translate business and customer experience objectives into measurable observability KPIs and dashboards.
  • Act as the senior technical escalation point for complex performance and availability issues.
  • Platform Ownership & Engineering Lead the design, deployment, and optimization of Cisco AppDynamics for application performance monitoring, business transaction visibility, and deep diagnostics.
  • Ensure tight integration between AppDynamics, ThousandEyes, and Splunk to enable full‑stack correlation and faster root cause analysis.
  • Drive automation, alert tuning, and noise reduction to improve signal quality and operational efficiency.
  • Oversee ThousandEyes implementations for network path visibility, SaaS monitoring, endpoint experience, and external dependency monitoring.
  • Understand and leverage Splunk for centralized logging, metrics, alerting, and correlation across applications.
  • Incident Management & Operational Excellence Partner with SRE, NOC, and Incident Management teams to improve incident detection, triage, and post‑incident analysis.
  • Establish runbooks, dashboards, and workflows that enable proactive detection of performance and availability risks.
  • Lead root cause analysis efforts for major incidents and ensure actionable remediation plans are implemented.
  • Continuously improve platform reliability, scalability, and observability maturity.
  • People Management & Collaboration Manage, mentor, and grow a team of observability and performance engineers.
  • Set clear goals, provide coaching, and develop technical depth across the team.
  • Collaborate with application development, infrastructure, network, security, and vendor partners.
  • Serve as a trusted advisor to senior technology leaders on observability, performance, and resilience.
  • Vendor & Stakeholder Management Act as the primary technical liaison with Cisco, Splunk, and related vendors.
  • Oversee licensing usage, platform roadmap alignment, and vendor‑led optimization initiatives.
  • Support audits, compliance, and governance requirements related to monitoring and logging.

Benefits

  • Medical, Dental, Vision, and Life Insurance
  • Paid Sick Leave, 3 weeks’ Vacation, and Holidays (about 11 a year)
  • HSA or FSA account and other voluntary benefits
  • 401(k) Retirement Saving Plan with Employer Match Program and 529 Savings Plan
  • Employee Mortgage Loan Program and free access to an Axos Bank Account with Self-Directed Trading
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service