Senior Site Reliability Engineer - Tactical Reconnaissance & Strike

Anduril Industries
Atlanta, GA
$144,000 - $191,000

About The Position

As a Site Reliability Engineer, you will be responsible for deploying, integrating, and managing customer and developmental cloud environments across Tactical Reconnaissance & Strike (TRS). This role requires a systems-thinking engineer who can bridge software development, platform engineering, and mission operations to ensure seamless integration of new capabilities while improving production scalability and maintaining reliability. The ideal candidate will own the end-to-end lifecycle of cloud-based deployments, drive continuous improvement of data pipelines and observability infrastructure for TRS’s growing drone fleets, and identify opportunities to leverage emerging platform services to enhance system performance and data quality. This position will also play a critical role in scaling integration best practices and building out functional capabilities across additional TRS product lines.

Requirements

  • Bachelor's degree in Computer Science or another STEM-focused field.
  • Advanced proficiency in programming languages (Python for scripting and integration).
  • 5+ years of experience with CI/CD tools like GitHub Actions, JFrog Artifactory, and Git.
  • Proficiency with IaC tools (Terraform, Ansible).
  • 5+ years of experience with cloud platforms (Azure, AWS, GCP).
  • Proficiency in containerization (Docker) and container orchestration (Kubernetes).
  • Experience with logging and monitoring tools (Nominal and Grafana).
  • Understanding of parallel computing frameworks (CUDA, OpenCL).
  • Strong collaboration skills and proficiency with collaborative tools (JIRA, Confluence).
  • Eligible to obtain and maintain an active U.S. Secret security clearance.

Nice To Haves

  • Master's or other advanced STEM degree
  • Technical expertise and demonstrated performance in one or more of the following areas: networking, cloud technologies, application development, hardware design, and/or cybersecurity
  • Minimum of 7 years of operations and engineering experience

Responsibilities

  • Cloud Deployment & Environment Management: Own and execute customer and developmental cloud deployments across TRS product lines, ensuring reliable configuration management, version control, and seamless promotion of releases from development through production environments.
  • Anduril Platform Services Integration: Evaluate, prototype, and integrate emerging platform capabilities (such as RDF and MissionSim) and/or third-party services (such as Arena AI and AFATDS/AXS) to improve data discoverability, consistency, and analytical capabilities across TRS systems.
  • Data Pipeline & Observability Infrastructure: Maintain and enhance existing data pipelines, metrics frameworks, and monitoring solutions including Grafana and Nominal; ensure high availability, data quality, and actionable insights for engineering and operations teams.
  • Field Support & Operational Testing: Collaborate directly with field operation teams during feature rollouts to conduct real-world testing, troubleshoot issues in operational environments, and gather actionable feedback that informs system improvements and supports mission success; enable customer self-serve provisioning of environments.
  • Cross-Product Line Expansion: Partner with leadership to establish integration engineering functions and best practices across all TRS product lines, developing reusable patterns, documentation, and tooling that accelerate deployment capabilities and operational maturity.

Benefits

  • Healthcare Benefits
      • US Roles: Comprehensive medical, dental, and vision plans at little to no cost to you.
      • UK & AUS Roles: We cover full cost of medical insurance premiums for you and your dependents.
      • IE Roles: We offer an annual contribution toward your private health insurance for you and your dependents.
  • Additional Benefits
      • Income Protection: Anduril covers life and disability insurance for all employees.
      • Generous time off: Highly competitive PTO plans with a holiday hiatus in December. Caregiver & Wellness Leave is available to care for family members, bond with a new baby, or address your own medical needs.
      • Family Planning & Parenting Support: Coverage for fertility treatments (e.g., IVF, preservation), adoption, and gestational carriers, along with resources to support you and your partner from planning to parenting.
      • Mental Health Resources: Access free mental health resources 24/7, including therapy and life coaching. Additional work-life services, such as legal and financial support, are also available.
      • Professional Development: Annual reimbursement for professional development
      • Commuter Benefits: Company-funded commuter benefits based on your region.
      • Relocation Assistance: Available depending on role eligibility.
  • Retirement Savings Plan
      • US Roles: Traditional 401(k), Roth, and after-tax (mega backdoor Roth) options.
      • UK & IE Roles: Pension plan with employer match.
      • AUS Roles: Superannuation plan.