As a Site Reliability Engineer (SRE) for GDMS's Space and Intelligence Systems line of business, you will be a member of a cross functional team responsible for maintaining survivability and reliability of mission critical resources. SREs monitor high priority systems and automate recovery mechanisms to ensure they remain operational for the warfighter. We encourage you to apply if you have any of these skills or experiences: Ensuring Uptime of Critical Systems (Incident Response / Triage) Automating Systems Administration Activities (Bash / Python / Ansible are preferred) Monitoring, and Troubleshooting Enterprise Services (Prometheus, Grafana, Splunk) Configuring Enterprise Services (Ansible, YAML, JSON) Developing recovery procedures for large systems (Backup and Restore, Blue/Green Deployment).