This role supports operations within a cloud-based analytics platform built with Java and open-source technologies such as Kubernetes, Hadoop, and Accumulo. The platform runs large-scale, data-intensive analytics on managed infrastructure that supports mission-critical workloads.

As a member of the Operations Team, the candidate will help ensure day-to-day platform stability, provide responsive customer support, and perform technical troubleshooting and system repair across the environment. The role requires strong operational awareness, attention to detail, and the ability to work effectively in a fast-paced, collaborative team while proactively addressing system issues. The technologies involved will vary with evolving customer requirements and mission needs. This position includes participation in an on-call rotation and requires the ability to provide Tier 1–3 operational support.

A strong background in troubleshooting Linux-based systems is essential. Experience with containerization, orchestration, and automation technologies, such as Docker, Kubernetes, and scripting with Python or Bash, is highly beneficial. Additional valuable experience includes familiarity with monitoring and observability tools such as Prometheus and Grafana, issue tracking with JIRA, distributed storage technologies such as Hadoop Distributed File System (HDFS), virtualization platforms, configuration management tools such as Salt or Ansible, and cloud platforms including OpenStack and AWS.
Job Type: Full-time
Career Level: Mid Level
Number of Employees: 5,001-10,000 employees