Senior Systems Engineer (Linux/Cloud)

Peraton
San Antonio, TX
20d

About The Position

Join Peraton’s innovative Cloud Operations Team in San Antonio, TX! Are you ready to work on cutting-edge cloud technologies that power large-scale, data-intensive analytics? At Peraton, our Operations Team manages a sophisticated cloud platform built in Java and enhanced with free and open-source software such as Kubernetes, Hadoop, and Accumulo. This platform delivers high performance, scalability, and reliability, enabling critical insights for our clients every day. We’re looking for passionate problem solvers who thrive in dynamic environments and enjoy collaborating with talented teams to drive operational excellence.

Requirements

  • Experience working in Linux-based cloud environments with strong troubleshooting and operational expertise.
  • Familiarity with Java-based platforms and the ability to support software in production environments.
  • Hands-on experience with Kubernetes orchestration, container management, and cloud-native deployment.
  • Knowledge of big data technologies such as Hadoop and Accumulo, including cluster management and data operations.
  • Strong customer service mindset with the ability to communicate complex technical concepts clearly.
  • Self-motivated, detail-oriented, and capable of thriving independently and within collaborative teams.
  • Willingness to participate in an on-call rotation to support after-hours operations.
  • Bachelor’s degree in Systems Engineering, Computer Science, Information Systems, Engineering, or a related field—or equivalent experience (5 additional years of Systems Engineering may substitute for a degree).
  • 20 years of Systems Engineering experience on complex programs or contracts.
  • Proven ability to plan and lead Systems Engineering initiatives.
  • Active TS/SCI security clearance with current polygraph.

Nice To Haves

  • Experience supporting mission-critical applications and infrastructure.
  • Familiarity with containerization and orchestration tools (Docker, Kubernetes).
  • Experience with big data technologies (Hadoop).
  • Scripting skills in Python or Bash.

Responsibilities

  • Operate and maintain a state-of-the-art cloud-based analytics platform, ensuring stability, reliability, and peak performance.
  • Provide Tier 1 through Tier 3 technical support, including incident management, troubleshooting, root cause analysis, and problem resolution.
  • Collaborate with cross-functional teams to deploy, monitor, and manage Kubernetes clusters, Hadoop data lakes, and Accumulo databases.
  • Perform essential operational tasks such as system health checks, log analysis, patch management, and performance tuning in a Linux environment.
  • Serve as a key point of contact for customers, delivering responsive, high-quality support to meet SLAs and exceed expectations.
  • Proactively identify potential system issues, recommend improvements, and implement solutions to enhance platform reliability and efficiency.
  • Participate in an on-call rotation to provide 24/7 operational support, ensuring rapid response to critical incidents.
  • Document operational procedures, troubleshooting guides, and best practices to foster knowledge sharing across the team.
  • Stay current with emerging cloud, big data, and open-source technologies to continually innovate and improve our platform.