About The Position

Markon’s partner company is seeking a High-Performance Computing (HPC) Software Engineer to design, develop, optimize, and maintain scalable parallel and distributed systems operating in advanced HPC environments for the Fort Meade Customer. This role requires deep expertise in parallel programming, performance optimization, Linux systems, and large-scale HPC architectures. The engineer will serve as a technical leader, helping drive architectural decisions, implementing high-performance solutions, and mentoring team members in advanced HPC methodologies.

Requirements

  • Active TS/SCI w/ Polygraph
  • Master’s degree in Computer Science or related field + 5 years of relevant experience or Bachelor’s degree in Computer Science or related field + 7 years of relevant experience
  • 9 years of relevant Software Engineering experience
  • Experience working with Linux CLI and Linux-based tools
  • Experience developing Bash scripts to automate processes
  • Recent software development experience with C/C++ and Python
  • Strong experience with parallel programming models such as MPI, OpenMP, and CUDA
  • Deep understanding of: Multi-threading and concurrency Memory hierarchy and cache optimization NUMA architectures Vectorization and SIMD optimization
  • Experience implementing and maintaining parallel and distributed algorithms optimized for performance and scalability
  • Subject matter expertise in parallel computing strategies and statistical modeling
  • Experience installing, configuring, optimizing, and supporting COTS/GOTS/FOSS software and libraries in Linux environments
  • Strong problem-solving, collaboration, and communication skills
  • Experience using Git version control

Nice To Haves

  • Experience with SQL databases such as MySQL, MariaDB, or PostgreSQL
  • Experience with distributed data processing frameworks, such as Apache Spark
  • Experience working with HPC job schedulers, including Slurm
  • Experience with containerization technologies such as Docker, containerd, or Podman
  • Experience using the Atlassian tool suite, including JIRA and Confluence

Responsibilities

  • Design, develop, and maintain high-performance parallel and distributed software systems
  • Optimize applications for performance across CPU, GPU, memory, storage, and network layers
  • Implement scalable algorithms for large-scale HPC environments
  • Develop automation scripts to streamline system operations and development workflows
  • Install, configure, and support COTS, GOTS, and FOSS software packages and libraries in Linux environments
  • Analyze and optimize application performance using advanced HPC techniques
  • Provide technical leadership and mentorship on parallel computing strategies and HPC best practices
  • Communicate complex HPC concepts and solutions to both technical and non-technical stakeholders

Benefits

  • Medical
  • Dental
  • Vision
  • Life Insurance
  • Short-Term Disability
  • Long-Term Disability
  • 401(k) match
  • Flexible Spending Accounts
  • EAP
  • Training and Tuition Assistance
  • Paid Time Off
  • Holidays
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service