Site Reliability Engineer

StriveworksAustin, TX
2d$110,000 - $128,000Hybrid

About The Position

Build, Deploy, and Maintain AI for an Unpredictable World Striveworks helps organizations harness the power of artificial intelligence to solve real-world national security and business challenges by serving as the command center between data, models, and business outcomes. Founded by data scientists and engineers, Striveworks set out to make the journey from deployment to ongoing optimization simple and effective. With Striveworks, organizations aren’t just deploying AI—they’re building systems that remain reliable, adaptable, and ready to scale in an unpredictable world. Mission-critical operations require models that perform where they’re deployed, scale as workloads grow, and adapt rapidly as AI capabilities advance. Striveworks meets these demands, increasing reliability and performance while lowering costs—and enabling confident, data-driven decision-making in dynamic environments. The Role As a Site Reliability Engineer at Striveworks, you’ll be challenged—and trusted—on day one to implement and manage all corporate systems. You’ll be exposed to, and gain proficiency with, a wide array of systems and infrastructure automation tools, and you will be given the opportunity to build and/or incorporate additional tools. You’ll be called on to develop solutions that prevent problems from reoccurring in the future, instead of simply mitigating the issue for today. You’ll be highly encouraged to automate solutions to reduce or eliminate “toil.”

Requirements

  • 4+ years of experience in any IT-related field
  • Experience deploying infrastructure in a cloud environment such as AWS, Azure, GCP, or OpenStack
  • Experience with virtualization and/or containerization solutions (e.g., OpenStack, Kubernetes, Docker, VMware, KVM, or Hyper-V)
  • Experience with Ansible or another configuration management solution (e.g., Chef, Puppet, or Salt)
  • Programming experience in Python or other programming/scripting languages (e.g., Bash, PowerShell, Go, Java, or JavaScript)
  • Due to the nature of this role, candidates must be a US person (a US citizen, a US national, or a Green Card holder)

Nice To Haves

  • Experience with automation and infrastructure as code, DevSecOps, CI/CD pipelines, or automated security scanning (Windows and Linux)
  • Understanding of US federal information system security policies, including Security Technical Implementation Guides (STIGs), NIST SP 800-171, NIST SP 800-53, NIST RMF, and CMMC
  • Experience with network technologies (e.g., VLANs, switches, routers, firewalls, and VPN)
  • Experience working with GPUs for compute workloads
  • Experience maintaining distributed/clustered systems

Responsibilities

  • Maintaining and developing infrastructure (as code) within both private (OpenStack) and commercial (AWS, Azure, GCP) cloud environments
  • Maintaining and developing configuration management automation for Windows laptops and Linux servers
  • Providing user support for all corporate systems

Benefits

  • Medical/dental/vision insurance
  • Voluntary life, long-term disability, accident, and hospital indemnity insurance
  • HSA and FSA (including dependent care FSA) plans
  • 401(k) plan
  • Unlimited PTO
  • Paid parental leave
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service