Infrastructure Software Engineer

NebiusLos Angeles, CA
17h$150,000 - $210,000Remote

About The Position

Nebius is leading a new era in cloud computing to serve the global AI economy. We create the tools and resources our customers need to solve real-world challenges and transform industries, without massive infrastructure costs or the need to build large in-house AI/ML teams. Our employees work at the cutting edge of AI cloud infrastructure alongside some of the most experienced and innovative leaders and engineers in the field. Nebius operates large-scale, mission-critical bare-metal infrastructure. As a Software Engineer (Python), you will design and build systems that provision, configure, test, and manage physical hardware at scale. Your work will sit close to the metal—interfacing directly with servers, networks, and management controllers—while supporting highly automated, reliable infrastructure operations. You will collaborate closely with hardware, networking, and data center operations teams to ensure our platforms are robust, scalable, and production ready.

Requirements

  • Strong professional experience as a software engineer, with a focus on Python
  • Solid experience with Linux systems and shell scripting
  • Hands-on experience working with bare-metal servers or low-level infrastructure
  • Strong understanding of networking fundamentals (IPv4/IPv6, DHCP, DNS, PXE / network boot)
  • Experience interacting with hardware management interfaces (BMC, IPMI-like protocols, HTTP APIs)
  • Familiarity with CI/CD systems and production deployment workflows
  • Experience designing or working with NoSQL databases
  • Ability to debug complex issues spanning software, hardware, and networks
  • Strong ownership mindset and clear communication skills in a distributed team

Nice To Haves

  • Experience operating or building systems for large-scale infrastructure
  • Familiarity with ARM-based platforms in production environments
  • Background in hardware testing, validation, or factory provisioning
  • Experience with infrastructure automation or internal platform tooling
  • Contributions to open-source or internal systems software projects

Responsibilities

  • Design and develop backend services and automation in Python
  • Build and maintain systems for hardware provisioning, testing, and lifecycle management
  • Develop software that runs directly on bare-metal environments
  • Integrate with Linux systems, using Bash and low-level tooling where needed
  • Implement and maintain CI/CD pipelines for infrastructure-focused software
  • Work with networking services including IPv4/IPv6, DHCP, DNS, network boot, and server boot workflows
  • Interface with BMC controllers and management protocols (IPMI-style protocols, HTTP-based standards)
  • Enable reliable hardware interaction and automation across large fleets
  • Support ARM64 / ARM64EC architectures
  • Design and integrate NoSQL data stores for system state and orchestration data
  • Write clear documentation and contribute to operational excellence

Benefits

  • Health insurance: 100% company-paid medical, dental, and vision coverage for employees and families
  • 401(k) plan: up to 4% company match with immediate vesting
  • Parental leave: 20 weeks paid for primary caregivers, 12 weeks for secondary caregivers
  • Remote work reimbursement: up to $85/month for mobile and internet
  • Disability & life insurance: company-paid short-term, long-term, and life insurance coverage
  • Competitive salary and comprehensive benefits package.
  • Opportunities for professional growth within Nebius.
  • Flexible working arrangements.
  • A dynamic and collaborative work environment that values initiative and innovation.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service