Oracleposted 18 days ago
Senior
Santa Clara, CA
Publishing Industries

About the position

Oracle Cloud Infrastructure (OCI) is looking for a Senior Software Engineer to lead the development of scalable, resilient, and secure infrastructure systems that underpin the core of OCI's compute platform. This role sits within the Host Provisioning Services (HoPS) team, which owns the critical infrastructure responsible for automating the full server lifecycle from rack integration and hardware bring-up to customer-ready instance provisioning and firmware management. HoPS services operate at the intersection of bare metal hardware and full-stack orchestration frameworks. They interface directly with components like BMCs, NICs, SmartNICs, ILOMs, GPUs, and custom firmware stacks. The team builds microservices and tooling that provision, configure, secure, and validate server platforms across OCI's global fleet. As a Senior Software Engineer, you will design and deliver highly available services and automation pipelines that manage server provisioning at hyperscale, enable firmware pinning for deterministic customer environments, and deliver fleet-wide firmware updates and telemetry-based observability. You'll drive solutions to support new silicon (e.g., NVIDIA, AMD, Intel platforms), SmartNIC/HostNIC convergence, RoT security integration, and the evolution of OCI's infrastructure into next-gen clusters and composable hardware environments. You will partner closely with teams across Compute, Networking, Security, Datacenter Engineering, and Hardware Development to ensure OCI can launch, scale, and maintain new server platforms with minimal operational overhead and high reliability. This role is ideal for experienced systems engineers with a deep understanding of operating systems, hardware-software integration, distributed services, and cloud-scale automation.

Responsibilities

  • Lead the development of scalable, resilient, and secure infrastructure systems for OCI's compute platform.
  • Automate the full server lifecycle from rack integration and hardware bring-up to customer-ready instance provisioning and firmware management.
  • Design and deliver highly available services and automation pipelines for server provisioning at hyperscale.
  • Enable firmware pinning for deterministic customer environments.
  • Deliver fleet-wide firmware updates and telemetry-based observability.
  • Drive solutions to support new silicon platforms and SmartNIC/HostNIC convergence.
  • Integrate RoT security and evolve OCI's infrastructure into next-gen clusters and composable hardware environments.
  • Collaborate with teams across Compute, Networking, Security, Datacenter Engineering, and Hardware Development.

Requirements

  • Deep understanding of operating systems.
  • Experience in hardware-software integration.
  • Knowledge of distributed services.
  • Expertise in cloud-scale automation.

Benefits

  • Flexible medical options.
  • Life insurance options.
  • Retirement options.
  • Volunteer programs.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service