About The Position

The IC2 Site Reliability Engineer in our OCI Sovereign Cloud team supports daily operations for a secure, large-scale OCI-based cloud environment powering mission-critical federal government workloads. This entry-level position focuses on maintaining and supporting existing infrastructure, implementing incremental improvements, and ensuring operational health and compliance. Working within a Linux-centric environment, you will leverage scripting and basic automation to manage deployments, perform fleet maintenance, and maintain system health under the supervision and guidance of senior engineers. Only Oracle brings together the data, infrastructure, applications, and expertise to power everything from industry innovations to life-saving care. And with AI embedded across our products and services, we help customers turn that promise into a better future for all. Discover your potential at a company leading the way in AI and cloud solutions that impact billions of lives. True innovation starts when everyone is empowered to contribute. That’s why we’re committed to growing a workforce that promotes opportunities for all with competitive benefits that support our people with flexible medical, life insurance, and retirement options. We also encourage employees to give back to their communities through our volunteer programs. We’re committed to including people with disabilities at all stages of the employment process. If you require accessibility assistance or accommodation for a disability at any point, let us know by emailing accommodation-request_mb@oracle.com or by calling 1-888-404-2494 in the United States. Oracle is an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability and protected veterans’ status, or any other characteristic protected by law. Oracle will consider for employment qualified applicants with arrest and conviction records pursuant to applicable law.

Responsibilities

  • Perform routine operational tasks such as deployments, patching, fleet maintenance, and basic troubleshooting for cloud-based systems.
  • Tune team-specific alarms and thresholds, escalate incidents appropriately, and support the management of metrics, KPIs, and system health dashboards.
  • Participate in incident response by quickly triaging and escalating incidents, executing operational playbooks, and documenting issues for senior review. You will follow established procedures under supervision and contribute to root-cause analysis by gathering data and providing initial troubleshooting support.
  • Serve as a technical support point of contact, troubleshooting and resolving technical issues, assisting customers with environment setup and debugging, and providing timely communication and status updates to customers and internal teams.
  • Own, maintain, and improve runbooks to ensure consistency and clarity for operational processes.
  • Implement defined enhancements to existing tools, documentation, and monitoring solutions.
  • Collaborate closely with other team members and escalate complex issues for further investigation and resolution.
  • Participate in on-call rotations with support from senior engineers, ensuring continuity of coverage and timely response.
  • Ensure compliance with all security, operational, and documentation standards.

Benefits

  • flexible medical
  • life insurance
  • retirement options
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service