Cloud Site Reliability Engineer (SRE) for Internal Cloud. Maintain services once they are live by measuring and monitoring availability, latency and overall system health. Troubleshoot issues across the entire stack: hardware, software, application and network Perform deep dives into both systemic and latent reliability issues; partner with engineering and operation teams across the organization to produce and roll out fixes. Drive standardization efforts across multiple disciplines and services in conjunction with embedded SREs throughout the organization. Identify and drive opportunities to improve automation for the cloud services Scope and create automation for deployment, management and visibility of our services Troubleshoot issues across the entire stack: hardware, software, application and network Perform deep dives into both systemic and latent reliability issues; partner with engineering and operation teams across the organization to produce and roll out fixes. Identify and drive opportunities to improve automation for the cloud services
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Career Level
Entry Level
Education Level
No Education Listed