Be at the forefront of our Microsoft 365 Resilience efforts, by leading the development and architecture of our most critical monitoring and alerting. Identifying critical paths in highest priority scenarios including Copilot and develop and work with service teams to build robust reliability measures including Graceful Degradation and failure modes. As a Principal Software Engineer, you will transform and evolve how our critical paths are monitored, measured and designed reliably. You’ll work directly on the probes, monitoring and alerting that orchestrates the most critical paths across Microsoft 365. Empowering Microsoft’s M365 Core Platform and Copilot teams to measure reliability and monitor service health with rigor. This opportunity will allow you to dive deep on Microsoft’s M365 Core Platform, technologies, and rapidly grow your career. The M365 Foundation team is a core pillar within Microsoft’s M365 Core Platform and Services organization, responsible for ensuring the reliability, resilience, performance, and scalability of the platform that underpins Microsoft 365 services. The team drives strategic investments across AI Evaluations, Performance & Efficiency, Change Management, Reliability & Resilience, Observability & Intelligent Cloud, and Fleet & Capacity, with an emphasis on trust, security, and operational excellence. Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees, we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Mid Level