Within the CoreAI organization, the Azure Managed Redis team is responsible for delivering a fully managed, enterprise-grade Redis experience to customers worldwide. We partner closely with Redis Inc. to bring the latest Redis Enterprise capabilities to Azure, and we operate one of the largest Redis fleets on the planet. Our dataplane infrastructure team owns the critical systems that deploy, upgrade, monitor, and heal Redis clusters at massive scale — ensuring five-9's availability for workloads that demand sub-millisecond latency. We are seeking a Senior Software Engineer who is an innovator and technical leader, excels in large-scale distributed systems, navigates ambiguity with confidence, and drives end-to-end ownership. You should be an early adopter of AI-powered development tools such as GitHub Copilot and actively explore how Large Language Models and ML techniques can enhance engineering effectiveness, operational intelligence, and service reliability. In this role, you will shape the future of a global cloud service — from the deployment pipelines that push bits to 70+ regions, to the AI-powered observability systems that keep the fleet healthy. This is a high-impact opportunity to define the next generation of cloud dataplane infrastructure, influence technical strategy for a flagship Azure service, and mentor engineers while contributing code at scale. You will work closely with teams across Azure Compute, Networking, and Storage, as well as with Redis Inc., to build a world-class managed data platform. Microsoft's mission is to empower every person and every organization on the planet to achieve more, and we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive. Build the platform that powers hundreds of thousands of cloud applications worldwide. Azure Managed Redis (AMR) is a critical foundation for modern, low-latency applications — from real-time AI inference caches to globally distributed session stores and leaderboards. We are entering a pivotal transformation: significantly expanding the service footprint, modernizing the orchestration and deployment platform, and accelerating release velocity — all while raising the bar for reliability and quality. If you're excited about building planet-scale infrastructure, pushing boundaries with AI-driven engineering and operations, and working at the intersection of open-source innovation and cloud-native engineering, this is your opportunity to make a global impact. Lead architecture, design, and technical direction for the runtime infrastructure of Azure Managed Redis, one of Azure's fastest-growing data services. Design and build robust deployment, upgrade, and orchestration systems for distributed Redis clusters running across 70+ Azure regions worldwide. Drive the integration of Redis Enterprise software from Redis Inc. into Azure's managed environment, enabling seamless cluster upgrades while preserving high availability, data consistency, and performance under mission-critical workloads. Architect the end-to-end release platform across infrastructure, application, and observability layers to accelerate release velocity and improve deployment safety. Write high-quality, production-grade code in languages such as C# and Go while modeling best practices for reliability, testability, and operational excellence. Lead design and review of testing strategies with strong coverage across unit, integration, and end-to-end system tests for distributed infrastructure. Apply AI techniques (e.g., GitHub Copilot, LLMs, anomaly-detection models) to enhance DevOps, operations, and engineering workflows — detecting, diagnosing, and responding to issues across the global fleet. Build and improve observability, monitoring, and diagnostics tooling to enable rapid incident detection, root-cause analysis, and self-healing capabilities at scale. Mentor engineers on distributed systems design patterns, operational best practices, and verification strategies. Set high standards for code reviews, diagnosability, and maintainability across the team. Identify and fill gaps in deployment, orchestration, and operational tooling by building or adopting scalable and reusable solutions. Stay current with new technologies, industry trends in cloud infrastructure and data platforms, and share knowledge across the team. Lead cross-functional planning, estimation, and execution of high-impact technical initiatives with partner teams across Azure. Engage directly with internal customers, partner teams (e.g., Azure Compute, Networking, Redis Inc.), and on-call operations to align platform capabilities with service reliability and accelerate growth.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Mid Level