Principal Software Development Engineer

Expedia GroupSan Jose, CA
1d$249,000 - $348,500

About The Position

Expedia Group brands power global travel for everyone, everywhere. We design cutting-edge tech to make travel smoother and more memorable, and we create groundbreaking solutions for our partners. Our diverse, vibrant, and welcoming community is essential in driving our success. Why Join Us? To shape the future of travel, people must come first. Guided by our Values and Leadership Agreements, we foster an open culture where everyone belongs, differences are celebrated and know that when one of us wins, we all win. We provide a full benefits package, including exciting travel perks, generous time-off, parental leave, a flexible work model (with some pretty cool offices), and career development resources, all to fuel our employees' passion for travel and ensure a rewarding career journey. We’re building a more open world. Join us. Principal Software Development Engineer Our Technology Team partners with teams across Expedia Group to create innovative products, services, and tools to deliver high-quality experiences for travelers, partners, and our employees. A singular technology platform powered by cloud and data provides secure, differentiated, and personalized experiences that drive loyalty and traveler satisfaction. We are looking for a Principal Engineer to serve as the technical architect for our Cloud Platform organization which sits within our Technology division. As a Principal Engineer reporting to the VP of Cloud Platform, you will be the primary architect of our technical future. The Cloud Platform organization provides the secure, scalable cloud infrastructure, runtime platforms, and developer experience tooling that enable teams across Expedia Group to build, deploy, and operate high-quality, resilient software quickly and safely. We are seeing an explosion in code volume and service complexity. The goal for this role is to build a platform that can handle this growth without sacrificing reliability or skyrocketing our cloud bill. You’ll be responsible for making sure our architecture is composable, our developer tools are agentic, our Kubernetes footprint is efficient, and our observability stack provides signals, not just noise.

Requirements

  • Extensive professional software development experience designing, building, and operating large-scale, cloud-native distributed systems and platform services on Kubernetes.
  • Proven ownership of critical services or multi-service platforms, including responsibility for system design (LLD), API design, data modeling, deployment, and ongoing operational health.
  • Deep expertise with at least one major public cloud provider and core platform technologies (compute, networking, storage, service discovery, security, observability, and CI/CD).
  • Demonstrated ability to make high-impact architectural decisions, navigate complex trade-offs, and guide multiple teams toward coherent, long-term technical direction.
  • Familiarity with AI-driven systems, tools, or workflows and applying AI/ML concepts to real world products within cloud or platform environments.
  • Deep knowledge of observability patterns (OpenTelemetry, Prometheus, distributed tracing).
  • Expert-level understanding of Infrastructure as Code (Terraform, Pulumi) and CI/CD at scale.
  • Proficiency in Go, Rust, or similar languages used in modern platform engineering.

Nice To Haves

  • Track record of defining and evolving multi-year technical strategies for cloud and developer platform ecosystems, and successfully driving adoption of shared platforms across many teams.
  • Experience designing and operating highly available, globally distributed systems at internet scale, including capacity planning, performance optimization, and robust failure handling.
  • Safely integrates and operates AI/ML‑enabled solutions that improve outcomes, such as intelligent routing, predictive scaling, or automated remediation embedded in platform services, with appropriate safeguards.
  • Advanced experience applying AI/ML techniques to cloud and platform problems (for example, cost optimization, anomaly detection, or performance tuning) and partnering with data/ML teams to productionize these capabilities.
  • A Systems Architect: You understand the deep plumbing of the cloud (AWS/GCP, K8s, networking). You think in terms of failure domains, latencies, and unit economics.
  • Reliability-First: You’ve carried a pager for global-scale systems. You have a healthy "paranoia" about state, consistency, and cascading failures.
  • Hands-on: You still love to build. You can prototype a complex infrastructure change in a weekend to prove it works, but you have the discipline to ensure it's production-grade before it ships.

Responsibilities

  • Lead Architectural Evolution: You’ll own the move toward a Cell-Based Architecture. We need to move away from fragile, monolithic clusters and toward isolated, predictable failure domains that allow us to scale horizontally with confidence.
  • Modernize Kubernetes & Infrastructure: You’ll define our K8s strategy, focusing on multi-cluster management, service mesh, and automated scaling. You need to ensure our "Golden Path" makes it easy for engineers to do the right thing by default.
  • Hardened Reliability & Observability: You will set the standards for SRE across the org. This means moving beyond basic dashboards to causal observability, automated incident response, and rigorous SLO/SLI management. You’ll help us engineer out the root causes of systemic instability.
  • Optimize Cloud Economics: You’ll lead our FinOps technical strategy. You need to build the tooling and visibility that allows us to understand cost-per-service and ensures our infrastructure spend is directly tied to business value.
  • Support the Developer Workflow: While we are embracing AI tools, your job is to build the underlying "agent-friendly" infrastructure. This includes standardized Dev Containers and ephemeral environments that allow for fast, isolated iteration without clobbering shared state.

Benefits

  • full benefits package, including exciting travel perks, generous time-off, parental leave, a flexible work model (with some pretty cool offices), and career development resources
  • medical/dental/vision
  • paid time off
  • Employee Assistance Program
  • wellness & travel reimbursement
  • travel discounts
  • International Airlines Travel Agent (IATAN) membership
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service