Sr Site Reliability Engineer – Cloud Platform

Toyota North America
Plano, TX

About The Position

Overview

Who we are

Collaborative. Respectful. A place to dream and do. These are just a few words that describe what life is like at Toyota. As one of the world’s most admired brands, Toyota is growing and leading the future of mobility through innovative, high-quality solutions designed to enhance lives and delight those we serve. We’re looking for talented team members who want to Dream. Do. Grow. with us.

An important part of the Toyota family is Toyota Financial Services (TFS), the finance and insurance brand for Toyota and Lexus in North America. While TFS is a separate business entity, it is an essential part of this world-changing company, delivering on Toyota's vision to move people beyond what's possible. At TFS, you will help create a best-in-class customer experience in an innovative, collaborative environment.

To save you time when applying, please note that Toyota does not offer sponsorship of job applicants for employment-based visas or any other work authorization for this position at this time.

Who we’re looking for

Toyota Financial Services is seeking a skilled and hands-on Site Reliability Engineer, Sr. – Cloud Platform to help scale and support the reliability, automation, and observability of our AWS infrastructure. In this role, you'll work closely with the Cloud Platform Development, Production Engineering, and Incident Management teams to ensure our systems are resilient, self-healing, and ready for business-critical operations.

This role is ideal for someone who brings deep experience in cloud infrastructure and SRE best practices, enjoys solving complex reliability challenges, and is passionate about automation and continuous improvement.

Requirements

  • Experience in SRE, DevOps, or Cloud Infrastructure roles
  • Solid understanding of SRE principles: SLIs, SLOs, error budgets, incident response
  • Hands-on experience with AWS services such as EKS, Lambda, Cloud WAN, EC2, S3, RDS, Redshift, and Systems Manager
  • Strong knowledge of network architecture and protocols within AWS
  • Experience building automated remediation and self-healing systems
  • Proficiency with Terraform, Python, Bash, and infrastructure as code principles
  • Experience with CI/CD tools (GitHub, Harness) and observability platforms (Dynatrace, CloudWatch)
  • Familiarity with ITSM processes and cloud security best practices
  • Excellent troubleshooting, problem-solving, and collaboration skills
  • Ability to work independently and within a cross-functional team environment

Nice To Haves

  • Bachelor’s degree in Information Technology or related field
  • AWS Certifications (e.g., DevOps Engineer, Solutions Architect)
  • Experience with integration tools like MuleSoft, Apache Camel, or message streaming platforms

Responsibilities

  • Operate and optimize cloud-native infrastructure in AWS, with a focus on EKS, Lambda, Cloud WAN, Systems Manager, and ECR
  • Build and maintain self-healing automation workflows to reduce manual toil and improve uptime
  • Create and manage AWS Systems Manager (SSM) Automation Documents for operational efficiency
  • Define and track SLIs/SLOs and error budgets to improve system reliability
  • Implement observability using Dynatrace and AWS-native tools (e.g., CloudWatch)
  • Develop and maintain infrastructure as code using Terraform for repeatable, scalable deployments
  • Enhance and support CI/CD pipelines using GitHub and Harness
  • Participate in incident management and on-call rotations, and lead blameless postmortems
  • Collaborate with cloud development teams to improve architecture, delivery, and system performance
  • Troubleshoot cloud infrastructure and networking issues and perform root cause analysis (RCA)
  • Continuously identify opportunities to improve reliability, performance, and operational processes

Benefits

  • A work environment built on teamwork, flexibility, and respect
  • Professional growth and development programs to help advance your career, as well as tuition reimbursement
  • Team Member Vehicle Purchase Discount
  • Toyota Team Member Lease Vehicle Program (if applicable)
  • Comprehensive health care and wellness plans for your entire family
  • Toyota 401(k) Savings Plan featuring a company match, as well as an annual retirement contribution from Toyota regardless of whether you contribute
  • Paid holidays and paid time off
  • Referral services for prenatal care, adoption, childcare, schools, and more
  • Relocation assistance (if applicable)