About The Position

The Amazon Web Services (AWS) Hardware Engineering Services (HWEngS) team creates compute, storage, accelerator, and enterprise servers for Amazon’s innovative web services. We are seeking an experienced SysDE to collaborate with Server Hardware Engineers and complete the system design and software/script development work required to build next-generation servers and HW components. Our designs are industry-leading in performance, frugality, and operational excellence, and are critical to the success of the AWS business and the more than one million customers who use AWS today. Our SysDEs solve challenging technology problems and build architecturally sound, high-quality servers and components to enable AWS to realize critical business strategies. Our success depends on our highly respected server infrastructure; we’re handling massive scale and rapid integration of emergent technologies.
As a member of the Specialized Platforms and Servers team, you’ll be responsible for integrating hardware/software components to build our servers, working with service teams to define products, and supporting operations in every location where we have servers. You will interact with engineers across the company and work with an interdisciplinary team to execute product designs from concept to production, including design, development, validation, and deployment of servers at scale. You will solve design challenges across many disciplines, including server firmware/software, integration into our software service control plane, manufacturing, and operations. You will deliver test plan coverage criteria and monitor plans used to build our servers and components. You will work closely with internal teams to prototype concept designs, bring up systems, and ensure our designs are of the highest quality. You will use sound judgment to solve problems individually or guide others when help is needed.
This is a fast-paced, challenging position, and you’ll work with thought leaders in multiple technology areas. You’ll have high standards for yourself and everyone you work with, and you’ll be constantly looking for ways to improve your products’ performance, quality, and cost. We’re changing an industry, and we want team members who are ready for this challenge and want to reach beyond what is possible today. Public cloud IT services represent the majority of growth in the overall IT services market and will continue to do so for several years to come. The scale of AWS, combined with an understanding of how our hardware is used, creates a unique opportunity for hardware customizations that will directly benefit AWS customers. You will work directly with engineers across the company to build next-generation hardware. You will have a direct impact on our bottom line and the ability to deliver improvements for our customers. You will be part of a growing, fast-paced, and fun team. You will have ownership of the implementation of your work.

Requirements

  • 4+ years of non-internship professional software development experience
  • 3+ years of experience designing or architecting new and existing systems (design patterns, reliability, and scaling)
  • Knowledge of systems engineering fundamentals (networking, storage, operating systems)
  • Experience programming with at least one modern language such as C++, C#, Java, Python, Golang, PowerShell, Ruby
  • 3+ years of experience deploying and operating in a Linux/Unix environment
  • 2+ years of experience in systems design, software development, operations, automation, and process improvement
  • Experience in computer architecture, or experience with general troubleshooting/debugging of hardware
  • Experience handling ambiguous or undefined challenges through strong problem-solving abilities
  • BS degree in computer science, computer engineering, or related field, or 2+ years of technical work experience
  • Experience with modern technology devices in storage, networking, and memory, as well as a variety of interface standards and protocols (I2C, IPMI, SPI, PCIe)

Nice To Haves

  • Experience working in an Agile environment using the Scrum methodology
  • 4+ years of systems development experience
  • 5+ years of integration, testing and automation experience
  • Experience with network protocols such as DNS, DHCP, and TCP, or experience with Linux, networking protocols, web-based applications, and HTTP
  • Strong written and verbal communication skills for technical and non-technical audiences, including senior leadership
  • 4+ years of server systems debug experience, including debugging and root-causing issues on complex server platforms
  • 4+ years of experience contributing towards increasing durability, security, availability and scalability of systems through exploration, diagnosis and remediation
  • Experience developing, deploying, and owning cloud applications
  • Familiarity with AWS and EC2, ideally with hands-on experience
  • Ability to dive deep to analyze complex issues, solve problems, and automate repetitive tasks
  • Experience in developing functional design specifications, validation plans and functional test procedures
  • Experience with server technologies: BIOS, BMC, signal integrity, memory, storage, networking, PCIe, and thermal

Responsibilities

  • As a SysDE, you will be part of a team that solves systemic issues, drives changes back into development, and builds mechanisms to scale and efficiently operate our infrastructure.
  • As a member of the AWS HWEngS team, you will work with other subject matter experts in compute, memory, and storage technologies to develop and deliver the best customer experience in cloud computing.
  • Your day-to-day responsibilities will include leading the HWEngS System Development effort to define and build software and enabling tools, according to defined HWEngS software development best practices.
  • You will be responsible for solving operational challenges in our existing fleet, with the goal of improving the current customer experience as well as developing improved systems for future designs.
  • You will interface with internal HWEngS teams to ensure the chosen HW and FW deliver the performance, reliability, and operational health needed by the EC2 Specialized Platforms and Servers.
  • You will build, manage, and deploy pipelines for rapid deployment of new code changes to EC2 Specialized Platforms and Servers.
  • You will build monitoring tools and metrics to ensure hardware is running properly in both test and production environments.

Benefits

  • health insurance (medical, dental, vision, prescription, Basic Life & AD&D insurance with an option for supplemental life plans, EAP, mental health support, medical advice line, Flexible Spending Accounts, and adoption and surrogacy reimbursement coverage)
  • 401(k) matching
  • paid time off
  • parental leave