Server Platform SoC Validation Lead and Debug Engineer

Advanced Micro Devices, IncAustin, TX
3dHybrid

About The Position

At AMD, our mission is to build great products that accelerate next-generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you’ll discover the real differentiator is our culture. We push the limits of innovation to solve the world’s most important challenges—striving for execution excellence, while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond. Together, we advance your career. We are seeking a skilled and dedicated Server Platform SoC Validation lead and Debug Engineer to join our Server Platform Solution Engineering Group’s Debug team. In this role, you will work on cutting-edge server EPYC and AI Platforms to lead validation and debug to meet the program milestones. Your responsibilities will include collaborating with Platform architecture, design, validation, firmware and software engineering teams to resolve critical issues and ensure the highest quality standards. THE PERSON: The individual has a strong understanding of the general server platform validation and SoC debug process. A successful candidate must have good verbal, as well as written, communication skills.

Requirements

  • Strong understanding of Server platform components, x86 or other complex CPU architectures. Proficiency with Linux and/or Microsoft Operating Systems is a plus.
  • Deeper domain expertise in areas such as IO interfaces - PCIe, CXL, RAS, Power management to drive comprehensive system level test-plan execution
  • Understanding of BMC firmware and features, including IPMI, Redfish, sensor monitoring, power control, and remote management is a plus
  • Demonstrable experience in designing experiments to solve problems and strong analytical skills.
  • Prior experience with computer system design and/or validation, testing tools, and environments.
  • Experience with handling and taking captures using Oscilloscopes, protocol analyzers, and JTAG based Debug Tools.
  • Proficiency in C, Python, and shell scripting for low-level development and debug
  • Excellent organizational skills and the ability to prioritize multiple workstreams and meet tight deadlines.
  • Strong networking and relationship-building skills, with the ability to drive effective decision-making across various functions and levels within the organization.
  • BS or MS degree in Electrical Engineering or related major, with 10+ years of applicable experience

Nice To Haves

  • Knowledge of pre-silicon environments (Verification, Emulation, Virtual Bring-Up) is a plus.
  • Proficiency with Linux and/or Microsoft Operating Systems is a plus.
  • Understanding of BMC firmware and features, including IPMI, Redfish, sensor monitoring, power control, and remote management is a plus

Responsibilities

  • Lead system validation plans for EPYC and AI platforms to ensure alignment with program milestone criteria, leveraging strong expertise in domains such as x86 architecture, power management, high‑speed data‑center I/O (PCIe, CXL, etc.), RAS features to drive test execution and resolve issues efficiently.
  • Collaborate with partner organizations to provide root cause analysis for platform issues in a Data center environment. The debug role is expected to provide root cause analysis for platform level, SoC logical, performance and BIOS/firmware issues
  • Improve debug capabilities and methodology over time by identifying common challenges or impediments to efficient debug and working with partner organizations like design, Firmware and software teams to drive innovation in silicon architecture, design, tools and methods.
  • Manage and track technical issues, risks, and priorities effectively with the business unit and SW Debug tools teams. Manage customer and executive communications, including program status, risks and opportunities.
  • Maintain strong communication skills, both verbal and written, to convey summary findings and recommendations to senior management.

Benefits

  • AMD benefits at a glance.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service