We are seeking an engineer to join our hardware management team. This team is responsible for the provisioning, monitoring, and support for thousands of servers supporting dozens of teams within Bloomberg, including the entire AI stack! The ideal candidate will have experience in designing, implementing, and maintaining system software that enables communication between GPUS, CPUs, and storage in scale-out AI and HPC systems. This role will also be responsible for overseeing the ongoing monitoring, support, and maintenance of our HPC/AI clusters, ensuring peak performance and reliability.