IT Operations Sr. Manager

EnCharge AISanta Clara, CA
1d$180,000 - $210,000Onsite

About The Position

The IT Operations Sr. Manager will take full ownership of the company’s IT and DevOps disciplines. The position will be responsible for managing all CPU and GPU server/compute infrastructure and will be responsible for managing all system administration, system integration, network administration, and system/network security. The individual will be required to oversee the growth and scale-out of the company’s compute infrastructure as the company evolves. The IT Operations Sr. Manager will be responsible for supporting the company’s Hardware and Software development teams, ensuring smooth and efficient execution is enabled and maintained across the development teams. The IT Operations Sr. Manager will report to the COO and is based out of our Santa Clara, CA office.

Requirements

  • 5+ years’ experience in IT/System Administration
  • Experience with the latest Cybersecurity measures and standards.
  • Experience building out companywide IT, compute server and GPU infrastructure.
  • Experience in Linux server architecture, configuration, administration, maintenance, and package management.
  • Familiarity with different operating systems (Redhat, Ubuntu, etc)
  • Experience with NFS/SMB file protocols and how they operate in Mac, Windows, and Linux environments.
  • Experience with each of the following: Scripting in Python or bash, CI/CD platforms (e.g. GitHub Actions, Gitlab CI, Jenkins)
  • CI/CD pipeline expertise (pipeline creation and maintenance, including security scanning).
  • Docker expertise (Docker image creation and security scanning).
  • Experience with GPU drivers and GPU servers
  • Experience with networking particularly with up/down b/w
  • Experience with ipmi and related remote management of servers
  • Experience with entire Microsoft Office Suite, powerapps, power automate etc
  • Experience supporting custom ASIC design CAD tools (e.g. Cadence, Synopsis, Mentor etc) license server management a plus

Responsibilities

  • Continuously monitor, troubleshoot and maintain company IT infrastructure, including management of all cloud and SaaS administration.
  • Interface and manage relationship with external IT support vendors, Server co-location vendors and CAD engineering vendors.
  • Manage internal compute and networking systems to support SW and HW design teams.
  • Manage internal dedicated GPU server infrastructure to ensure full and efficient utilization.
  • Monitor compute server usage in order to plan and implement server farm build out as required.
  • Design, manage and maintain Company intranet site and file sharing infrastructure.
  • Manage the onboarding of new employees into the company’s IT and compute environments.
  • Evaluate and manage companywide cyber security practices, across server infrastructure, individual compute HW addressing vulnerabilities to ensure compliance with industry wide practices as well as customer and vendor requirements.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service