Batch Processing System Architect

Hewlett Packard Enterprise
Spring, TX
Hybrid

About The Position

This role has been designed as ‘Hybrid’ with an expectation that you will work on average 2 days per week from an HPE office.

Who We Are: Hewlett Packard Enterprise is the global edge-to-cloud company advancing the way people live and work. We help companies connect, protect, analyze, and act on their data and applications wherever they live, from edge to cloud, so they can turn insights into outcomes at the speed required to thrive in today’s complex world. Our culture thrives on finding new and better ways to accelerate what’s next. We know varied backgrounds are valued and succeed here. We have the flexibility to manage our work and personal needs. We make bold moves, together, and are a force for good. If you are looking to stretch and grow your career, our culture will embrace you. Open up opportunities with HPE.

Job Description:

Job Family Definition: Researches, designs, develops, configures, integrates, tests and maintains existing and new business applications and/or information systems solutions including databases through integration of technical and business requirements. Applications and infrastructure solutions include both 3rd party software and internally developed applications and infrastructure. Responsibilities include, but are not limited to, analysis of business requirements, coding of modifications or new programs, creation of documentation, testing and maintenance of applications, infrastructure, and information systems including database management systems. Works within the Information Technology function, obtaining resources and working in support of objectives and strategies. Provides required documentation and participates in architecture reviews to ensure that the solutions comply with standards and use approved technologies.

Management Level Definition: Contributions have visible technical impact on a product or major subcomponent. Applies in-depth professional knowledge and innovative ideas to solve complex problems. Visible contributions improve time-to-market, achieve cost reductions, or satisfy current and future unmet customer needs. Recognized internal authority on key technology area applying innovative principles and ideas. Provides technical leadership for significant project/program work. Leads or participates in cross-functional initiatives and contributes to mentorship and knowledge sharing across the organization.

Requirements

  • Hands-on experience with LSF Scheduler/Resource Manager and RTM monitoring, or similar platforms.
  • Familiarity with ASIC tools such as Cadence, Synopsys, and Mentor.
  • Strong scripting skills (Python, Bash, Perl) to manage EDA and workload management environments.
  • Understanding of Flexera Licensing and license file management.
  • 10-15 years of experience managing RHEL systems (certifications preferred).
  • Proficiency in automation tools such as Ansible.
  • Experience managing NFS servers, user file permissions, and Linux system patching.
  • Familiarity with SaltStack, VMware Aria, or similar hardening tools.
  • Strong troubleshooting skills related to networks, storage, and Linux performance.
  • Experience with incident/change management platforms like ServiceNow.
  • Ability to coordinate with architecture, networking, and infrastructure teams to resolve system-level issues.
  • Familiarity with provisioning and decommissioning processes for Linux systems.
  • Strong communication and collaboration skills to work with cross-functional teams.
  • Strong documentation skills for runbooks, operational workflows, and automated processes.
  • Ability to handle multiple priorities in a fast-paced environment.

Nice To Haves

  • RHEL certifications or equivalent Linux certifications.
  • Proven experience managing environments with large-scale deployments of Linux systems and EDA tools.
  • Experience with performance tuning for ASIC workflows, storage, and networking.
  • Experience with NoMachine (NX) or similar remote desktop applications.

Responsibilities

  • Manage and configure LSF Scheduler/Resource Manager and RTM monitoring, or similar workload management platforms, to optimize ASIC development workflows.
  • Install and update ASIC tools from vendors such as Cadence, Synopsys, and Mentor, and rsync tools and projects from other HPE ASIC lab environments.
  • Develop and maintain Python, Bash, and Perl scripts to manage the LSF environment and streamline system operations.
  • Collaborate with the ASIC tools team to update and maintain LSF configurations to support evolving engineering workflows.
  • Understand and manage Flexera Licensing and license file configurations to ensure proper operation of licensed EDA tools.
  • Occasionally assist ASIC teams in investigating and debugging tool-related issues to ensure optimal productivity.
  • Work with business units to coordinate system environment events, such as patching or planned downtime, ensuring minimal disruption to engineering workflows.
  • Collaborate with business units to ensure the continuity of business services and compliance with operational standards.
  • Manage Linux systems at scale, drawing on 10-15 years of RHEL experience (certifications highly preferred).
  • Develop and maintain scripts (Ansible, Bash, Python, Perl) for automated deployments, updates, and system configurations across multiple systems.
  • Ensure all deployments and updates are thoroughly tested prior to rollout to minimize impact on production environments.
  • Manage NFS servers, user file permissions, and associated storage infrastructure.
  • Patch Linux environments using COLO/PC-provided scripts (yum, RPMs, and dependency resolution).
  • Administer Linux user accounts and authentication through LDAP and local service accounts.
  • Install and manage certificates for secure system operations.
  • Configure and support secure remote access tools, such as NoMachine (NX) and graphical desktop environments (GNOME/KDE/MATE).
  • Work with Linux hardening tools, such as SaltStack or VMware Aria, and operate within hardened environments.
  • Measure and optimize system performance, including network, storage, and overall resource utilization.
  • Create and maintain comprehensive documentation, including operational runbooks and automated processes.
  • Collaborate with architecture teams to design best-of-breed hardware solutions for the ASIC environment.
  • Use ServiceNow to submit, track, and manage incidents, provisioning requests, and decommissioning tickets.
  • Create and manage ServiceNow change requests (CHG) to implement projects such as quarterly patching and infrastructure updates.
  • Work with infrastructure and site networking teams to resolve DNS, routing, and firewall issues to ensure seamless system operation.
  • Work with infrastructure teams and management to escalate and resolve any environment-related issues impacting performance.

Benefits

  • We strive to provide our team members and their loved ones with a comprehensive suite of benefits that supports their physical, financial and emotional wellbeing.
  • We also invest in your career because the better you are, the better we all are. We have specific programs tailored to helping you reach any career goals you have, whether you want to become a knowledge expert in your field or apply your skills to another division.
  • We are unconditionally inclusive in the way we work and celebrate individual uniqueness. We know varied backgrounds are valued and succeed here. We have the flexibility to manage our work and personal needs. We make bold moves, together, and are a force for good.