Principal IT Systems Engineer

XIFIN
San Diego, CA
Posted 4d ago
$131,000 - $190,000
Onsite

About The Position

The Principal Linux Engineer is responsible for the design, implementation, and operational excellence of XiFin’s Linux-based infrastructure supporting mission-critical healthcare SaaS platforms. This role provides deep technical expertise in Linux systems engineering while helping drive reliability, scalability, security, and automation across both on-premises datacenter environments and cloud-based infrastructure.

As a senior technical leader, the Principal Linux Engineer partners closely with Development, DevOps, and IT teams to build resilient and high-performing infrastructure platforms. This role requires strong expertise in enterprise Linux systems, automation, cloud platforms, and container orchestration technologies. You will play a key role in designing infrastructure solutions, resolving complex technical challenges, and mentoring engineering team members.

The ideal candidate is passionate about automation, infrastructure scalability, and security by design, with experience supporting large-scale distributed systems in both physical datacenters and cloud environments such as Microsoft Azure and AWS. This position will be located at our offices in San Diego, CA.

Requirements

  • 10+ years of experience in Linux systems engineering supporting large-scale enterprise infrastructure.
  • Expert knowledge of Linux operating systems, particularly Oracle Enterprise Linux and Red Hat Linux.
  • Experience managing large-scale distributed systems across on-premises datacenters and cloud platforms such as Azure and AWS.
  • Hands-on experience with automation and configuration management tools such as Ansible, Puppet, or Chef.
  • Experience implementing containerized environments using Docker and Kubernetes, including Azure Kubernetes Service (AKS).
  • Experience working with Linux-based web servers and databases including Apache, MySQL, and PostgreSQL.
  • Experience with Linux security, networking, and firewall configuration.
  • Strong troubleshooting and system diagnostics skills in complex production environments.
  • Ability to participate in a 24x7 on-call rotation to support critical production systems.
  • Ability to travel 10–15% to remote company locations or datacenters as required.

Nice To Haves

  • Experience supporting enterprise SaaS environments and healthcare technology platforms.
  • Experience working with Oracle databases including Oracle 19c and higher.
  • Experience managing virtualization platforms such as VMware.
  • Familiarity with monitoring and observability tools including Splunk, Nagios, Zabbix, Grafana, or AppDynamics.
  • Experience supporting enterprise storage, networking, and security platforms.

Responsibilities

  • Design, install, configure, and maintain Linux-based servers, services, and enterprise infrastructure supporting XiFin’s SaaS platforms.
  • Develop resilient and high-performance solutions using automation and orchestration technologies such as Ansible, Terraform, Kubernetes, Docker, and Azure Kubernetes Service (AKS).
  • Ensure infrastructure environments meet requirements for high availability, scalability, performance, and security across both on-premises and cloud platforms.
  • Implement and maintain infrastructure solutions aligned with security frameworks including NIST, HITRUST, and CIS Benchmarks.
  • Develop and maintain system architecture documentation, policies, standard operating procedures, and infrastructure best practices.
  • Focus on building repeatable and automated infrastructure processes to improve operational efficiency and reliability.
  • Administer and support enterprise infrastructure including Linux and Windows servers, VMware environments, Azure services, and container platforms.
  • Manage system backups, disaster recovery processes, and infrastructure resilience planning.
  • Provide advanced troubleshooting and incident response for complex infrastructure and system issues.
  • Conduct root cause analysis and implement long-term solutions to prevent recurring technical issues.
  • Work closely with engineering teams and internal stakeholders to mitigate operational impact and restore services quickly.
  • Implement infrastructure automation and Infrastructure as Code (IaC) solutions using tools such as Ansible, Terraform, Git, GitHub, and GitHub Actions.
  • Automate provisioning, configuration, and management of infrastructure to improve deployment speed and operational consistency.
  • Partner with cross-functional teams including Engineering, DevOps, IT, and Security to align infrastructure solutions with business and product requirements.
  • Provide technical mentorship and guidance to other Linux engineers and infrastructure team members.
  • Contribute to infrastructure architecture discussions and technology decisions.
  • Monitor infrastructure health and proactively implement improvements to increase system reliability and performance.
  • Analyze system metrics, logs, and performance indicators to identify optimization opportunities.
  • Implement monitoring and alerting solutions to ensure proactive issue detection and response.

Benefits

  • Comprehensive health benefits including medical, dental, vision, and telehealth
  • 401(k) with company match and personalized financial coaching to support your financial future
  • Health Savings Account (HSA) with company contributions
  • Wellness incentives that reward your preventative healthcare activities
  • Tuition assistance to support your education and growth
  • Flexible time off and company-paid holidays
  • Social and fun events to build community at our locations!