Production Support Engineer

HHAeXchange
Washington, DC
Posted 13 days ago
$83,000 - $91,000

About The Position

HHAeXchange is the leading technology platform for home and community-based care. Founded in 2008, HHAeXchange was born out of an idea to create a fully comprehensive end-to-end homecare solution to help people who are aging or have disabilities thrive in their homes and communities. Our employees are passionate about transforming the healthcare space by building the only homecare ecosystem that fully connects patients, personal care providers, managed care organizations, and states.

We are looking for a Production Support Engineer to help own the reliability and operational health of our Ruby on Rails platform. This role is ideal for an early-career software engineer who is eager to learn how large production systems work, enjoys debugging real-world issues, and wants to grow into a full software engineering role over time.

This is a hands-on engineering role focused on production troubleshooting, incident response, and tooling, not a call-center or ticket-routing position. You will work closely with senior engineers, DevOps, and product teams to diagnose issues, inspect data, run scripts, and build internal tools that make production issues faster to detect and resolve. When production load allows, this role will also contribute to application code and internal tooling, with a clear growth path into a broader Software Engineer role.

To perform this job successfully, an individual must be able to perform each essential job duty satisfactorily with or without reasonable accommodation. Reasonable accommodations may be made to enable individuals with disabilities to perform the essential functions.

Requirements

  • Bachelor’s degree in Computer Science, Software Engineering, or equivalent practical experience.
  • 2+ years of experience in software development, technical support, DevOps, or a related engineering role.
  • Hands-on experience with Ruby on Rails (academic, internship, or professional).
  • Comfort using a Rails console and understanding of Rails application structure.
  • Basic working knowledge of relational databases (MySQL or PostgreSQL), including querying data.
  • Strong problem-solving skills and the ability to debug issues methodically.
  • Ability to learn new systems quickly and work effectively in a production environment.
  • Clear written and verbal communication skills, especially during incident response.
  • Willingness to explore and adopt AI tools responsibly to enhance productivity and innovation in your role.

Nice To Haves

  • Experience supporting or troubleshooting production systems.
  • Familiarity with Linux environments and basic shell scripting.
  • Exposure to cloud platforms (AWS, GCP).
  • Experience with logging, monitoring, or APM tools.
  • Interest in site reliability, platform engineering, or backend systems.
  • Experience writing small internal tools, scripts, or automation.

Responsibilities

  • Troubleshoot and resolve production incidents across Rails-based services and cloud infrastructure, working from alerts, logs, metrics, and user-reported issues.
  • Use interactive application access tools safely and effectively to inspect application state, diagnose issues, and validate fixes.
  • Investigate and validate data directly in MySQL/PostgreSQL databases using read-only and controlled write access where appropriate.
  • Create and maintain scripts, Rake tasks, and internal tools to streamline incident response, data verification, and operational workflows.
  • Assist in incident response, including triage, escalation, documentation, and post-incident follow-ups.
  • Collaborate with senior engineers and DevOps to identify root causes and propose long-term fixes.
  • Build or enhance internal tools and dashboards that improve visibility into system health, data integrity, and operational risks.
  • Monitor system health, key metrics, and operational risks using dashboards and APM tools such as Datadog, New Relic, and CloudWatch.
  • Help improve runbooks, documentation, and operational playbooks for recurring issues.
  • Gradually contribute to application code changes and bug fixes outside of active incident work.
  • Take on larger ownership of application features and backend services.
  • Contribute to performance, reliability, and scalability initiatives.
  • Transition into a broader Software Engineer role over time.
  • Other duties as assigned by supervisor or HHAeXchange leader.

Benefits

  • HHAeXchange offers competitive health plans, paid time off, company-paid holidays, a 401(k) retirement program with a company-elected match, and other company-sponsored programs.