Senior Database Reliability Engineer

GridwareSan Francisco, CA
12h$190,000 - $210,000

About The Position

We are seeking a Database Reliability Engineer to own and maintain Gridware’s relational databases, cloud infrastructure, and streaming platforms. This role combines traditional DBA responsibilities ensuring high availability, performance, data integrity and security of databases with infrastructure ownership, including setup and management of Kafka-based streaming pipelines, DevOps automation, and cloud platform management. You will work closely with Data Engineering, Site Reliability, and DevOps teams to proactively monitor, troubleshoot, and optimize all critical infrastructure, enabling rapid deployment of new features while ensuring reliability and data integrity.

Requirements

  • 5+ years of experience managing production relational databases and cloud infrastructure.
  • Hands-on experience with PostgreSQL, MySQL, Amazon RDS/Aurora, or similar.
  • Experience managing Kafka infrastructure and supporting streaming pipelines.
  • Familiarity with DevOps practices, automation, and Infrastructure as Code (Terraform, Ansible, or similar).
  • Proficiency in monitoring and observability for databases, streaming, and cloud infrastructure.
  • Strong troubleshooting skills for complex, multi-layered production systems.
  • Knowledge of database and infrastructure security, access control, and compliance best practices.
  • Ability to collaborate across engineering, DevOps, and Data teams.

Nice To Haves

  • Experience with analytical or NoSQL databases (Redshift, Snowflake, DynamoDB, MongoDB).
  • Containerized deployments and Kubernetes-based operators for databases or Kafka.
  • Event-driven architecture experience and distributed system troubleshooting.
  • Experience with Kafka Streams, consumer/producer tuning, and real-time pipelines.
  • Infrastructure and database automation at scale.

Responsibilities

  • Administer, monitor, and optimize relational databases (PostgreSQL, Amazon RDS) for performance, availability, and security.
  • Troubleshoot complex database and infrastructure issues, including query performance, replication, schema evolution, and event streaming pipelines.
  • Maintain and support Kafka infrastructure for company-wide streaming pipelines and integration with databases.
  • Implement backup, restore, and disaster recovery strategies for databases and streaming platforms.
  • Collaborate with DevOps and Data Engineering teams to maintain CI/CD pipelines for schema, data, and infrastructure changes.
  • Enforce database and infrastructure best practices, standards, and security policies.
  • Proactively monitor health and performance of databases, streaming pipelines, and cloud infrastructure using Grafana, Prometheus, or equivalent.
  • Contribute to Infrastructure as Code (Terraform, Ansible) for database, Kafka, and cloud infrastructure provisioning and management.
  • Support internal teams during incidents or urgent troubleshooting, balancing reliability with rapid deployment needs.

Benefits

  • Health, Dental & Vision (Gold and Platinum with some providers plans fully covered)
  • Paid parental leave
  • Alternating day off (every other Monday)
  • “Off the Grid”, a two week per year paid break for all employees.
  • Commuter allowance
  • Company-paid training
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service