Senior Database Reliability Engineer

Gridware•San Francisco, CA

$190,000 - $210,000

About The Position

We are seeking a Database Reliability Engineer to own and maintain Gridware's relational databases, cloud infrastructure, and streaming platforms. This role combines traditional DBA responsibilities (ensuring the high availability, performance, data integrity, and security of databases) with infrastructure ownership, including setup and management of Kafka-based streaming pipelines, DevOps automation, and cloud platform management. You will work closely with Data Engineering, Site Reliability, and DevOps teams to proactively monitor, troubleshoot, and optimize all critical infrastructure, enabling rapid deployment of new features while ensuring reliability and data integrity.

Requirements

5+ years of experience managing production relational databases and cloud infrastructure.
Hands-on experience with PostgreSQL, MySQL, Amazon RDS/Aurora, or similar.
Experience managing Kafka infrastructure and supporting streaming pipelines.
Familiarity with DevOps practices, automation, and Infrastructure as Code (Terraform, Ansible, or similar).
Proficiency in monitoring and observability for databases, streaming, and cloud infrastructure.
Strong troubleshooting skills for complex, multi-layered production systems.
Knowledge of database and infrastructure security, access control, and compliance best practices.
Ability to collaborate across engineering, DevOps, and Data teams.

Nice To Haves

Experience with analytical or NoSQL databases (Redshift, Snowflake, DynamoDB, MongoDB).
Containerized deployments and Kubernetes-based operators for databases or Kafka.
Event-driven architecture experience and distributed system troubleshooting.
Experience with Kafka Streams, consumer/producer tuning, and real-time pipelines.
Infrastructure and database automation at scale.

Responsibilities

Administer, monitor, and optimize relational databases (PostgreSQL, Amazon RDS) for performance, availability, and security.
Troubleshoot complex database and infrastructure issues, including query performance, replication, schema evolution, and event streaming pipelines.
Maintain and support Kafka infrastructure for company-wide streaming pipelines and integration with databases.
Implement backup, restore, and disaster recovery strategies for databases and streaming platforms.
Collaborate with DevOps and Data Engineering teams to maintain CI/CD pipelines for schema, data, and infrastructure changes.
Enforce database and infrastructure best practices, standards, and security policies.
Proactively monitor health and performance of databases, streaming pipelines, and cloud infrastructure using Grafana, Prometheus, or equivalent.
Contribute to Infrastructure as Code (Terraform, Ansible) for database, Kafka, and cloud infrastructure provisioning and management.
Support internal teams during incidents or urgent troubleshooting, balancing reliability with rapid deployment needs.

Benefits

Health, Dental & Vision (Gold and Platinum, with some providers' plans fully covered)
Paid parental leave
Alternating day off (every other Monday)
"Off the Grid", a two-week paid break each year for all employees
Commuter allowance
Company-paid training
