DevOps Engineer / SRE

Fundraise Up
1dRemote

About The Position

As a DevOps Engineer / SRE, you will be a generalist with a broad impact on our entire infrastructure. You won’t be siloed into a single area like databases or monitoring; instead, you will have the opportunity to work across our full stack, from server provisioning and CI/CD pipelines to observability and database management. Your primary mission is to ensure our platform is scalable, secure, and exceptionally reliable, as even minutes of downtime are costly for the causes we support.

Requirements

  • 4+ years of experience as a DevOps Engineer, SRE, or Linux Systems Administrator.
  • A strong foundation in Linux (we use Ubuntu), including core CLI troubleshooting tools.
  • Solid experience with configuration management tools, particularly Ansible.
  • Experience working with servers (VMs and/or bare metal), including setup and troubleshooting at the OS level.
  • Proficiency in building and maintaining complex CI/CD pipelines (Jenkins experience is a major plus).
  • A good understanding of networking fundamentals, including TCP/IP and firewall configuration (iptables).
  • Experience with monitoring and observability principles (Prometheus/VictoriaMetrics stack preferred).
  • Experience working with Git.
  • Scripting ability in Bash or Python.
  • A high sense of ownership, responsibility, and attention to detail. We value professionals who are proactive and reliable.

Nice To Haves

  • Data Systems: Managing ClickHouse, MongoDB, Kafka, JupyterHub, or Airflow.
  • Observability: VictoriaMetrics or Graylog at scale.
  • Storage: Software RAID, LVM, and Full Disk Encryption (Clevis/Tang).
  • Curiosity and a hypothesis-driven mindset
  • Ability to communicate complex analytical concepts to non-technical audiences
  • Detail-oriented with a strong sense of ownership
  • Comfort working in fast-paced, data-rich environments

Responsibilities

  • Work with servers (VMs and bare metal) at the OS level and below: configuration, maintenance, and troubleshooting.
  • Automate infrastructure and routine operational tasks using Ansible and custom scripting (Bash / Python).
  • Build, maintain, and support complex CI/CD pipelines. We use scripted pipelines in Jenkins.
  • Develop and support our monitoring and observability stack (Prometheus-style metrics, VictoriaMetrics, Grafana, Graylog).
  • Work with databases and data systems, including ClickHouse and MongoDB, with a focus on monitoring and operational stability.
  • Investigate and resolve issues across Linux OS, networking, and application layers.
  • Collaborate with engineers across teams to improve system reliability and automation.
  • Take ownership of production systems and ensure stability and predictability in day-to-day operations.

Benefits

  • 31 days off
  • 100% paid telemedicine plan
  • Home Office Setup Assistance: the company offers assistance with purchasing furniture (office chair, office desk, monitor) and other items to create a comfortable workspace.
  • English learning courses
  • Relevant professional education
  • Gym or swimming pool
  • Co-working
  • Remote working.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service