About The Position

Capgemini is looking to hire an Airflow Developer to join our team.Your Role: Design, develop, and deploy Apache Airflow DAGs to orchestrate Databricks jobs.Build and maintain ETL workflows to extract, transform, and load data. Optimize Airflow DAGs for performance, scalability, and reliability. Monitor Airflow and Databricks job runs, and troubleshoot execution failures. Refactor and maintain reusable workflow components in python. Collaborate with platform, DevOps, and other data engineering teams.

Requirements

  • Hands-on experience with Apache Airflow 2.0, Databricks, PySpark, and SparkSQL.
  • Strong Python programming skills.
  • Experience designing and developing ETL processes and data pipelines in Databricks.
  • Solid understanding of scheduling, dependencies, and error handling in Airflow.

Responsibilities

  • Design, develop, and deploy Apache Airflow DAGs to orchestrate Databricks jobs.
  • Build and maintain ETL workflows to extract, transform, and load data.
  • Optimize Airflow DAGs for performance, scalability, and reliability.
  • Monitor Airflow and Databricks job runs, and troubleshoot execution failures.
  • Refactor and maintain reusable workflow components in python.
  • Collaborate with platform, DevOps, and other data engineering teams.

Benefits

  • Paid time off based on employee grade (A-F), defined by policy: Vacation: 12-25 days, depending on grade, Company paid holidays, Personal Days, Sick Leave
  • Medical, dental, and vision coverage (or provincial healthcare coordination in Canada)
  • Retirement savings plans (e.g., 401(k) in the U.S., RRSP in Canada)
  • Life and disability insurance
  • Employee assistance programs
  • Other benefits as provided by local policy and eligibility
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service