Azure Data Architect (remote US)

HarperCollins Publishers
1d$110,000 - $135,000Remote

About The Position

We are seeking a skilled Azure Data Architect & Engineer to lead the design and implementation of our modern data platform. In this dual-capacity role, you will define the architectural roadmap for our cloud data estate while remaining hands-on in building scalable pipelines, optimizing Spark performance, and migrating legacy workloads into Microsoft Fabric and Azure Synapse.

Requirements

  • 3–5+ years of experience in Data Warehousing, BI, and Cloud Data Engineering.
  • Proven track record of designing end-to-end data solutions on Microsoft Azure.
  • The Azure Stack - Azure Data Lake Storage (ADLS Gen2), Azure SQL DB, and Synapse Analytics, ADF, Fabric ecosystem
  • Expertise in Dimensional Modeling (Star/Snowflake schemas).
  • Advanced Power BI skills, including DAX, RLS implementation, and performance tuning for large datasets.
  • Strong proficiency in Python (PySpark) and SQL.
  • Understanding of Infrastructure as Code (ARM templates, Terraform, or Bicep) and hybrid cloud integration.

Nice To Haves

  • DP-203: Azure Data Engineer Associate
  • DP-600: Fabric Analytics Engineer Associate
  • DP-700: Fabric Data Engineer Associate (New/Beta)

Responsibilities

  • Architectural Leadership: Design and implement scalable, secure, and resilient cloud data architectures using the Medallion (Bronze/Silver/Gold) architecture pattern.
  • Data Pipeline Engineering: Build and maintain complex ETL/ELT workflows using Azure Data Factory (ADF) and Microsoft Fabric Data Factory.
  • Advanced Analytics & Spark: Develop high-performance data processing logic using PySpark, Spark SQL, and Python on Azure Databricks or Fabric Spark Notebooks.
  • Migration & Modernization: Lead hands-on migrations of on-premises data warehouses to Azure, ensuring minimal downtime and data integrity.
  • Performance Tuning: Perform deep code-level analysis of Spark core internals and Delta Lake logs to troubleshoot and optimize large-scale data processing.
  • Unified Analytics: Integrate Power BI with OneLake and Lakehouse architectures, ensuring seamless "Direct Lake" connectivity and optimized reporting.
  • Security & Governance: Implement robust security frameworks, including Row-Level Security (RLS), data masking, and compliance with global privacy regulations
  • DevOps & Best Practices: Champion CI/CD for data (DataOps) using Azure DevOps/GitHub, ensuring code modularity, version control, and comprehensive documentation.

Benefits

  • In addition to cash compensation, the company provides a comprehensive and highly competitive benefits package, with a variety of physical health, retirement and savings, caregiving, emotional wellbeing, transportation, and other benefits, including "elective" benefits employees may select to best fit the needs and personal situations of our diverse workforce.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service