The Data Engineer II plays a key role in building and evolving the onX Lakehouse. This role focuses on creating reliable, well-modeled data products on Google Cloud while improving how data is described, governed, discovered, and trusted across the organization. You will balance hands-on pipeline development with thoughtful design around schemas, metadata, and operational health. This is a role for engineers who think beyond moving data from A to B and care deeply about how data is understood, reused, and scaled in a modern analytics and AI-driven ecosystem. As an onX Data Engineer, your day to day responsibilities would look like: Lakehouse Development Build and maintain production-grade data pipelines that land and organize data into Apache Iceberg tables on Google Cloud. Apply best practices for data modeling, partitioning, and table design to support analytics, BI, and emerging AI use cases. Evolve existing pipelines while maintaining platform stability and reliability. Data Modeling & Quality Design clear, consistent schemas with an emphasis on usability, longevity, and downstream consumption. Implement data quality checks and validation patterns to ensure datasets are accurate, timely, and trustworthy. Make intentional tradeoffs between flexibility and structure as data products mature. Metadata & Data Discovery Enrich datasets with meaningful metadata, ownership, lineage, and freshness signals. Partner in building a shared data catalog experience that helps users understand what data exists, what it means, and how it should be used . Treat metadata as a first-class asset alongside the data itself. Cross-Functional Partnership Work closely with Business Intelligence and analytics partners to build curated domain datasets aligned with shared definitions and business logic. Collaborate with platform, analytics, and governance stakeholders to improve data consistency across domains. Operational Excellence Monitor pipeline performance, cost, and data freshness. Participate in incident response, root cause analysis, and continuous improvement of reliability (KTLO). Contribute to standards, patterns, and documentation that scale beyond individual pipelines.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Mid Level
Education Level
No Education Listed