Data Catalog Engineer

Peraton•Ashburn, VA

2d•$80,000 - $128,000

About The Position

Peraton is seeking a Data Catalog Engineer to support U.S. Customs and Border Protection (CBP) analytics and intelligence support programs. In this role, you will design, implement, and maintain enterprise data catalog and metadata management capabilities that enable secure data discovery, lineage tracking, and governance across mission and analytics platforms. The ideal candidate brings a combination of data engineering expertise, knowledge of metadata management tools, and experience with modern data architectures to enhance data visibility, accessibility, and governance across CBP analytics environments. Support may be provided across multiple mission locations: Ashburn, VA; Sterling, VA; and Washington, DC.

Requirements

5 years of experience with a BS/BA, or 3 years with an MS/MA. 9 years of experience with a high school diploma or equivalent can be considered in lieu of a degree.
2+ years of experience in data engineering, metadata management, or data governance platforms.
Hands-on experience with enterprise data catalog or metadata management tools.
Strong understanding of metadata types (technical, business, operational) and data lineage concepts.
Experience with Python, SQL, or scripting for metadata ingestion, automation, or integration.
Knowledge of modern data architectures, including data lakes, data warehouses, and ETL/ELT pipelines.
Ability to collaborate with technical and non-technical stakeholders to support data governance initiatives.
Ability to obtain and maintain CBP Background Investigation (BI) suitability.
U.S. citizenship required.

Nice To Haves

Bachelor's or Master's degree.
Experience with Collibra, Alation, DataHub, Amundsen, or similar catalog platforms.
Familiarity with cloud data ecosystems such as AWS Glue Data Catalog, Azure Purview, or Google Data Catalog.
Experience integrating metadata with workflow orchestration tools (e.g., Airflow, Prefect, Dagster).
Knowledge of data privacy, classification, and compliance frameworks.
Background supporting federal, intelligence, or regulated data environments.

Responsibilities

Design, deploy, and maintain enterprise data catalog and metadata management platforms (e.g., Collibra, Alation, DataHub, Amundsen, or cloud-native catalog services).
Integrate and manage metadata from databases, data lakes, ETL pipelines, analytics platforms, and machine learning systems.
Develop automated metadata ingestion pipelines using APIs, connectors, and scripting to ensure catalog accuracy and completeness.
Implement data lineage tracking across datasets, data pipelines, and downstream analytics applications.
Manage catalog taxonomies, business glossaries, schema definitions, and metadata policies to support enterprise data governance.
Collaborate with data engineering, analytics, security, and governance teams to onboard new data assets and ensure proper metadata tagging and classification.
Improve data discovery and usability through curated metadata, standardized documentation, and search enhancements.
Monitor metadata quality, ingestion pipelines, and lineage coverage to ensure catalog reliability and performance.
Support data governance and compliance initiatives, including data classification, access controls, and audit readiness.
Develop documentation, standards, and best practices to support enterprise metadata management and catalog adoption.