Skip to content

Latest commit

 

History

History

lineage

Folders and files

NameName
Last commit message
Last commit date

parent directory

..
 
 
 
 
 
 
 
 

Custom Lineage Connector

This connector reads custom dataset lineages from a config file.

Setup

No special setup is required.

Config File

Create a YAML config file based on the following template.

Required Configurations

lineages:
  - dataset:
      platform: <data_platform>
      name: <dataset_name>
    upstreams:
      - platform: <dataset_platform>
        name: <dataset_name>
        account: <snowflake_account> # only for Snowflake
      ...
  ...

Note: You only need to specify account if the platform is SNOWFLAKE.

Optional Configurations

Output Destination

See Output Config for more information.

Examples

Here's an example specifying that BigQuery table project.db.table1 sources its data from BigQuery table project.db.table2 & Snowflake table db.schema.table3:

lineages:
  - dataset:
      platform: BIGQUERY
      name: project.db.table1
    upstreams:
      - platform: BIGQUERY
        name: project.db.table2
      - platform: SNOWFLAKE
        name: db.schema.table3
        account: snowflake_account

Testing

Follow the Installation instructions to install metaphor-connectors in your environment (or virtualenv).

Run the following command to test the connector locally:

metaphor custom.lineage <config_file>

Manually verify the output after the run finishes.