pyapacheatlas.readers.excel.ExcelReader.parse_finegrain_column_lineage#

ExcelReader.parse_finegrain_column_lineage(filepath, atlas_entities, atlas_typedefs, use_column_mapping=False)#

Read a given excel file that conforms to the excel atlas template and parse the columns into column lineages. Requires that the relationship attributes are already defined in the provided atlas type defs.

Infers column type from the target table type and an assumed “columns” relationship attribute on the table type.

Infers the column lineage process based on the provided table process (provided in the template’s table excel sheet). Looks for the first relationship type def with an endDef2 of columnLineages.

Parameters
  • filepath (str) – The xlsx file that contains your table and columns.

  • atlas_entities (list()) – A list of AtlasEntity objects representing

  • atlas_typedefs (dict(str,list(dict))) – The results of requesting all type defs from Apache Atlas, including entityDefs, relationshipDefs, etc. relationshipDefs are the only values used.

  • use_column_mapping (bool) – Should the table processes include the columnMappings attribute that represents Column Lineage in Azure Data Catalog. Defaults to False.

Returns

A list of Atlas Entities representing the spreadsheet’s inputs as their json dicts.

Return type

list(AtlasEntity)