pyapacheatlas.readers.excel.ExcelConfiguration#

class pyapacheatlas.readers.excel.ExcelConfiguration(column_sheet='FineGrainColumnLineage', table_sheet='TablesLineage', entityDef_sheet='EntityDefs', bulkEntity_sheet='BulkEntities', classificationDef_sheet='ClassificationDefs', updateLineage_sheet='UpdateLineage', columnMapping_sheet='ColumnMapping', **kwargs)#

A configuration utility to understand how your Excel file is structured.

You must have a “Columns” and “Tables” sheet. The name is configurable with the column_sheet and table_sheet properties.

The Columns sheet must contain a “Source/Target” Column and Table header. Optionally, a Classifications column can be provided for each Source/Target.

The Tables sheet must contain a “Source/Target” Table and Type along with a Process Name and Process Type. The Process is related to the mechanism by which source becomes the target (e.g. a Stored Procedure or Query).

Parameters
  • bulkEntity_sheet (str) – Defaults to “BulkEntities”

  • updateLineage_sheet (str) – Defaults to “UpdateLineage”

  • columnMapping_sheet (str) – Defaults to “ColumnMapping”

  • entityDef_sheet (str) – Defaults to “EntityDefs”

  • classificationDef_sheet (str) – Defaults to “ClassificationDefs”

  • table_sheet (str) – Defaults to “TablesLineage”

  • column_sheet (str) – Defaults to “FineGrainColumnLineage”

  • source_prefix (str) – Defaults to “Source” and represents the prefix of the columns in Excel to be considered related to the source table or column.

  • target_prefix (str) – Defaults to “Target” and represents the prefix of the columns in Excel to be considered related to the target table or column.

  • process_prefix (str) – Defaults to “Process” and represents the prefix of the columns in Excel to be considered related to the table process.

  • default_glossary (str) – Defaults to “Glossary” and identifies what the glossary suffix should be for the [Relationship] meanings column if not provided. Can be manually overridden in Excel with term@OtherGlossary as the cell value.

  • column_transformation_name (str) – Defaults to “transformation” and identifies the column that represents the transformation for a specific column.

__init__(column_sheet='FineGrainColumnLineage', table_sheet='TablesLineage', entityDef_sheet='EntityDefs', bulkEntity_sheet='BulkEntities', classificationDef_sheet='ClassificationDefs', updateLineage_sheet='UpdateLineage', columnMapping_sheet='ColumnMapping', **kwargs)#

Methods

__init__([column_sheet, table_sheet, …])