pyapacheatlas.readers.excel.ExcelConfiguration
- class pyapacheatlas.readers.excel.ExcelConfiguration(column_sheet='FineGrainColumnLineage', table_sheet='TablesLineage', entityDef_sheet='EntityDefs', bulkEntity_sheet='BulkEntities', classificationDef_sheet='ClassificationDefs', updateLineage_sheet='UpdateLineage', columnMapping_sheet='ColumnMapping', **kwargs)
A configuration utility that describes how your Excel file is structured.
You must have a “Columns” sheet and a “Tables” sheet. Their names are configurable with the column_sheet and table_sheet parameters, which default to “FineGrainColumnLineage” and “TablesLineage”.
The Columns sheet must contain a Column header and a Table header for both the Source and the Target. Optionally, a Classifications column can be provided for each of the Source and the Target.
The Tables sheet must contain a Table header and a Type header for both the Source and the Target, along with a Process Name and a Process Type. The Process is the mechanism by which the source becomes the target (e.g. a stored procedure or a query).
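The prefix convention described above can be pictured with a small standalone sketch. It does not use pyapacheatlas itself: `group_headers` is a hypothetical helper, and the exact header spellings (such as “Source Table”) are assumptions based on the description, not names confirmed by the library.

```python
# Hypothetical helper: groups sheet headers by the prefix they start with.
# This only illustrates the Source/Target/Process naming convention; it is
# not how pyapacheatlas parses a workbook internally.
def group_headers(headers, source_prefix="Source", target_prefix="Target",
                  process_prefix="Process"):
    groups = {"source": [], "target": [], "process": [], "other": []}
    prefixes = (
        ("source", source_prefix),
        ("target", target_prefix),
        ("process", process_prefix),
    )
    for header in headers:
        for name, prefix in prefixes:
            if header.startswith(prefix):
                groups[name].append(header)
                break
        else:
            # No prefix matched, so the header belongs to no group.
            groups["other"].append(header)
    return groups


# Assumed header row for a Tables sheet, following the description above.
tables_headers = [
    "Source Table", "Source Type",
    "Target Table", "Target Type",
    "Process Name", "Process Type",
    "Notes",
]
grouped = group_headers(tables_headers)
print(grouped)
```

If the workbook uses different prefixes, the same grouping applies once source_prefix, target_prefix, and process_prefix are changed to match.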
- Parameters
bulkEntity_sheet (str) – Defaults to “BulkEntities”
updateLineage_sheet (str) – Defaults to “UpdateLineage”
columnMapping_sheet (str) – Defaults to “ColumnMapping”
entityDef_sheet (str) – Defaults to “EntityDefs”
classificationDef_sheet (str) – Defaults to “ClassificationDefs”
table_sheet (str) – Defaults to “TablesLineage”
column_sheet (str) – Defaults to “FineGrainColumnLineage”
source_prefix (str) – Defaults to “Source” and represents the prefix of the columns in Excel to be considered related to the source table or column.
target_prefix (str) – Defaults to “Target” and represents the prefix of the columns in Excel to be considered related to the target table or column.
process_prefix (str) – Defaults to “Process” and represents the prefix of the columns in Excel to be considered related to the table process.
default_glossary (str) – Defaults to “Glossary” and identifies what the glossary suffix should be for the [Relationship] meanings column if not provided. Can be manually overridden in Excel with term@OtherGlossary as the cell value.
column_transformation_name (str) – Defaults to “transformation” and identifies the column that represents the transformation for a specific column.
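As a rough sketch of how these parameters might be combined, the example below builds a set of keyword arguments and passes them to ExcelConfiguration when the library is installed. The sheet names, prefixes, and glossary name are made-up values for illustration. `qualify_term` is a hypothetical helper that mimics the term@Glossary rule described for default_glossary; it is not part of pyapacheatlas.

```python
# Illustrative values only: the sheet names, prefixes, and glossary name
# below are assumptions, not defaults of the library.
config_kwargs = {
    "column_sheet": "Columns",
    "table_sheet": "Tables",
    "source_prefix": "Src",
    "target_prefix": "Tgt",
    "default_glossary": "Finance",
}

try:
    # Import path taken from the class name documented on this page.
    from pyapacheatlas.readers.excel import ExcelConfiguration
except ImportError:
    # Library (or one of its dependencies) is not installed.
    ExcelConfiguration = None

# Only build the configuration when the library is available.
config = ExcelConfiguration(**config_kwargs) if ExcelConfiguration else None


def qualify_term(cell_value, default_glossary="Glossary"):
    """Hypothetical helper: append the default glossary unless the cell
    already names one with the term@OtherGlossary form."""
    term = cell_value.strip()
    if "@" in term:
        return term
    return f"{term}@{default_glossary}"


print(qualify_term("Customer"))
print(qualify_term("Customer@Sales"))
print(qualify_term("Revenue", config_kwargs["default_glossary"]))
```

Sheet parameters that are left out keep the defaults listed above, so only the sheets and prefixes that differ from the defaults need to be passed.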
- __init__(column_sheet='FineGrainColumnLineage', table_sheet='TablesLineage', entityDef_sheet='EntityDefs', bulkEntity_sheet='BulkEntities', classificationDef_sheet='ClassificationDefs', updateLineage_sheet='UpdateLineage', columnMapping_sheet='ColumnMapping', **kwargs)
Methods
__init__([column_sheet, table_sheet, …])