Hi all,
I am lately thinking of how to make my life as Data Engineer easier and get a up-to-date documentation of the current backend relationships. The goal is to see: which tables from source are transformed into which VIEWS / TABLES downstream. I want to have a quick and accurate answer to:
- Which tables are sources for a given table in the datamodel and how are they connected?
- What is the potential impact on downstream tables if I change something on a given extraction.
- Which tables are not in use?
- Potentially even better would be a granularity of column and not only View/Table
I believe those questions belong all to the concept of Data Lineage. Which in my opinion is currently very poorly supported within Celonis.
I would like to open the discussion on how to best do this and if this is a relevant problem to more people:
- What is you workflow when you face the questions above?
- (How) are you guys documenting the SQL Backend?
- Is it done manually?