A Delta Lake table in the Lakehouse named customer_churn_params is used in churn prediction by the machine learning team. The table contains information about customers derived from a number of upstream sources. Currently, the data engineering team populates this table nightly by overwriting the table with the current valid values derived from upstream data sources.
Immediately after each update succeeds, the data engineering team would like to determine the difference between the new version and the previous version of the table.
Given the current implementation, which method can be used?
KadELbied
2 months agoarekm
6 months agoSriramiyer92
6 months, 3 weeks agoJugiboss
8 months, 2 weeks agoarekm
6 months agoJugiboss
8 months, 2 weeks agocales
8 months, 3 weeks agobenni_ale
8 months, 1 week agorobodog
10 months, 2 weeks agoHelixAbdu
11 months, 2 weeks agoc00ccb7
12 months agoDeb9753
1 year, 1 month ago