Welcome to ExamTopics
ExamTopics Logo
- Expert Verified, Online, Free.
exam questions

Exam Certified Data Engineer Associate All Questions

View all questions & answers for the Certified Data Engineer Associate exam

Exam Certified Data Engineer Associate topic 1 question 36 discussion

Actual exam question from Databricks's Certified Data Engineer Associate
Question #: 36
Topic #: 1
[All Certified Data Engineer Associate Questions]

A data engineer has three tables in a Delta Live Tables (DLT) pipeline. They have configured the pipeline to drop invalid records at each table. They notice that some data is being dropped due to quality concerns at some point in the DLT pipeline. They would like to determine at which table in their pipeline the data is being dropped.
Which of the following approaches can the data engineer take to identify the table that is dropping the records?

  • A. They can set up separate expectations for each table when developing their DLT pipeline.
  • B. They cannot determine which table is dropping the records.
  • C. They can set up DLT to notify them via email when records are dropped.
  • D. They can navigate to the DLT pipeline page, click on each table, and view the data quality statistics.
  • E. They can navigate to the DLT pipeline page, click on the “Error” button, and review the present errors.
Show Suggested Answer Hide Answer
Suggested Answer: D 🗳️

Comments

Chosen Answer:
This is a voting comment (?) , you can switch to a simple comment.
Switch to a voting comment New
vctrhugo
Highly Voted 1 year, 2 months ago
Selected Answer: D
D. They can navigate to the DLT pipeline page, click on each table, and view the data quality statistics. To identify the table in a Delta Live Tables (DLT) pipeline where data is being dropped due to quality concerns, the data engineer can navigate to the DLT pipeline page, click on each table in the pipeline, and view the data quality statistics. These statistics often include information about records dropped, violations of expectations, and other data quality metrics. By examining the data quality statistics for each table in the pipeline, the data engineer can determine at which table the data is being dropped.
upvoted 15 times
...
CID2024
Most Recent 2 months, 3 weeks ago
The correct answer is: D. They can navigate to the DLT pipeline page, click on each table, and view the data quality statistics. Delta Live Tables provides detailed data quality statistics for each table in the pipeline. By navigating to the DLT pipeline page and clicking on each table, the data engineer can view these statistics and determine at which table the records are being dropped due to quality concerns. This allows them to identify and address the specific issues causing the data to be dropped.
upvoted 1 times
...
3fbc31b
4 months, 2 weeks ago
Selected Answer: D
Correct answer is "D".
upvoted 1 times
...
benni_ale
7 months ago
Selected Answer: D
I would say D but I have never really tested it, still other solutions smell wrong
upvoted 1 times
...
agAshish
9 months, 3 weeks ago
D is correct By clicking on each table in the DLT pipeline page, the data engineer may be able to access data quality statistics, error logs, or other information related to dropped records. This can help them pinpoint at which table in the pipeline the data is being dropped.
upvoted 1 times
...
Diewrine
11 months, 3 weeks ago
Selected Answer: D
E is for when an error occur. But pipeline is defined to drop some records that will not result on error
upvoted 2 times
...
awofalus
1 year ago
Selected Answer: D
D is correct
upvoted 1 times
...
Atnafu
1 year, 4 months ago
E When records are dropped due to quality concerns in a DLT pipeline, the errors are logged in the event log. The data engineer can navigate to the DLT pipeline page and click on the “Error” button to view the present errors. The errors will show the table where the records were dropped. Option A: Setting up separate expectations for each table will not help the data engineer determine which table is dropping the records. Option B: The data engineer cannot determine which table is dropping the records without looking at the event log. Option C: Setting up DLT to notify the data engineer via email when records are dropped will not help the data engineer determine which table is dropping the records. Option D: Viewing the data quality statistics for each table will not help the data engineer determine which table is dropping the records.
upvoted 2 times
DavidRou
1 year ago
Don't you have to select a table generated in a single step of the pipeline to access the errors through the buttton though? Probably D is the right one here
upvoted 1 times
...
...
prasioso
1 year, 6 months ago
Selected Answer: D
Think answer is D. The pipeline is configured to drop invalid records, i.e. a SQL equivalent query with a ON VIOLATION DROP ROW clause. This will not result in a failed pipeline execution because there are no errors. Instead, you'd have to go to each table and review the quality charactistics.
upvoted 4 times
Atnafu
1 year, 4 months ago
Option D is incorrect because viewing the data quality statistics for each table will not help the data engineer identify which table is dropping the records. The data quality statistics will show the overall quality of the data in each table, but they will not show which table is dropping the records. For example, if the data quality statistics for a table show that 10% of the records are invalid, this does not mean that 10% of the records are being dropped. The invalid records could be being updated, inserted, or deleted.
upvoted 2 times
peadar_pa
3 weeks, 5 days ago
No D is correct for the reasons stated by everyone else
upvoted 1 times
...
...
...
[Removed]
1 year, 7 months ago
Is this for v2 or v3
upvoted 3 times
...
XiltroX
1 year, 7 months ago
Selected Answer: D
The correct answer is D
upvoted 4 times
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...