exam questions

Exam Certified Data Engineer Associate All Questions

View all questions & answers for the Certified Data Engineer Associate exam

Exam Certified Data Engineer Associate topic 1 question 118 discussion

Actual exam question from Databricks's Certified Data Engineer Associate
Question #: 118
Topic #: 1
[All Certified Data Engineer Associate Questions]

What is used by Spark to record the offset range of the data being processed in each trigger in order for Structured Streaming to reliably track the exact progress of the processing so that it can handle any kind of failure by restarting and/or reprocessing?

  • A. Checkpointing and Write-ahead Logs
  • B. Replayable Sources and Idempotent Sinks
  • C. Write-ahead Logs and Idempotent Sinks
  • D. Checkpointing and Idempotent Sinks
Show Suggested Answer Hide Answer
Suggested Answer: A 🗳️

Comments

Chosen Answer:
This is a voting comment (?). It is better to Upvote an existing comment if you don't have anything to add.
Switch to a voting comment New
grygi
1 month, 2 weeks ago
Selected Answer: A
A is correct. I had this on the exam, from my results it seems so. I chose D and didn't max this area and I was sure of all other answers.
upvoted 1 times
...
MultiCloudIronMan
1 month, 3 weeks ago
Selected Answer: D
The correct answer is D. Checkpointing and Idempotent Sinks. In Structured Streaming, Spark uses checkpointing to reliably track the progress of the data being processed. Checkpointing saves the state of the streaming query, including the offset ranges of the data processed in each trigger. Idempotent sinks ensure that even if the same data is processed multiple times due to a failure and restart, the results remain consistent and correct.
upvoted 1 times
...
NzmD
2 months, 3 weeks ago
Selected Answer: A
Repeated!
upvoted 1 times
CaoMengde09
1 month, 1 week ago
Repeated and false. It’s D
upvoted 1 times
...
...
9d4d68a
5 months, 1 week ago
Repeated, Correct The correct answer is A. Checkpointing and Write-ahead Logs. Checkpointing records the progress of streaming queries, while write-ahead logs (WALs) capture the data before it is processed, allowing Spark to recover and process data reliably in case of failures.
upvoted 2 times
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...
exam
Someone Bought Contributor Access for:
SY0-701
London, 1 minute ago