Welcome to ExamTopics
ExamTopics Logo
- Expert Verified, Online, Free.
exam questions

Exam Certified Data Engineer Professional All Questions

View all questions & answers for the Certified Data Engineer Professional exam

Exam Certified Data Engineer Professional topic 1 question 161 discussion

Actual exam question from Databricks's Certified Data Engineer Professional
Question #: 161
Topic #: 1
[All Certified Data Engineer Professional Questions]

A data engineer wants to join a stream of advertisement impressions (when an ad was shown) with another stream of user clicks on advertisements to correlate when impressions led to monetizable clicks.

In the code below, Impressions is a streaming DataFrame with a watermark ("event_time", "10 minutes")



The data engineer notices the query slowing down significantly.

Which solution would improve the performance?

  • A. Joining on event time constraint: clickTime >= impressionTime AND clickTime <= impressionTime interval 1 hour
  • B. Joining on event time constraint: clickTime + 3 hours < impressionTime - 2 hours
  • C. Joining on event time constraint: clickTime == impressionTime using a leftOuter join
  • D. Joining on event time constraint: clickTime >= impressionTime - interval 3 hours and removing watermarks
Show Suggested Answer Hide Answer
Suggested Answer: A 🗳️

Comments

Chosen Answer:
This is a voting comment (?) , you can switch to a simple comment.
Switch to a voting comment New
m79590530
1 month ago
Selected Answer: A
Answer A is the only possible logically. B configures clickTime to be earlier than impressionTime C says that clickTime should be the same as impressionTime with all clicks left joined to impressions D wants to remove Watermarks which will lead to memory leaks and depletion for both streams staging/aggregation purposes by Spark
upvoted 1 times
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...