Exam Certified Data Engineer Associate topic 1 question 72 discussion

Actual exam question from Databricks's Certified Data Engineer Associate
Question #: 72
Topic #: 1

A Delta Live Table pipeline includes two datasets defined using STREAMING LIVE TABLE. Three datasets are defined against Delta Lake table sources using LIVE TABLE.

The table is configured to run in Development mode using the Continuous Pipeline Mode.

Assuming previously unprocessed data exists and all definitions are valid, what is the expected outcome after clicking Start to update the pipeline?

  • A. All datasets will be updated once and the pipeline will shut down. The compute resources will be terminated.
  • B. All datasets will be updated at set intervals until the pipeline is shut down. The compute resources will persist until the pipeline is shut down.
  • C. All datasets will be updated once and the pipeline will persist without any processing. The compute resources will persist but go unused.
  • D. All datasets will be updated once and the pipeline will shut down. The compute resources will persist to allow for additional testing.
  • E. All datasets will be updated at set intervals until the pipeline is shut down. The compute resources will persist to allow for additional testing.
Suggested Answer: E
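
The options differ along two independent settings: the pipeline mode (triggered or continuous) decides how often datasets are updated, and the development/production toggle decides what happens to the compute. A minimal sketch of that reasoning follows. It is a simplified model based on the option wording and the suggested answer, not Databricks code; `Outcome`, `expected_outcome`, and `OPTION_BY_SETTINGS` are hypothetical names introduced for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Outcome:
    updates: str  # how often the datasets are refreshed
    compute: str  # what happens to the cluster


def expected_outcome(continuous: bool, development: bool) -> Outcome:
    """Simplified model: the two settings are treated as independent axes."""
    updates = (
        "at set intervals until the pipeline is shut down"
        if continuous
        else "once, then the pipeline shuts down"
    )
    if development:
        compute = "persists to allow for additional testing"
    elif continuous:
        compute = "persists until the pipeline is shut down"
    else:
        compute = "terminated after the update"
    return Outcome(updates, compute)


# Assumed mapping of (continuous, development) to the answer options above.
OPTION_BY_SETTINGS = {
    (False, False): "A",
    (False, True): "D",
    (True, False): "B",
    (True, True): "E",
}

# The scenario in the question: Continuous Pipeline Mode + Development mode.
scenario = expected_outcome(continuous=True, development=True)
print(OPTION_BY_SETTINGS[(True, True)], scenario)
```

Under this model, option C matches no combination, since a pipeline that stays up in continuous mode keeps processing new data.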

Comments

meow_akk
Highly Voted 1 year, 1 month ago
Ans E. From the docs section "Development and production modes": You can optimize pipeline execution by switching between development and production modes. Use the buttons in the Pipelines UI to switch between these two modes. By default, pipelines run in development mode.

When you run your pipeline in development mode, the Delta Live Tables system does the following:
  • Reuses a cluster to avoid the overhead of restarts. By default, clusters run for two hours when development mode is enabled. You can change this with the pipelines.clusterShutdown.delay setting in your compute settings.
  • Disables pipeline retries so you can immediately detect and fix errors.

In production mode, the Delta Live Tables system does the following:
  • Restarts the cluster for specific recoverable errors, including memory leaks and stale credentials.
  • Retries execution in the event of specific errors, for example, a failure to start a cluster.

https://docs.databricks.com/en/delta-live-tables/updates.html#optimize-execution
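
For illustration, the settings discussed above could appear in a pipeline's JSON settings roughly as follows. This is a sketch built as a Python dict: the pipeline name and the "60s" delay value are assumptions chosen for the example, and only the keys relevant to this question are shown.

```python
import json

# Hypothetical pipeline settings; not a complete or authoritative definition.
pipeline_settings = {
    "name": "example_pipeline",  # assumed name, for illustration only
    "continuous": True,          # Continuous Pipeline Mode
    "development": True,         # Development mode
    "configuration": {
        # Setting named in the quoted documentation; the value is an example.
        "pipelines.clusterShutdown.delay": "60s",
    },
}

settings_json = json.dumps(pipeline_settings, indent=2)
print(settings_json)
```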
upvoted 10 times
...
806e7d2
Most Recent 2 days, 20 hours ago
Selected Answer: B
The pipeline runs in Continuous Pipeline Mode, so datasets will be updated at set intervals. Since the pipeline is in Development Mode, the compute resources will persist until manually shut down.
upvoted 1 times
...
7a22144
3 months, 1 week ago
E is correct!

Option B: "All datasets will be updated at set intervals until the pipeline is shut down. The compute resources will persist until the pipeline is shut down." This option correctly reflects that the pipeline continues running, updating datasets at intervals, and only stops when manually shut down. Compute resources persist throughout this process.

Option E: "All datasets will be updated at set intervals until the pipeline is shut down. The compute resources will persist to allow for additional testing." While this is very similar, it adds the phrase "to allow for additional testing," which might imply that the resources are persisting just for testing purposes. This can be misleading because the primary reason for resource persistence in Continuous mode is to keep the pipeline active and processing data, not solely for testing.
upvoted 1 times
...
3fbc31b
4 months, 2 weeks ago
Selected Answer: E
The answer is E. The compute resources will persist even after the pipeline is shut down.
upvoted 1 times
...
benni_ale
6 months, 4 weeks ago
Selected Answer: E
E, as the cluster actually persists, unlike in B.
upvoted 1 times
...
Garyn
10 months, 4 weeks ago
Selected Answer: E
E. All datasets will be updated at set intervals until the pipeline is shut down. The compute resources will persist to allow for additional testing. Explanation: In Development mode, Delta Live Tables persistently updates datasets at set intervals. The pipeline continuously processes incoming data until manually stopped or shut down. Compute resources, including the cluster used for processing, persist without automatic restarts or retries (as it is the behavior in Development mode). This persistence allows for ongoing processing of data, enabling additional testing or continued data processing until the pipeline is manually shut down. Therefore, option E accurately captures the behavior expected in Development mode, emphasizing the continuous update of datasets and the persistence of compute resources until the pipeline is manually terminated.
upvoted 2 times
...
kz_data
11 months, 3 weeks ago
Selected Answer: E
E seems the correct answer
upvoted 2 times
...
nedlo
11 months, 3 weeks ago
Selected Answer: B
Why E? The compute persists with the same functionality as before, not for "additional testing".
upvoted 2 times
AndreFR
11 months, 1 week ago
Because "The table is configured to run in Development mode": in development mode, "The compute resources will persist to allow for additional testing."
upvoted 1 times
AndreFR
11 months, 1 week ago
So correct answer is E
upvoted 1 times
...
55f31c8
12 months ago
Selected Answer: E
https://docs.databricks.com/en/delta-live-tables/updates.html#continuous-vs-triggered-pipeline-execution
https://docs.databricks.com/en/delta-live-tables/testing.html#use-development-mode-to-run-pipeline-updates
upvoted 2 times
...
anandpsg101
1 year, 1 month ago
Selected Answer: E
E is correct
upvoted 2 times
...
SD5713
1 year, 1 month ago
Selected Answer: E
E. All datasets will be updated at set intervals until the pipeline is shut down. The compute resources will persist to allow for additional testing.
upvoted 2 times
...