Welcome to ExamTopics
ExamTopics Logo
- Expert Verified, Online, Free.
exam questions

Exam Certified Machine Learning Professional All Questions

View all questions & answers for the Certified Machine Learning Professional exam

Exam Certified Machine Learning Professional topic 1 question 39 discussion

Actual exam question from Databricks's Certified Machine Learning Professional
Question #: 39
Topic #: 1
[All Certified Machine Learning Professional Questions]

A machine learning engineering team has written predictions computed in a batch job to a Delta table for querying. However, the team has noticed that the querying is running slowly. The team has already tuned the size of the data files. Upon investigating, the team has concluded that the rows meeting the query condition are sparsely located throughout each of the data files.
Based on the scenario, which of the following optimization techniques could speed up the query by colocating similar records while considering values in multiple columns?

  • A. Z-Ordering
  • B. Bin-packing
  • C. Write as a Parquet file
  • D. Data skipping
  • E. Tuning the file size
Show Suggested Answer Hide Answer
Suggested Answer: E 🗳️

Comments

Chosen Answer:
This is a voting comment (?) , you can switch to a simple comment.
Switch to a voting comment New
Joy999
5 months ago
Selected Answer: A
"The team has already tuned the size of the data files" - mentioned So E is OUT
upvoted 1 times
...
hugodscarvalho
10 months ago
Selected Answer: A
Z-Ordering is a technique used in Delta Lake to colocate similar records together based on the values of multiple columns. This optimization improves query performance by reducing the amount of data that needs to be scanned to satisfy a query, particularly when filtering on multiple columns.
upvoted 1 times
...
BokNinja
11 months, 1 week ago
The correct answer is A. Z-Ordering. Z-Ordering is a technique used in Delta Lake to optimize the layout of data to improve query performance. It’s a multi-dimensional clustering technique that colocates related information in the same set of files. This colocation can significantly improve the speed of queries and analytics, especially when dealing with high-dimensional data. By using Z-Ordering, the team can ensure that rows meeting the query condition are located close together, thereby speeding up the query.
upvoted 2 times
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...