Which statement describes Delta Lake optimized writes?
A.
Before a Jobs cluster terminates, OPTIMIZE is executed on all tables modified during the most recent job.
B.
An asynchronous job runs after the write completes to detect if files could be further compacted; if yes, an OPTIMIZE job is executed toward a default of 1 GB.
C.
A shuffle occurs prior to writing to try to group similar data together resulting in fewer files instead of each executor writing multiple files based on directory partitions.
D.
Optimized writes use logical partitions instead of directory partitions; because partition boundaries are only represented in metadata, fewer small files are written.
Please provide your input to Questions 144,145,146,147,149 also. Thanks in advance
upvoted 1 times
...
...
Log in to ExamTopics
Sign in:
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.
Upvoting a comment with a selected answer will also increase the vote count towards that answer by one.
So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.
benni_ale
2 weeks, 4 days agoFarid77
1 month, 2 weeks agovexor3
4 months, 1 week agoonly_vimal
3 months, 3 weeks ago