exam questions

Exam Certified Data Engineer Professional All Questions

View all questions & answers for the Certified Data Engineer Professional exam

Exam Certified Data Engineer Professional topic 1 question 24 discussion

Actual exam question from Databricks's Certified Data Engineer Professional
Question #: 24
Topic #: 1
[All Certified Data Engineer Professional Questions]

Which configuration parameter directly affects the size of a spark-partition upon ingestion of data into Spark?

  • A. spark.sql.files.maxPartitionBytes
  • B. spark.sql.autoBroadcastJoinThreshold
  • C. spark.sql.files.openCostInBytes
  • D. spark.sql.adaptive.coalescePartitions.minPartitionNum
  • E. spark.sql.adaptive.advisoryPartitionSizeInBytes
Show Suggested Answer Hide Answer
Suggested Answer: A 🗳️

Comments

Chosen Answer:
This is a voting comment (?). It is better to Upvote an existing comment if you don't have anything to add.
Switch to a voting comment New
8605246
Highly Voted 1 year, 2 months ago
correct; The maximum number of bytes to pack into a single partition when reading files. This configuration is effective only when using file-based sources such as Parquet, JSON and ORC. https://spark.apache.org/docs/latest/sql-performance-tuning.html
upvoted 5 times
...
Jay_98_11
Most Recent 9 months, 2 weeks ago
Selected Answer: A
correct
upvoted 3 times
...
sturcu
1 year ago
Selected Answer: A
from the provided list, this fits best. In reality partition size/number can be influenced my many settings
upvoted 1 times
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...
exam
Someone Bought Contributor Access for:
SY0-701
London, 1 minute ago