Exam Certified Associate Developer for Apache Spark topic 1 question 174 discussion

Which of the following operations will always return a new DataFrame with updated partitions from DataFrame storesDF by inducing a shuffle?

  • A. storesDF.coalesce()
  • B. storesDF.rdd.getNumPartitions()
  • C. storesDF.repartition()
  • D. storesDF.union()
  • E. storesDF.intersect()
Suggested Answer: C

Comments

Souvik_79
1 week, 4 days ago
The correct answer is C, storesDF.repartition(). Explanation: repartition() returns a new DataFrame with the specified number of partitions and always induces a full shuffle. It can either increase or decrease the number of partitions, and when it is called with only a partition count, the rows are redistributed roughly evenly across the new partitions. By contrast, coalesce() can only reduce the number of partitions and does so without a full shuffle.
upvoted 1 time
Sowwy1
3 months, 3 weeks ago
C. storesDF.repartition()
upvoted 1 time