Exam Certified Associate Developer for Apache Spark topic 1 question 16 discussion

Actual exam question from Databricks's Certified Associate Developer for Apache Spark

Question #: 16
Topic #: 1

[All Certified Associate Developer for Apache Spark Questions]

Which of the following operations can be used to create a new DataFrame that has 12 partitions from an original DataFrame df that has 8 partitions?

A. df.repartition(12)
B. df.cache()
C. df.partitionBy(1.5)
D. df.coalesce(12)
E. df.partitionBy(12)

Show Suggested Answer

Suggested Answer: A 🗳️

by 4be8126 at May 3, 2023, 1:15 p.m.

Comments

Submit Cancel

4be8126

Highly Voted 11 months, 3 weeks ago

Selected Answer: A

The answer is A. The repartition operation can be used to increase or decrease the number of partitions in a DataFrame. In this case, the number of partitions is being increased from 8 to 12, so we can use the repartition operation with a partition count of 12: df.repartition(12). Option B, df.cache(), is used to cache a DataFrame in memory for faster access, but it does not change the number of partitions. Option C, df.partitionBy(1.5), is not a valid operation for partitioning a DataFrame. Option D, df.coalesce(12), can be used to reduce the number of partitions in a DataFrame, but it cannot be used to increase the number of partitions beyond the current number. Option E, df.partitionBy(12), is used to partition a DataFrame by a specific column or set of columns, but it does not change the number of partitions.

upvoted 6 times

...

NuclearGandhi

Most Recent 5 months, 2 weeks ago

Selected Answer: A

nice explanation @4be8126

upvoted 1 times

...

TmData

10 months ago

Selected Answer: A

The operation that can be used to create a new DataFrame with 12 partitions from an original DataFrame df that has 8 partitions is: D. df.coalesce(12) Explanation: The coalesce() operation in Spark is used to decrease the number of partitions in a DataFrame, and it can be used to create a new DataFrame with a specific number of partitions. In this case, calling df.coalesce(12) on the original DataFrame df with 8 partitions will create a new DataFrame with 12 partitions.

upvoted 1 times

...

SonicBoom10C9

11 months, 1 week ago

Selected Answer: A

Comprehensive explanation by 4be8126, only using this comment to vote A.

upvoted 1 times

...

Exam Certified Associate Developer for Apache Spark All Questions

View all questions & answers for the Certified Associate Developer for Apache Spark exam

Exam Certified Associate Developer for Apache Spark topic 1 question 16 discussion

Comments

4be8126

NuclearGandhi

TmData

SonicBoom10C9

SY0-701