Welcome to ExamTopics
ExamTopics Logo
- Expert Verified, Online, Free.
exam questions

Exam Certified Associate Developer for Apache Spark All Questions

View all questions & answers for the Certified Associate Developer for Apache Spark exam

Exam Certified Associate Developer for Apache Spark topic 1 question 156 discussion

Which of the following code blocks fails to return the number of rows in DataFrame storesDF for each distinct combination of values in column division and column storeCategory?

  • A. storesDF.groupBy((col("division"), col("storeCategory")]).count()
  • B. storesDF.groupBy("division").groupBy("storeCategory").count()
  • C. storesDF.groupBy(["division", "storeCategory"]).count()
  • D. storesDF.groupBy("division", "storeCategory").count()
  • E. storesDF.groupBy(col("division“), col("storeCategory")).count()
Show Suggested Answer Hide Answer
Suggested Answer: B 🗳️

Comments

Chosen Answer:
This is a voting comment (?). It is better to Upvote an existing comment if you don't have anything to add.
Switch to a voting comment New
SaiPavan10
7 months, 3 weeks ago
Selected Answer: B
B is the right choice. I tested with my dataframe option B threw this error AttributeError: 'GroupedData' object has no attribute 'groupBy'
upvoted 1 times
...
YiJiaSu
9 months, 1 week ago
Selected Answer: D
D is correct !
upvoted 1 times
Ahlo
9 months ago
it is possible to run in pyspark - storesDF.groupBy("division", "storeCategory").count() Correct answer is B
upvoted 2 times
...
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...