Exam Certified Associate Developer for Apache Spark All Questions

View all questions & answers for the Certified Associate Developer for Apache Spark exam

Exam Certified Associate Developer for Apache Spark topic 1 question 110 discussion

Actual exam question from Databricks's Certified Associate Developer for Apache Spark

Question #: 110
Topic #: 1

The code block shown below contains an error. The code block is intended to return a new DataFrame that is the result of an inner join between DataFrame storesDF and DataFrame employeesDF on column storeId. Identify the error.

Code block:

StoresDF.join(employeesDF, Seq("storeId")

A. The key column storeId needs to be a string like “storeId”.
B. The key column storeId needs to be specified in an expression of both DataFrame columns like storesDF.storeId === employeesDF.storeId.
C. The default argument to the joinType parameter is “inner” - an additional argument of “left” must be specified.
D. There is no DataFrame.join() operation - DataFrame.merge() should be used instead.
E. The key column storeId needs to be wrapped in the col() operation.

Suggested Answer: A 🗳️

by cookiemonster42 at Aug. 2, 2023, 7:51 p.m.

Comments

58470e1

5 months ago

Selected Answer: A

Not sure why C was selected: it says the default is INNER, which is indeed the default for DataFrame.join, and the join requires a string column name.

upvoted 1 times

...

deadbeef38

10 months ago

I don't see any error in the code other than a typo

upvoted 1 times

...

SaiPavan10

1 year ago

Selected Answer: A

Since the default join is inner, the key column should be the string "storeId".

upvoted 2 times

...

Sowwy1

1 year ago

A is correct

upvoted 2 times

...

saryu

1 year, 2 months ago

A is right

upvoted 1 times

...

newusername

1 year, 5 months ago

The answer (C) is just about the most wrong option that could have been chosen here.

upvoted 2 times

...

cookiemonster42

1 year, 8 months ago

I think the question is corrupt. The most plausible answer is A, even though the column name is already presented as a string.

upvoted 1 times

newusername

1 year, 5 months ago

Yup, something is 100% wrong, because storesDF.join(employeesDF, Seq("storeId")) will work.

upvoted 2 times
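
For anyone who wants to check the claim that the call works once the typo is fixed, here is a minimal Scala sketch. Only storesDF, employeesDF, and storeId come from the question. The sample rows, the extra column names (region, employeeId, name), the object name StoreJoinSketch, and the local SparkSession settings are made up for illustration.

```scala
import org.apache.spark.sql.{DataFrame, SparkSession}

object StoreJoinSketch {
  // Builds two tiny, hypothetical DataFrames and joins them on storeId.
  def joinStores(spark: SparkSession): DataFrame = {
    import spark.implicits._

    // Hypothetical sample data; store 3 has no employees.
    val storesDF = Seq((1, "North"), (2, "South"), (3, "East"))
      .toDF("storeId", "region")

    // Hypothetical sample data; employee 12 points at a store that does not exist.
    val employeesDF = Seq((10, 1, "Ana"), (11, 2, "Ben"), (12, 4, "Cy"))
      .toDF("employeeId", "storeId", "name")

    // The key is passed as a Seq of string column names.
    // With no joinType argument, the join type defaults to inner.
    storesDF.join(employeesDF, Seq("storeId"))
  }

  def main(args: Array[String]): Unit = {
    // Local session for experimentation only (assumed settings).
    val spark = SparkSession.builder()
      .master("local[*]")
      .appName("store-join-sketch")
      .getOrCreate()

    joinStores(spark).show()
    spark.stop()
  }
}
```

With these made-up rows, only stores 1 and 2 survive the join, and storeId appears once in the result. Passing a column expression such as storesDF("storeId") === employeesDF("storeId") should also produce an inner join, but the result then keeps both storeId columns.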

...
