Exam Certified Associate Developer for Apache Spark topic 1 question 115 discussion

Actual exam question from Databricks's Certified Associate Developer for Apache Spark

Question #: 115
Topic #: 1


Which of the following code blocks returns a DataFrame containing only the rows from DataFrame storesDF where the value in column sqft is less than or equal to 25,000 AND the value in column customerSatisfaction is greater than or equal to 30?

A. storesDF.filter(col("sqft") <= 25000 and col("customerSatisfaction") >= 30)
B. storesDF.filter(col("sqft") <= 25000 or col("customerSatisfaction") >= 30)
C. storesDF.filter(sqft <= 25000 and customerSatisfaction >= 30)
D. storesDF.filter(col("sqft") <= 25000 & col("customerSatisfaction") >= 30)
E. storesDF.filter(sqft <= 25000 & customerSatisfaction >= 30)


Suggested Answer: D

by MSH_6 at Aug. 7, 2023, 9:45 p.m.

Comments


gaco

Highly Voted 10 months, 3 weeks ago

In PySpark, all of the options are wrong as written, because each condition inside the filter should be wrapped in parentheses. It should be: D. storesDF.filter((col("sqft") <= 25000) & (col("customerSatisfaction") >= 30))

upvoted 5 times

...
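The parentheses point comes down to Python operator precedence: & binds more tightly than <= and >=. The sketch below uses only the standard-library ast module to show how Python parses the two spellings; it does not run PySpark, and the variable names are made up for illustration. In PySpark the unparenthesized form would typically fail, since a chained comparison ends up asking for the truth value of a Column.

```python
import ast

# Option D exactly as written in the question, without inner parentheses.
unparenthesized = 'col("sqft") <= 25000 & col("customerSatisfaction") >= 30'
tree = ast.parse(unparenthesized, mode="eval").body

# Python reads this as a chained comparison:
#   col("sqft") <= (25000 & col("customerSatisfaction")) >= 30
top_unparenthesized = type(tree).__name__
middle_operand = tree.comparators[0]  # the `25000 & col(...)` sub-expression

# The parenthesized form is a single bitwise AND of two comparisons.
parenthesized = '(col("sqft") <= 25000) & (col("customerSatisfaction") >= 30)'
fixed_tree = ast.parse(parenthesized, mode="eval").body
top_parenthesized = type(fixed_tree).__name__

print(top_unparenthesized, top_parenthesized)
```

So D is the only option using the right operator, but it needs the inner parentheses to mean what the question intends.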

PushpakKothekar

Most Recent 3 months ago

Selected Answer: D

For DataFrames we cannot use AND or OR. Those are applicable only in spark.sql. Hence the correct answer is D.

upvoted 1 times

...

Souvik_79

5 months, 2 weeks ago

Selected Answer: D

& is used as "and" in PySpark.

upvoted 2 times

...

Jgo1986

10 months, 1 week ago

The closest is D. AND and OR are not valid operators for filtering in PySpark.

upvoted 2 times

...

65bd33e

10 months, 4 weeks ago

Selected Answer: D

D is the answer

upvoted 1 times

...

deadbeef38

1 year ago

Selected Answer: A

A is right

upvoted 1 times

Jgo1986

10 months, 1 week ago

No, it's not, ...

upvoted 1 times

...

Sowwy1

1 year, 3 months ago

It's D: https://sparkbyexamples.com/spark/spark-and-or-not-operators/ PySpark logical operations use the bitwise operators: & for and, | for or, ~ for not.

upvoted 2 times

...
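Several comments above say that and/or do not work on column expressions while &, | and ~ do. A rough sketch of why, in plain Python: the keywords and, or and not cannot be overloaded and instead ask an object for its truth value, whereas &, | and ~ dispatch to methods a class can define. ToyColumn below is a hypothetical stand-in written for this illustration, not PySpark's actual Column class, and the error message is invented.

```python
class ToyColumn:
    """Hypothetical stand-in for a column expression; not PySpark's real class."""

    def __init__(self, expr):
        self.expr = expr

    def __le__(self, other):
        return ToyColumn(f"({self.expr} <= {other})")

    def __ge__(self, other):
        return ToyColumn(f"({self.expr} >= {other})")

    def __and__(self, other):  # invoked by `&`
        return ToyColumn(f"({self.expr} AND {other.expr})")

    def __or__(self, other):  # invoked by `|`
        return ToyColumn(f"({self.expr} OR {other.expr})")

    def __invert__(self):  # invoked by `~`
        return ToyColumn(f"(NOT {self.expr})")

    def __bool__(self):
        # `and`, `or` and `not` call this hook; they cannot be overloaded.
        raise ValueError("cannot convert a column expression to bool")


sqft = ToyColumn("sqft")
satisfaction = ToyColumn("customerSatisfaction")

both = (sqft <= 25000) & (satisfaction >= 30)
either = (sqft <= 25000) | (satisfaction >= 30)
negated = ~(sqft <= 25000)

try:
    (sqft <= 25000) and (satisfaction >= 30)  # keyword `and` needs a bool
    keyword_and_failed = False
except ValueError:
    keyword_and_failed = True

print(both.expr)
```

The same reasoning is why options A and B, which use the and/or keywords on col(...) expressions, are ruled out.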