Exam Certified Associate Developer for Apache Spark topic 1 question 153 discussion

Actual exam question from Databricks's Certified Associate Developer for Apache Spark

Question #: 153
Topic #: 1

[All Certified Associate Developer for Apache Spark Questions]

Which of the following code blocks returns a new DataFrame with column storeReview where the pattern "End" has been removed from the end of column storeReview in DataFrame storesDF?

A sample DataFrame storesDF is below:

A. storesDF.withColumn("storeReview", col("storeReview").regexp_replace(" End$", ""))
B. storesDF.withColumn("storeReview", regexp_replace(col("storeReview"), " End$", ""))
C. storesDF.withColumn("storeReview”, regexp_replace(col("storeReview"), " End$"))
D. storesDF.withColumn("storeReview", regexp_replace("storeReview", " End$", ""))
E. storesDF.withColumn("storeReview", regexp_extract(col("storeReview"), " End$", ""))

Show Suggested Answer

Suggested Answer: B 🗳️

by saryu at Feb. 2, 2024, 1:09 p.m.

Comments

Submit Cancel

max_manfred

9 months ago

the official documentation says pyspark.sql.functions.regexp_replace(str, pattern, replacement) Replace all substrings of the specified string value that match regexp with rep. so I think D is the only correct option

upvoted 1 times

...