exam questions

Exam DP-600 All Questions

View all questions & answers for the DP-600 exam

Exam DP-600 topic 1 question 113 discussion

Actual exam question from Microsoft's DP-600
Question #: 113
Topic #: 1
[All DP-600 Questions]

Note: This question is part of a series of questions that present the same scenario. Each question in the series contains a unique solution that might meet the stated goals. Some question sets might have more than one correct solution, while others might not have a correct solution.

After you answer a question in this section, you will NOT be able to return to it. As a result, these questions will not appear in the review screen.

You have a Fabric tenant that contains a new semantic model in OneLake.

You use a Fabric notebook to read the data into a Spark DataFrame.

You need to evaluate the data to calculate the min, max, mean, and standard deviation values for all the string and numeric columns.

Solution: You use the following PySpark expression:

df.describe().show()

Does this meet the goal?

  • A. Yes
  • B. No
Show Suggested Answer Hide Answer
Suggested Answer: A 🗳️

Comments

Chosen Answer:
This is a voting comment (?). It is better to Upvote an existing comment if you don't have anything to add.
Switch to a voting comment New
Martin_Nbg
Highly Voted 6 months, 3 weeks ago
I think A is correct https://learn.microsoft.com/en-us/dotnet/api/microsoft.spark.sql.dataframe.describe?view=spark-dotnet
upvoted 14 times
Gunstsings
6 months, 2 weeks ago
DataFrame.Describe = Computes basic statistics for numeric and string columns, including count, mean, stddev, min, and max. If no columns are given, this function computes statistics for all numerical or string columns.
upvoted 3 times
...
...
Gunstsings
Highly Voted 6 months, 2 weeks ago
Selected Answer: A
DataFrame.Describe = Computes basic statistics for numeric and string columns, including count, mean, stddev, min, and max. If no columns are given, this function computes statistics for all numerical or string columns.
upvoted 5 times
...
jackjack1
Most Recent 3 months, 1 week ago
Selected Answer: A
Describe as other mentioned shows all fields asked for. if we were to use summary, we would have to specify what fields we want as shown here: https://learn.microsoft.com/en-us/dotnet/api/microsoft.spark.sql.dataframe.summary?view=spark-dotnet
upvoted 1 times
...
nappi1
4 months ago
Selected Answer: A
df.describe().show() returns count, mean, stddev, min, max for all the columns (numeric and non numeric)
upvoted 1 times
...
2fe10ed
4 months, 2 weeks ago
Selected Answer: A
https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.DataFrame.describe.html
upvoted 2 times
...
maia01
4 months, 3 weeks ago
Selected Answer: B
describe: only numeric columns with limited summary summary: numeric and non-numeric, broader summary
upvoted 4 times
...
patricck
6 months ago
There's a difference in numeric and string values. It's applicable for the numeric values but not for the string values and the question mentions both data
upvoted 1 times
...
zeeneuser
6 months, 2 weeks ago
A is correct, confirming after trying the command from Notebook. Displays count as well in addition to min, max, mean, and standard deviation.
upvoted 4 times
...
moteruky
6 months, 3 weeks ago
It shows all stat for numeric values but shows only 3 stat for string(count,min and max, it doesnt account for mean and std)
upvoted 3 times
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...
exam
Someone Bought Contributor Access for:
SY0-701
London, 1 minute ago