Exam DP-100 topic 7 question 3 discussion

Actual exam question from Microsoft's DP-100
Question #: 3
Topic #: 8

You need to visually identify whether outliers exist in the Age column and quantify the outliers before the outliers are removed.
Which three Azure Machine Learning Studio modules should you use? Each correct answer presents part of the solution.
NOTE: Each correct selection is worth one point.

  • A. Create Scatterplot
  • B. Summarize Data
  • C. Clip Values
  • D. Replace Discrete Values
  • E. Build Counting Transform
Suggested Answer: ABC
B: For a global view of the data, use the Summarize Data module. Add the module and connect it to the dataset that needs to be visualized.
A: One quick way to identify outliers visually is to create scatter plots.
C: The easiest way to treat outliers in Azure ML is the Clip Values module, which can identify and optionally replace data values that are above or below a specified threshold. This is useful when you want to remove outliers or replace them with a mean, a constant, or another substitute value.
Reference:
https://blogs.msdn.microsoft.com/azuredev/2017/05/27/data-cleansing-tools-in-azure-machine-learning/
https://docs.microsoft.com/en-us/azure/machine-learning/studio-module-reference/clip-values
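
The summarize, quantify, then clip sequence described in the explanation can be sketched outside of Studio in plain Python. This is only an illustrative analogue, not the Studio modules themselves: the sample ages, the function names, and the 1.5 × IQR rule for choosing thresholds are all assumptions made for the example (in Clip Values you supply the thresholds yourself). The visual step (a scatter plot) is left out because it would need a plotting library.

```python
import statistics


def summarize(values):
    """Return basic summary statistics, loosely similar to a Summarize Data report."""
    q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")
    return {
        "count": len(values),
        "min": min(values),
        "max": max(values),
        "mean": statistics.mean(values),
        "median": median,
        "q1": q1,
        "q3": q3,
    }


def iqr_bounds(values, k=1.5):
    """Assumed thresholds: k * IQR below Q1 and above Q3 (a common rule of thumb)."""
    stats = summarize(values)
    iqr = stats["q3"] - stats["q1"]
    return stats["q1"] - k * iqr, stats["q3"] + k * iqr


def find_outliers(values, lower, upper):
    """Quantify outliers: list every value outside [lower, upper]."""
    return [v for v in values if v < lower or v > upper]


def clip(values, lower, upper):
    """Replace values outside [lower, upper] with the nearest threshold."""
    return [min(max(v, lower), upper) for v in values]


# Hypothetical Age column with one implausible value.
ages = [23, 25, 27, 29, 31, 33, 35, 37, 39, 120]

lower, upper = iqr_bounds(ages)
outliers = find_outliers(ages, lower, upper)
clipped = clip(ages, lower, upper)

print("summary:", summarize(ages))
print("thresholds:", lower, upper)
print("outlier count:", len(outliers), "values:", outliers)
print("clipped:", clipped)
```

The order matters here in the same way the question implies: the summary and the outlier count are taken from the original column, and clipping is applied only afterwards.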

Comments

concernedCitizen
Highly Voted 3 years, 3 months ago
Option C is a method to fix outliers, not visualize them
upvoted 13 times
Sjefen
1 year, 7 months ago
That's not a problem. The combination of the three answers is supposed to perform the required actions; no single option has to do everything.
upvoted 2 times
phdykd
Most Recent 8 months ago
The "Clip Values" module in Azure Machine Learning Studio is used to limit the values in a column to a specified range. This can be useful in cases where there are extreme values that need to be limited to a certain threshold. However, this module does not identify or quantify outliers in a dataset. Therefore, it would not be useful for the task of identifying and quantifying outliers in the "Age" column.
upvoted 1 times
phdykd
8 months ago
The three Azure Machine Learning Studio modules that should be used to visually identify and quantify outliers in the Age column before they are removed are:
  • A. Create Scatterplot: This module can be used to create a scatter plot of the data, which allows for visual identification of outliers.
  • B. Summarize Data: This module can be used to calculate basic statistics for the Age column, such as mean, median, standard deviation, and quartiles, which can help to identify outliers.
  • E. Build Counting Transform: This module can be used to create a frequency distribution of the Age column, which can help to identify outliers that occur with low frequency.
Therefore, the correct answers are A, B, and E. The modules C and D are not relevant for identifying and quantifying outliers in the Age column.
upvoted 2 times
BTAB
9 months, 2 weeks ago
Selected Answer: ABC
Each correct answer presents part of the solution. Therefore, the question asks to visualize before removing. Since A & B are visualizations, and C does the removing, all 3 answers are part of the solution. But we need to do A & B before we utilize C. Answer is correct. This brings up a good point with Microsoft tests. Make sure to understand sequencing questions vs. questions that say each answer PRESENTS part of the solution.
upvoted 1 times
TheCyanideLancer
1 year, 9 months ago
The given answer seems to be right. Below is the text from the documentation for the Clip Values module (https://docs.microsoft.com/en-us/previous-versions/azure/machine-learning/studio-module-reference/clip-values), under "Module overview": "This article describes how to use the Clip Values module in Machine Learning Studio (classic), to identify and optionally replace data values that are above or below a specified threshold. This is useful when you want to remove outliers or replace them with a mean, a constant, or other substitute value."
upvoted 1 times
RyanTsai
2 years, 1 month ago
ans: A,B,E
upvoted 4 times
bdsrca
2 years, 1 month ago
Create Scatterplot, Summarize Data, Build Counting Transform
upvoted 3 times
Lucario95
2 years, 5 months ago
The question says: "You need to visually identify whether outliers exist in the Age column and quantify the outliers BEFORE the outliers are removed." Thus C is part of the answer.
upvoted 2 times
hima618
3 years, 1 month ago
The question is only about visualization, so option C is incorrect.
upvoted 2 times
Laredo
2 years, 11 months ago
The solution is correct: (1) visually identify whether outliers exist in the Age column, and (2) quantify the outliers, before (3) the outliers are removed.
upvoted 8 times
kty
2 years, 7 months ago
I agree
upvoted 1 times
Dasist
2 years, 7 months ago
They ask to identify and visualize the outliers. Removing them is not asked. Therefore it's only A and B
upvoted 1 times
Dasist
2 years, 7 months ago
It seems it cannot be only A and B, as the question asks for THREE modules, not two. So I must agree with C. The question is ambiguous.
upvoted 3 times
Community vote distribution
A (35%)
C (25%)
B (20%)
Other