exam questions

Exam DP-100 All Questions

View all questions & answers for the DP-100 exam

Exam DP-100 topic 7 question 5 discussion

Actual exam question from Microsoft's DP-100
Question #: 5
Topic #: 8
[All DP-100 Questions]

HOTSPOT -
You need to configure the Edit Metadata module so that the structure of the datasets match.
Which configuration options should you select? To answer, select the appropriate options in the answer area.
NOTE: Each correct selection is worth one point.
Hot Area:

Show Suggested Answer Hide Answer
Suggested Answer:

Comments

Chosen Answer:
This is a voting comment (?). It is better to Upvote an existing comment if you don't have anything to add.
Switch to a voting comment New
Yong2020
Highly Voted 3 years, 5 months ago
MedianValue should be made uncategorical to be consistent as the original formats are in text and numeric.
upvoted 18 times
...
CharlesZ
Highly Voted 3 years, 2 months ago
I think the answer should be floating and make uncategorical, becuase it's a regression model and MedianValue is the target column. Uncategorical would make sense.
upvoted 9 times
...
phdykd
Most Recent 8 months ago
For the configuration options in the Edit Metadata module, you should select: Launch column selector: Integer Unchanged: This will ensure that the MedianValue column in both datasets is recognized as an integer type and is not modified.
upvoted 1 times
...
spaceykacey
1 year, 11 months ago
Should it not be 'Integer'? The value is in $1000s.
upvoted 3 times
Mckay_
1 year ago
good observation. I totally agree with you.
upvoted 1 times
...
...
prashantjoge
2 years, 4 months ago
if your source dataset has numbers handled as text, you must change them to a numeric data type before using math operations. The supported data types are String, Integer, Double, Boolean, and DateTime. Floating point and time span is not an option For example, you might have a column that contains the numbers 0, 1, and 2, but know that the numbers actually mean "Smoker," "Non-smoker," and "Unknown." In that case, by flagging the column as categorical you ensure that the values are used only to group data and not in numeric calculations. Since it is numeric the option should be either "unchanged" or "make uncategorical". The original data is text so it should be made uncategorical
upvoted 3 times
...
brendal89
2 years, 6 months ago
if you google "make uncategorical" + edit metadata, you only get references to this particular exam question... I'm not convinced that "make uncategorical" even exists.
upvoted 2 times
...
Dasist
2 years, 7 months ago
Integer and uncategorical as the MedianValue is written in 1000 (no decimal point) and it's a regression model so must be numeric.
upvoted 3 times
...
Neuron
2 years, 8 months ago
The table that shows types indicate MedianValue is in the $1000s. It's an integer. Where did Floating Point come from? Also, the Paris data must be noncategorical, too, like the London data.
upvoted 3 times
RyuHayabusa
2 years, 8 months ago
Read the text. One MedianValue is text and the other is numerical.
upvoted 1 times
...
YipingRuan
2 years, 3 months ago
in 1000 means you can have value 4.5 in the column ($4500)
upvoted 3 times
...
...
Abhinav_nasaiitkgp
2 years, 9 months ago
I don't think it is necessary to make it uncategorical as text data is not categorical in this case which we have to make uncategorical. Answer is correct.
upvoted 4 times
allanm
2 years, 5 months ago
What text data? The case study states that The MedianValue and AvgRoomsInHouse columns both hold data in numeric format.
upvoted 1 times
...
...
122120
2 years, 9 months ago
there is no floating point. Select the Data type option if you need to assign a different data type to the selected columns. You might need to change the data type for certain operations. For example, if your source dataset has numbers handled as text, you must change them to a numeric data type before using math operations. The supported data types are String, Integer, Double, Boolean, and DateTime. If you select multiple columns, you must apply the metadata changes to all selected columns. For example, let's say you choose two or three numeric columns. You can change them all to a string data type and rename them in one operation. However, you can't change one column to a string data type and another column from a float to an integer. If you don't specify a new data type, the column metadata is unchanged. The column type and values will change after you perform the Edit Metadata operation. You can recover the original data type at any time by using Edit Metadata to reset the column data type.
upvoted 7 times
...
aziti
2 years, 10 months ago
I dont think there is such a thing as make uncategorical. Since the string version of the MedianValue we had is already not categorical we do not need to switch the MedianValue, thus leaving it unchanged would leave us with a non categorical Median integer value..maybe
upvoted 2 times
...
Rickii
3 years, 3 months ago
The answer is Floating point, Make Categorical
upvoted 2 times
Indranee
2 years, 9 months ago
To make 'MedianValues' categorical, do you mean turning values into bins of 'MedianValues'?
upvoted 1 times
...
...
Alexandra
3 years, 3 months ago
as the Paris dataset needs to match London dataset types and London has numerical data types in MoedianValues columns, shouldn't the answer be 'Integer' and 'Make Uncategorical'?
upvoted 3 times
...
pepmir
3 years, 4 months ago
"You must ensure that the datatype of the MedianValue column of the Paris dataset matches the structure of the London dataset." If we run Summary on the data, it might end up as Categorical due to the nature of data. That said "Unchanged" might do it here.
upvoted 5 times
...
ajithvajrala
3 years, 5 months ago
yes, I too agree with Yong2020.
upvoted 1 times
...
ceni99
3 years, 5 months ago
Correct me if I'm wrong, but I believe the answer should be "Make Categorical" if you don't want the MedianValue column to be numerical calculated.
upvoted 1 times
kty
2 years, 7 months ago
An initial investigation shows that the datasets are identical in structure apart from the MedianValue column. The smaller Paris dataset contains the MedianValue in text format, whereas the larger London dataset contains the MedianValue in numerical format so the answer is : floating and uncategorical
upvoted 3 times
...
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...
exam
Someone Bought Contributor Access for:
SY0-701
London, 1 minute ago