exam questions

Exam Certified Machine Learning Professional All Questions

View all questions & answers for the Certified Machine Learning Professional exam

Exam Certified Machine Learning Professional topic 1 question 2 discussion

Actual exam question from Databricks's Certified Machine Learning Professional
Question #: 2
Topic #: 1
[All Certified Machine Learning Professional Questions]

A machine learning engineer is monitoring categorical input variables for a production machine learning application. The engineer believes that missing values are becoming more prevalent in more recent data for a particular value in one of the categorical input variables.
Which of the following tools can the machine learning engineer use to assess their theory?

  • A. Kolmogorov-Smirnov (KS) test
  • B. One-way Chi-squared Test
  • C. Two-way Chi-squared Test
  • D. Jenson-Shannon distance
  • E. None of these
Show Suggested Answer Hide Answer
Suggested Answer: B 🗳️

Comments

Chosen Answer:
This is a voting comment (?). It is better to Upvote an existing comment if you don't have anything to add.
Switch to a voting comment New
f582fe7
2 weeks, 5 days ago
Selected Answer: C
It's two-way b/c two-way doesn't rely on counts and, as long as the nulls are not evenly distributed over all categories (they are only in a single category in this scenario) it will catch this difference. A one-way is dependent on counts, so if there are nulls in one sample, it will see that as a different distribution when in fact it may not be (distribution could be the same as before, only with less counts b/c of nulls).
upvoted 1 times
...
fshb4ztm
3 months, 1 week ago
It seems like we want to determine if there's a statistical difference in one direction. Ex: between last month's data and new data, particularly observing more blanks for a specific categorical variable. It seems like a one-way Chi-squared Test is the most appropriate, what do you say?
upvoted 1 times
...
e12ec59
6 months ago
Selected Answer: B
As the question is about one input variable, so my answer would be B.
upvoted 1 times
...
ThoBustos
9 months ago
It seems like we want to determine if there's a statistical difference in one direction. Ex: between last month's data and new data, particularly observing more blanks for a specific categorical variable. It seems like a one-way Chi-squared Test is the most appropriate, what do you say?
upvoted 1 times
...
Alishahab70
11 months, 3 weeks ago
C. Two-way Chi-squared Test This tool can help the engineer determine if there is a statistically significant association between the time period and the presence of missing values in the categorical variable.
upvoted 1 times
...
ldoyle3332
1 year ago
Would the answer not be C? One-Way Chi-Squared Tests will pick up changes in the overall count of null values as a shift in distribution, even if the nulls are distributed evenly among the category values. The question specifies that they are looking for a shift in a particular class value distribution, which would be better for Two-Way Chi-Squared Test
upvoted 1 times
...
hugodscarvalho
1 year ago
Selected Answer: B
Since it's just one categorical input variable over time, the one-way Chi-squared test would be the more appropriate choice.
upvoted 1 times
...
BokNinja
1 year, 1 month ago
The correct answer is B. One-way Chi-squared Test. The Chi-squared test is a statistical hypothesis test that is used to determine whether there is a significant association between two categorical variables. In this case, the two variables could be the presence (or absence) of a value and the time period (old data vs. new data).
upvoted 1 times
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...
exam
Someone Bought Contributor Access for:
SY0-701
London, 1 minute ago