exam questions

Exam AWS Certified Machine Learning - Specialty All Questions

View all questions & answers for the AWS Certified Machine Learning - Specialty exam

Exam AWS Certified Machine Learning - Specialty topic 1 question 339 discussion

A banking company provides financial products to customers around the world. A machine learning (ML) specialist collected transaction data from internal customers. The ML specialist split the dataset into training, testing, and validation datasets. The ML specialist analyzed the training dataset by using Amazon SageMaker Clarify. The analysis found that the training dataset contained fewer examples of customers in the 40 to 55 year-old age group compared to the other age groups.

Which type of pretraining bias did the ML specialist observe in the training dataset?

  • A. Difference in proportions of labels (DPL)
  • B. Class imbalance (CI)
  • C. Conditional demographic disparity (CDD)
  • D. Kolmogorov-Smirnov (KS)
Show Suggested Answer Hide Answer
Suggested Answer: B 🗳️

Comments

Chosen Answer:
This is a voting comment (?). It is better to Upvote an existing comment if you don't have anything to add.
Switch to a voting comment New
MultiCloudIronMan
6 months ago
Selected Answer: B
Class Imbalance (CI): This occurs when certain classes (in this case, age groups) are underrepresented in the dataset. This can lead to biased model predictions because the model may not learn enough about the underrepresented class to make accurate predictions1.
upvoted 1 times
...
MultiCloudIronMan
7 months ago
Selected Answer: B
B is correct because.
upvoted 1 times
...
Tkhan1
7 months, 1 week ago
Selected Answer: B
B is the correct option
upvoted 1 times
...
Shivanshub
7 months, 3 weeks ago
Selected Answer: B
The type of pretraining bias observed in the training dataset, where there are fewer examples of customers in the 40 to 55 year-old age group compared to the other age groups, is: B. Class imbalance (CI) Explanation: Class imbalance (CI) refers to a situation where certain classes or groups are underrepresented in the dataset. In this case, the age group 40 to 55 is underrepresented compared to other age groups. Difference in proportions of labels (DPL) generally refers to differences in the proportions of different labels (outcomes) rather than input features like age. Conditional demographic disparity (CDD) refers to differences in outcomes for different demographic groups conditional on certain factors, not the raw distribution of demographic features. Kolmogorov-Smirnov (KS) is a statistical test used to compare distributions, but it is not specifically a type of bias. Therefore, the correct answer is B. Class imbalance (CI).
upvoted 2 times
...
aragon_saa
7 months, 3 weeks ago
Selected Answer: C
Answer is C
upvoted 1 times
...
GS_77
7 months, 3 weeks ago
Selected Answer: B
Class imbalance can lead to biased models that perform poorly on the underrepresented class or group, as the model may not have enough examples to learn the patterns and characteristics of that class effectively.
upvoted 1 times
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...
exam
Someone Bought Contributor Access for:
SY0-701
London, 1 minute ago