exam questions

Exam AI-900 All Questions

View all questions & answers for the AI-900 exam

Exam AI-900 topic 1 question 161 discussion

Actual exam question from Microsoft's AI-900
Question #: 161
Topic #: 1
[All AI-900 Questions]

You are developing a system to predict the prices of insurance for drivers in the United Kingdom.

You need to minimize bias in the system.

What should you do?

  • A. Remove information about protected characteristics from the data before sampling.
  • B. Take a training sample that is representative of the population in the United Kingdom.
  • C. Create a training dataset that uses data from global insurers.
  • D. Take a completely random training sample.
Show Suggested Answer Hide Answer
Suggested Answer: B 🗳️

Comments

Chosen Answer:
This is a voting comment (?). It is better to Upvote an existing comment if you don't have anything to add.
Switch to a voting comment New
redsoxfred
Highly Voted 8 months, 2 weeks ago
Selected Answer: B
B. Take a training sample that is representative of the population in the United Kingdom. To minimize bias in the system, it's important that your training data is representative of the population you're modeling. This helps ensure that the model's predictions are valid for the full range of drivers in the United Kingdom. While option A (Remove information about protected characteristics from the data before sampling) could help in some cases to reduce direct discrimination, it might not be sufficient to minimize all types of biases, as some of these characteristics might be indirectly encoded in the remaining features. Option C (Create a training dataset that uses data from global insurers) may introduce more bias since driving conditions, laws, and demographics vary greatly by country. Option D (Take a completely random training sample) could still introduce bias if the original data pool is not representative of the population you're interested in.
upvoted 9 times
...
rdemontis
Highly Voted 9 months ago
Selected Answer: B
I think correct answer is B. To minimize bias in the system, it is important to ensure that the training sample used is representative of the population in the United Kingdom. By including a diverse range of data that accurately reflects the population, you can reduce bias that may arise from underrepresentation or overrepresentation of certain groups. This approach helps to ensure fairness and avoids discriminatory practices in the prediction of insurance prices. Removing information about protected characteristics (option A) may help in some cases, but it is not sufficient on its own to address bias. Options C and D do not specifically address the need for representative sampling. (Chat GPT)
upvoted 5 times
...
Certification_Champs
Most Recent 4 months, 2 weeks ago
A. Remove information about protected characteristics from the data before sampling. To minimize bias in a system for predicting insurance prices, it is important to remove information about protected characteristics (e.g., race, gender, ethnicity) from the data before sampling. This helps prevent the model from learning and reinforcing biases based on these characteristics, which is essential for fairness in pricing and avoiding discrimination. Options B, C, and D do not directly address the issue of mitigating bias in the data and model.
upvoted 5 times
PeteColag
3 months, 2 weeks ago
This approach does not address indirect bias, where other variables might still be correlated with protected characteristics. Also, it might overlook the need for the model to be fair across different groups.
upvoted 1 times
...
...
Dip_ml_2023
6 months, 3 weeks ago
By bing bot - This is a multiple-choice question. The correct answer is B. Take a training sample that is representative of the population in the United Kingdom. This option ensures that the system learns from a diverse and balanced data set that reflects the target population. Removing information about protected characteristics (A) may not eliminate bias, as there may be other variables that are correlated with them. Using data from global insurers © may introduce irrelevant or misleading factors that are not applicable to the UK market. Taking a completely random training sample (D) may result in an unrepresentative or skewed data set that does not capture the variation in the population.
upvoted 1 times
...
DarthMengon
7 months, 1 week ago
Selected Answer: B
its exactly like the comment from redsoxfred
upvoted 3 times
...
alexein74
9 months, 3 weeks ago
Selected Answer: B
B. Take a training sample that is representative of the population in the United Kingdom.
upvoted 4 times
...
Ktroy0005
9 months, 3 weeks ago
maybe the uns is C because it says to 'reduce' bias?
upvoted 1 times
...
ctasantos
10 months, 1 week ago
Selected Answer: B
I vote for B. As suggested by other person, countries have different issues/behaviours etc
upvoted 4 times
...
master_yoda
10 months, 1 week ago
Selected Answer: B
I think is B. Global insurers can introduce different prices based on other countries issues/behaviors/security/environment, even with values normalized.
upvoted 4 times
...
Murtuza
10 months, 3 weeks ago
I think its B also
upvoted 2 times
...
XtraWest
11 months ago
i think its B (training data is representative of the population of UK)
upvoted 2 times
XtraWest
11 months ago
C also looks correct so idk
upvoted 1 times
...
...
jits1984
11 months ago
should be B
upvoted 1 times
...
Murtuza
11 months ago
do we know what the correct answer is? I am inclined to think B would be the best choice
upvoted 1 times
...
shanidad
11 months ago
Hmmm, I would have thought B, maybe D
upvoted 1 times
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...
exam
Someone Bought Contributor Access for:
SY0-701
London, 1 minute ago