Welcome to ExamTopics
ExamTopics Logo
- Expert Verified, Online, Free.
exam questions

Exam Professional Machine Learning Engineer All Questions

View all questions & answers for the Professional Machine Learning Engineer exam

Exam Professional Machine Learning Engineer topic 1 question 90 discussion

Actual exam question from Google's Professional Machine Learning Engineer
Question #: 90
Topic #: 1
[All Professional Machine Learning Engineer Questions]

You are working on a binary classification ML algorithm that detects whether an image of a classified scanned document contains a company’s logo. In the dataset, 96% of examples don’t have the logo, so the dataset is very skewed. Which metrics would give you the most confidence in your model?

  • A. F-score where recall is weighed more than precision
  • B. RMSE
  • C. F1 score
  • D. F-score where precision is weighed more than recall
Show Suggested Answer Hide Answer
Suggested Answer: A 🗳️

Comments

Chosen Answer:
This is a voting comment (?) , you can switch to a simple comment.
Switch to a voting comment New
tavva_prudhvi
Highly Voted 1 year, 8 months ago
Selected Answer: A
In this scenario, the dataset is highly imbalanced, where most of the examples do not have the company's logo. Therefore, accuracy could be misleading as the model can have high accuracy by simply predicting that all images do not have the logo. F1 score is a good metric to consider in such cases, as it takes both precision and recall into account. However, since the dataset is highly skewed, we should weigh recall more than precision to ensure that the model is correctly identifying the images that do have the logo. Therefore, F-score where recall is weighed more than precision is the best metric to evaluate the performance of the model in this scenario. Option B (RMSE) is not applicable to this classification problem, and option D (F-score where precision is weighed more than recall) is not suitable for highly skewed datasets.
upvoted 11 times
...
jkkim_jt
Most Recent 1 month ago
Selected Answer: C
F1-score: Harmonic mean of precision and recall. It ensures that both fasle positive and false negatives are considerd. F1- focusing on recall may be useful if missing a logo is more costly than incorrectly identifying one.
upvoted 1 times
...
gscharly
7 months, 1 week ago
Selected Answer: C
I'd go with C. We don't know which option (less FP or less FN) is most important for business with the provided information, so we should seek a balance.
upvoted 2 times
...
etienne0
8 months, 3 weeks ago
Selected Answer: D
I think it's D.
upvoted 1 times
...
guilhermebutzke
10 months ago
Selected Answer: D
I think it could be D, but the question does not provide enough information for this. I have this feeling: If 4% have the logo, we are looking just for these ones, right? So, the 'quality of TP,' that's it, the precision, could be more interesting because we want a model that we can rely on. So, when this model Predict a image with logo, we`ll be more certain about it. If we use recall, for example, a model with 99% recall has more chance of getting the logo, but we won't have quality in this. This model could suggest a lot of images without logo. It is better to use any ML than this...
upvoted 3 times
...
pico
1 year ago
Selected Answer: C
both option A (F-score with higher weight on recall) and option C (F1 score) could be suitable depending on the specific priorities and requirements of your classification problem. If missing a company's logo is considered more problematic than having false alarms, then option A might be preferred. The F1 score (option C) is a balanced measure that considers both precision and recall, which is generally a good choice in imbalanced datasets. Ultimately, the choice between option A and option C depends on the specific goals and constraints of your application.
upvoted 2 times
...
Mickey321
1 year ago
Selected Answer: C
The question not have clear preference for recall or precision hence going with C
upvoted 3 times
...
Jarek7
1 year, 5 months ago
Selected Answer: C
Yeah, I know - everyone is voting A... To be honest I still don't understand why are you more affraid of these few FNs than FPs. In my opinion they are exactly same evil. Every documantation says that F1 is great on skewed data. You should use weighted F1 when you know what is worse for you FNs or FPs. In this case we have no any hints on it, so I would stay with ordinary F1.
upvoted 4 times
...
Voyager2
1 year, 5 months ago
Selected Answer: A
A. F-score where recall is weighed more than precision Even a model which always says that don't have the logo will have a good precision because is the most common. What we need is improve recall.
upvoted 2 times
...
M25
1 year, 6 months ago
Selected Answer: A
Went with A
upvoted 2 times
...
guilhermebutzke
1 year, 9 months ago
Selected Answer: A
I think is A. The positive Class is the minority. So, it's more important to correctly detect logos in all images that have logo (recall) than correctly detect logos in images classified with logos (precision).
upvoted 3 times
...
enghabeth
1 year, 9 months ago
Selected Answer: A
I think is D becouse u try detect TP then it's more important recall than precision
upvoted 3 times
...
ares81
1 year, 10 months ago
Selected Answer: A
Answer A is my choice.
upvoted 1 times
...
Abhijat
1 year, 11 months ago
A is correct
upvoted 1 times
...
Dataspire
1 year, 11 months ago
Selected Answer: A
less logo images. Recall should be weighted more
upvoted 3 times
...
kn29
1 year, 11 months ago
I think A. If D were the answer, the threshold would be set higher to increase PRECISION, but the low percentage of positives (4%) would allow RECALL to be extremely low. If the percentage of positives is low, greater weight should be given to RECALL. https://medium.com/@douglaspsteen/beyond-the-f-1-score-a-look-at-the-f-beta-score-3743ac2ef6e3
upvoted 4 times
...
egdiaa
1 year, 11 months ago
Answer C: F1-Score is the best for imbalanced Data like this case: https://stephenallwright.com/imbalanced-data-metric/
upvoted 4 times
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...