exam questions

Exam Professional Machine Learning Engineer All Questions

View all questions & answers for the Professional Machine Learning Engineer exam

Exam Professional Machine Learning Engineer topic 1 question 94 discussion

Actual exam question from Google's Professional Machine Learning Engineer
Question #: 94
Topic #: 1
[All Professional Machine Learning Engineer Questions]

You work on an operations team at an international company that manages a large fleet of on-premises servers located in few data centers around the world. Your team collects monitoring data from the servers, including CPU/memory consumption. When an incident occurs on a server, your team is responsible for fixing it. Incident data has not been properly labeled yet. Your management team wants you to build a predictive maintenance solution that uses monitoring data from the VMs to detect potential failures and then alerts the service desk team. What should you do first?

  • A. Train a time-series model to predict the machines’ performance values. Configure an alert if a machine’s actual performance values significantly differ from the predicted performance values.
  • B. Implement a simple heuristic (e.g., based on z-score) to label the machines’ historical performance data. Train a model to predict anomalies based on this labeled dataset.
  • C. Develop a simple heuristic (e.g., based on z-score) to label the machines’ historical performance data. Test this heuristic in a production environment.
  • D. Hire a team of qualified analysts to review and label the machines’ historical performance data. Train a model based on this manually labeled dataset.
Show Suggested Answer Hide Answer
Suggested Answer: C 🗳️

Comments

Chosen Answer:
This is a voting comment (?). It is better to Upvote an existing comment if you don't have anything to add.
Switch to a voting comment New
mil_spyro
Highly Voted 2 years, 4 months ago
Selected Answer: C
I would go for C, it is important to have a clear understanding of what constitutes a potential failure and how to detect it. A heuristic based on z-scores, for example, can be used to flag instances where the performance values of a machine significantly differ from its historical baseline.
upvoted 9 times
...
mouthwash
Most Recent 3 months, 4 weeks ago
Selected Answer: B
Since when is developing and testing allowed in prod? Answer is B
upvoted 1 times
...
rajshiv
4 months, 3 weeks ago
Selected Answer: B
Option B - because it allows you to efficiently label the data using a heuristic approach (e.g., z-score), and then train an anomaly detection model on that labeled data.
upvoted 1 times
...
Pau1234
4 months, 3 weeks ago
Selected Answer: B
c -> no testing in prod. could lead to risks. Hence B.
upvoted 1 times
...
AB_C
5 months ago
Selected Answer: B
Why not C - Heuristic in production without a model: Directly deploying a heuristic in a production environment without testing it within a model can lead to many false positives and alert fatigue for the service desk team.
upvoted 1 times
...
pico
1 year, 7 months ago
Selected Answer: B
NOT C: when you have tested something directly in production?? Option B involves labeling historical data using heuristics, which can be a practical and quick way to get started.
upvoted 4 times
...
razmik
1 year, 10 months ago
Selected Answer: C
Vote for C Reference: Rule #1: Don’t be afraid to launch a product without machine learning. https://developers.google.com/machine-learning/guides/rules-of-ml#before_machine_learning
upvoted 1 times
...
julliet
1 year, 10 months ago
Selected Answer: C
simple solution goes first, more sophisticated one -- after
upvoted 2 times
...
M25
1 year, 11 months ago
Selected Answer: C
Went with C
upvoted 2 times
...
TNT87
2 years ago
Answer C Same as Question number 139
upvoted 2 times
...
studybrew
2 years, 1 month ago
What’s the difference between B and C?
upvoted 3 times
julliet
1 year, 11 months ago
in B you are labeling with heuristics and still develop a model in C you follow the ML-rules to adopt simple solution first and later decide if, how and where you need more sophisticated model
upvoted 2 times
...
...
tavva_prudhvi
2 years, 1 month ago
Selected Answer: C
This is the best option for this scenario because it's quick and inexpensive, and it can provide a baseline for labeling the historical performance data. Once we have labeled data, we can train a predictive maintenance model to detect potential failures and alert the service desk team.
upvoted 1 times
...
osaka_monkey
2 years, 1 month ago
why not D ?
upvoted 1 times
tavva_prudhvi
2 years, 1 month ago
While this approach may result in accurate labeling of the historical performance data, it can be time-consuming and expensive.
upvoted 1 times
...
...
John_Pongthorn
2 years, 2 months ago
Selected Answer: C
https://www.geeksforgeeks.org/z-score-for-outlier-detection-python/
upvoted 1 times
...
hiromi
2 years, 4 months ago
Selected Answer: B
I vote for B - https://developers.google.com/machine-learning/guides/rules-of-ml
upvoted 3 times
hiromi
2 years, 4 months ago
Sorry, I think C is the answer
upvoted 4 times
jamesking1103
2 years, 3 months ago
C. we need detect potential failures
upvoted 1 times
guilhermebutzke
2 years, 2 months ago
Why not B? The team wants to create a model to predict failure. So, the z-score is used to label the failure scenario, for then to use this to build a prediction model.
upvoted 2 times
tavva_prudhvi
2 years, 1 month ago
While this approach may work in some cases, it's not guaranteed to work well in this scenario because we don't know the nature of the anomalies that we want to detect. Therefore, it may be difficult to come up with a heuristic that can accurately label the historical performance data.
upvoted 2 times
evanfebrianto
1 year, 11 months ago
But testing the heuristic in a production environment without training a model could be risky and lead to false alarms or misses.
upvoted 1 times
...
...
...
...
...
...
ares81
2 years, 4 months ago
Selected Answer: A
This is really tricky, but it could be A.
upvoted 2 times
ares81
2 years, 3 months ago
Thinking about it, it should be C.
upvoted 1 times
...
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...
exam
Someone Bought Contributor Access for:
SY0-701
London, 1 minute ago