Welcome to ExamTopics
ExamTopics Logo
- Expert Verified, Online, Free.
exam questions

Exam Professional Machine Learning Engineer All Questions

View all questions & answers for the Professional Machine Learning Engineer exam

Exam Professional Machine Learning Engineer topic 1 question 170 discussion

Actual exam question from Google's Professional Machine Learning Engineer
Question #: 170
Topic #: 1
[All Professional Machine Learning Engineer Questions]

You need to deploy a scikit-leam classification model to production. The model must be able to serve requests 24/7, and you expect millions of requests per second to the production application from 8 am to 7 pm. You need to minimize the cost of deployment. What should you do?

  • A. Deploy an online Vertex AI prediction endpoint. Set the max replica count to 1
  • B. Deploy an online Vertex AI prediction endpoint. Set the max replica count to 100
  • C. Deploy an online Vertex AI prediction endpoint with one GPU per replica. Set the max replica count to 1
  • D. Deploy an online Vertex AI prediction endpoint with one GPU per replica. Set the max replica count to 100
Show Suggested Answer Hide Answer
Suggested Answer: B 🗳️

Comments

Chosen Answer:
This is a voting comment (?). It is better to Upvote an existing comment if you don't have anything to add.
Switch to a voting comment New
pikachu007
Highly Voted 10 months, 3 weeks ago
Selected Answer: B
B. Deploy an online Vertex AI prediction endpoint. Set the max replica count to 100: This option provides a higher number of replicas (100) to handle the expected high volume of requests during peak hours. While it might result in increased costs, it provides the necessary scalability to manage the incoming traffic efficiently. During non-peak hours, you can consider scaling down the replicas to reduce costs, as Vertex AI allows dynamic scaling based on demand.
upvoted 5 times
...
AzureDP900
Most Recent 5 months, 1 week ago
Option A (Deploying an online Vertex AI prediction endpoint. Set the max replica count to 1) is still a good choice for minimizing costs. By setting the max replica count to 1, you are allowing Vertex AI to scale up or down based on load, which means that during off-peak hours, you won't be paying for unnecessary instances.
upvoted 1 times
...
pinimichele01
7 months, 2 weeks ago
Selected Answer: B
see pikachu007
upvoted 1 times
...
36bdc1e
10 months, 2 weeks ago
B we don't need GPU for scikit-learn
upvoted 2 times
...
BlehMaks
10 months, 2 weeks ago
Selected Answer: B
scikit-learn doesn't support GPU https://scikit-learn.org/stable/faq.html#will-you-add-gpu-support
upvoted 2 times
...
b1a8fae
10 months, 3 weeks ago
B. scikit-learn -> no need for GPU max number of replicas -> 1 is too little if we are serving online predictions at such a massive scale (millions per second)
upvoted 1 times
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...