
Exam AWS Certified Machine Learning - Specialty topic 1 question 230 discussion

An analytics company has an Amazon SageMaker hosted endpoint for an image classification model. The model is a custom-built convolutional neural network (CNN) and uses the PyTorch deep learning framework. The company wants to increase throughput and decrease latency for customers that use the model.

Which solution will meet these requirements MOST cost-effectively?

  • A. Use Amazon Elastic Inference on the SageMaker hosted endpoint.
  • B. Retrain the CNN with more layers and a larger dataset.
  • C. Retrain the CNN with more layers and a smaller dataset.
  • D. Choose a SageMaker instance type that has multiple GPUs.
Suggested Answer: A

Comments

akgarg00
4 months, 3 weeks ago
Just additional information: Elastic Inference is being deprecated, and the recommendation is to use AWS Inferentia instead.
upvoted 4 times
...
Mickey321
8 months, 1 week ago
Selected Answer: A
Option A meets the requirements most cost-effectively because it lets you choose the instance type best suited to the overall compute and memory needs of your application, and then separately specify the amount of inference acceleration you need. This can reduce inference costs by up to 75% because you no longer need to over-provision GPU compute for inference.
upvoted 2 times
...
kaike_reis
8 months, 1 week ago
Selected Answer: A
We want to improve the model's inference performance. Options B and C do not address that. Option D would solve it, but at a much higher cost. Option A is correct, as it solves the problem at the lowest possible cost.
upvoted 2 times
...
oso0348
1 year, 1 month ago
Selected Answer: A
Using Amazon Elastic Inference on the SageMaker hosted endpoint would be the most cost-effective way to increase throughput and decrease latency. Amazon Elastic Inference is a service that lets you attach GPU-powered inference acceleration to SageMaker hosted endpoints and EC2 instances. By attaching an Elastic Inference accelerator to the SageMaker endpoint, you can achieve better performance at lower cost than using a larger, more expensive instance type.
upvoted 2 times
...
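The attachment the comments above describe can be sketched with the SageMaker Python SDK: deploy the PyTorch model onto a modest CPU instance and attach an Elastic Inference accelerator via `accelerator_type`, rather than provisioning a full GPU instance. This is a minimal, non-runnable sketch; the S3 path, role ARN, and entry-point script are placeholders, and (as noted above) Elastic Inference is deprecated in favor of AWS Inferentia (ml.inf1/ml.inf2 instances).

```python
# Hedged sketch: attaching an Elastic Inference accelerator to a
# SageMaker-hosted PyTorch endpoint. All resource names are placeholders.
from sagemaker.pytorch import PyTorchModel

model = PyTorchModel(
    model_data="s3://my-bucket/model.tar.gz",              # placeholder artifact
    role="arn:aws:iam::123456789012:role/SageMakerRole",   # placeholder role
    entry_point="inference.py",                            # placeholder script
    framework_version="1.5.1",  # EI supports only certain PyTorch versions
    py_version="py3",
)

# A modest CPU instance plus an EI accelerator, instead of a GPU instance:
predictor = model.deploy(
    initial_instance_count=1,
    instance_type="ml.m5.large",
    accelerator_type="ml.eia2.medium",  # the Elastic Inference attachment
)
```

The design point is the decoupling: instance type covers the application's general compute and memory needs, while the accelerator is sized independently to the inference workload, which is what makes option A cheaper than option D's multi-GPU instance.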
sevosevo
1 year, 1 month ago
Selected Answer: A
"Cost-efficient," therefore A, based on slide 20: https://pages.awscloud.com/rs/112-TZM-766/images/AL-ML%20for%20Startups%20-%20Select%20the%20Right%20ML%20Instance.pdf
upvoted 2 times
...
Community vote distribution: A (35%), C (25%), B (20%), Other