Welcome to ExamTopics


Exam AWS Certified Machine Learning - Specialty topic 1 question 336 discussion

A media company wants to deploy a machine learning (ML) model that uses Amazon SageMaker to recommend new articles to the company’s readers. The company's readers are primarily located in a single city.

The company notices that the heaviest reader traffic predictably occurs early in the morning, after lunch, and again after work hours. There is very little traffic at other times of day. The media company needs to minimize the time required to deliver recommendations to its readers. The expected amount of data that the API call will return for inference is less than 4 MB.

Which solution will meet these requirements in the MOST cost-effective way?

  • A. Real-time inference with auto scaling
  • B. Serverless inference with provisioned concurrency
  • C. Asynchronous inference
  • D. A batch transform task
Suggested Answer: B 🗳️

Comments

MultiCloudIronMan
2 weeks, 2 days ago
Selected Answer: B
Best of both worlds, elastic and provisioned.
upvoted 1 times
...
VerRi
3 weeks, 5 days ago
Selected Answer: B
A is more expensive.
upvoted 1 times
...
Tkhan1
1 month, 3 weeks ago
Selected Answer: B
On-demand Serverless Inference is ideal for workloads that have idle periods between traffic spurts. Optionally, you can also use Provisioned Concurrency with Serverless Inference, which is a cost-effective option when you have predictable bursts in your traffic. https://docs.aws.amazon.com/sagemaker/latest/dg/serverless-endpoints.html
upvoted 2 times
...
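The option the suggested answer describes maps to the `ServerlessConfig` block of SageMaker's `CreateEndpointConfig` API, which accepts a `ProvisionedConcurrency` field alongside `MemorySizeInMB` and `MaxConcurrency`. Below is a minimal sketch of building that production variant; the model name, memory size, and concurrency values are illustrative assumptions, not values from the question.

```python
# Sketch of the endpoint configuration that answer B implies.
# "news-recsys-model" and the sizing numbers are hypothetical; the
# ServerlessConfig field names come from the SageMaker
# CreateEndpointConfig API (ProductionVariants -> ServerlessConfig).

def serverless_variant(model_name: str,
                       memory_mb: int = 4096,
                       max_concurrency: int = 20,
                       provisioned_concurrency: int = 5) -> dict:
    """Build one production variant for a serverless endpoint.

    ProvisionedConcurrency keeps that many workers warm, which suits
    the predictable morning / lunch / after-work spikes; outside those
    windows the endpoint scales to zero and you pay per request.
    """
    return {
        "VariantName": "AllTraffic",
        "ModelName": model_name,
        "ServerlessConfig": {
            # Allowed memory sizes are 1024-6144 MB in 1 GB steps;
            # the question's <4 MB responses fit the serverless
            # inference payload limit.
            "MemorySizeInMB": memory_mb,
            "MaxConcurrency": max_concurrency,
            # Warm capacity reserved for the predictable peaks.
            "ProvisionedConcurrency": provisioned_concurrency,
        },
    }

variant = serverless_variant("news-recsys-model")
print(variant["ServerlessConfig"]["ProvisionedConcurrency"])  # 5

# With boto3 this dict would be passed as:
#   boto3.client("sagemaker").create_endpoint_config(
#       EndpointConfigName="news-recsys-config",
#       ProductionVariants=[variant],
#   )
```

Note that `ProvisionedConcurrency` must be less than or equal to `MaxConcurrency`; provisioned capacity is billed while configured, so it trades some idle cost for low latency at the predictable peaks.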
luccabastos
2 months ago
Selected Answer: A
The traffic pattern is predictable, and provisioned resources have minimal cost.
upvoted 1 times
...
GS_77
2 months, 1 week ago
Selected Answer: B
By choosing serverless inference with provisioned concurrency, the media company gets low latency during peak traffic periods while optimizing costs by paying only for the actual inference requests.
upvoted 1 times
...
Community vote distribution: A (35%), C (25%), B (20%), Other
