Exam AWS Certified Machine Learning Engineer - Associate MLA-C01 topic 1 question 91 discussion

Exam question from Amazon's AWS Certified Machine Learning Engineer - Associate MLA-C01

Question #: 91
Topic #: 1

A company runs Amazon SageMaker ML models that use accelerated instances. The models require real-time responses. Each model has different scaling requirements. The company must not allow a cold start for the models.

Which solution will meet these requirements?

A. Create a SageMaker Serverless Inference endpoint for each model. Use provisioned concurrency for the endpoints.
B. Create a SageMaker Asynchronous Inference endpoint for each model. Create an auto scaling policy for each endpoint.
C. Create a SageMaker endpoint. Create an inference component for each model. In the inference component settings, specify the newly created endpoint. Create an auto scaling policy for each inference component. Set the parameter for the minimum number of copies to at least 1.
D. Create an Amazon S3 bucket. Store all the model artifacts in the S3 bucket. Create a SageMaker multi-model endpoint. Point the endpoint to the S3 bucket. Create an auto scaling policy for the endpoint. Set the parameter for the minimum number of copies to at least 1.

Suggested Answer: C

by ygn4ei at March 20, 2025, 3:26 p.m.

Comments

eesa

1 month ago

Selected Answer: C

✅ Explanation

Requirements recap:

- Real-time inference: needs low-latency predictions.
- Accelerated instances: likely GPU-backed, so inefficient scaling is costly.
- No cold starts: the models must always be loaded and ready to respond.
- Different scaling needs per model: each model must be able to scale independently.

✅ Why option C is correct

Inference components are a SageMaker feature that allows:

- Hosting multiple models on a single endpoint.
- Scaling each model (component) independently.
- Avoiding cold starts by setting a minimum number of copies.

Setting the minimum number of copies to at least 1 keeps each model loaded at all times, which eliminates cold starts. This solution meets all of the requirements.

upvoted 2 times
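
To make the explanation above concrete, here is a minimal sketch of the request payloads that option C implies: one inference component per model on a shared endpoint, plus a scalable target and a target-tracking policy per component. The helper function names, the example resource names, and the numeric values (memory, accelerator count, maximum copies, target value) are illustrative assumptions, not values taken from the question. The dictionary keys are intended to match the SageMaker `create_inference_component` and Application Auto Scaling `register_scalable_target` / `put_scaling_policy` parameters, but check them against the current AWS documentation before relying on them.

```python
# Sketch only: builds request dictionaries; it does not call AWS.
# All names and numbers below are hypothetical placeholders.

SCALABLE_DIMENSION = "sagemaker:inference-component:DesiredCopyCount"


def inference_component_request(endpoint_name, variant_name, model_name,
                                component_name, accelerators=1,
                                memory_mb=4096, initial_copies=1):
    """Parameters for attaching one model to an existing endpoint."""
    return {
        "InferenceComponentName": component_name,
        "EndpointName": endpoint_name,
        "VariantName": variant_name,
        "Specification": {
            "ModelName": model_name,
            "ComputeResourceRequirements": {
                "NumberOfAcceleratorDevicesRequired": accelerators,
                "MinMemoryRequiredInMb": memory_mb,
            },
        },
        # Start with at least one loaded copy of the model.
        "RuntimeConfig": {"CopyCount": initial_copies},
    }


def scalable_target_request(component_name, min_copies=1, max_copies=4):
    """Parameters for registering the component's copy count for scaling."""
    if min_copies < 1:
        # Allowing zero copies would permit scale-to-zero, i.e. a cold start.
        raise ValueError("min_copies must be at least 1 to avoid cold starts")
    return {
        "ServiceNamespace": "sagemaker",
        "ResourceId": f"inference-component/{component_name}",
        "ScalableDimension": SCALABLE_DIMENSION,
        "MinCapacity": min_copies,
        "MaxCapacity": max_copies,
    }


def scaling_policy_request(component_name, target_invocations_per_copy):
    """Parameters for a per-component target-tracking scaling policy."""
    return {
        "PolicyName": f"{component_name}-target-tracking",
        "ServiceNamespace": "sagemaker",
        "ResourceId": f"inference-component/{component_name}",
        "ScalableDimension": SCALABLE_DIMENSION,
        "PolicyType": "TargetTrackingScaling",
        "TargetTrackingScalingPolicyConfiguration": {
            "PredefinedMetricSpecification": {
                "PredefinedMetricType":
                    "SageMakerInferenceComponentInvocationsPerCopy",
            },
            "TargetValue": float(target_invocations_per_copy),
        },
    }
```

With boto3 installed and credentials configured, each dictionary could be passed as keyword arguments, for example `boto3.client("sagemaker").create_inference_component(**req)` for the first helper and `boto3.client("application-autoscaling").register_scalable_target(**req)` or `.put_scaling_policy(**req)` for the other two. Because every component gets its own policy, each model can use a different maximum and target value while sharing the same endpoint.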

ygn4ei

1 month ago

Selected Answer: A

this is correct

upvoted 1 time
