exam questions

Exam AWS Certified Machine Learning Engineer - Associate MLA-C01 All Questions

View all questions & answers for the AWS Certified Machine Learning Engineer - Associate MLA-C01 exam

Exam AWS Certified Machine Learning Engineer - Associate MLA-C01 topic 1 question 91 discussion

A company runs Amazon SageMaker ML models that use accelerated instances. The models require real-time responses. Each model has different scaling requirements. The company must not allow a cold start for the models.

Which solution will meet these requirements?

  • A. Create a SageMaker Serverless Inference endpoint for each model. Use provisioned concurrency for the endpoints.
  • B. Create a SageMaker Asynchronous Inference endpoint for each model. Create an auto scaling policy for each endpoint.
  • C. Create a SageMaker endpoint. Create an inference component for each model. In the inference component settings, specify the newly created endpoint. Create an auto scaling policy for each inference component. Set the parameter for the minimum number of copies to at least 1.
  • D. Create an Amazon S3 bucket. Store all the model artifacts in the S3 bucket. Create a SageMaker multi-model endpoint. Point the endpoint to the S3 bucket. Create an auto scaling policy for the endpoint. Set the parameter for the minimum number of copies to at least 1.
Show Suggested Answer Hide Answer
Suggested Answer: C 🗳️

Comments

Chosen Answer:
This is a voting comment (?). It is better to Upvote an existing comment if you don't have anything to add.
Switch to a voting comment New
eesa
1 week, 2 days ago
Selected Answer: C
✅ Explanation: Requirements Recap: Real-time inference: Needs low-latency predictions. Accelerated instances: Likely GPU-backed, costly to scale inefficiently. No cold starts: Endpoints must always be warm and responsive. Each model has different scaling needs: Must support independent scaling of each model. ✅ Why Option C is correct: Inference components are a new SageMaker feature that allow: Hosting multiple models on a single endpoint. Independent scaling of each model (component). Avoiding cold starts via minimum number of copies. Setting min invocations or min replicas ≥ 1 keeps the model always warm, eliminating cold starts. This solution meets all requirements efficiently.
upvoted 2 times
...
ygn4ei
1 week, 3 days ago
Selected Answer: A
this is correct
upvoted 1 times
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...
exam
Someone Bought Contributor Access for:
SY0-701
London, 1 minute ago