exam questions

Exam AWS Certified Machine Learning - Specialty All Questions

View all questions & answers for the AWS Certified Machine Learning - Specialty exam

Exam AWS Certified Machine Learning - Specialty topic 1 question 204 discussion

A retail company uses a machine learning (ML) model for daily sales forecasting. The model has provided inaccurate results for the past 3 weeks. At the end of each day, an AWS Glue job consolidates the input data that is used for the forecasting with the actual daily sales data and the predictions of the model. The AWS Glue job stores the data in Amazon S3.

The company's ML team determines that the inaccuracies are occurring because of a change in the value distributions of the model features. The ML team must implement a solution that will detect when this type of change occurs in the future.

Which solution will meet these requirements with the LEAST amount of operational overhead?

  • A. Use Amazon SageMaker Model Monitor to create a data quality baseline. Confirm that the emit_metrics option is set to Enabled in the baseline constraints file. Set up an Amazon CloudWatch alarm for the metric.
  • B. Use Amazon SageMaker Model Monitor to create a model quality baseline. Confirm that the emit_metrics option is set to Enabled in the baseline constraints file. Set up an Amazon CloudWatch alarm for the metric.
  • C. Use Amazon SageMaker Debugger to create rules to capture feature values Set up an Amazon CloudWatch alarm for the rules.
  • D. Use Amazon CloudWatch to monitor Amazon SageMaker endpoints. Analyze logs in Amazon CloudWatch Logs to check for data drift.
Show Suggested Answer Hide Answer
Suggested Answer: A 🗳️

Comments

Chosen Answer:
This is a voting comment (?). It is better to Upvote an existing comment if you don't have anything to add.
Switch to a voting comment New
hichemck
Highly Voted 1 year, 4 months ago
Selected Answer: A
A is correct. "If the statistical nature of the data that your model receives while in production drifts away from the nature of the baseline data it was trained on, the model begins to lose accuracy in its predictions. Amazon SageMaker Model Monitor uses rules to detect data drift and alerts you when it happens." https://docs.aws.amazon.com/sagemaker/latest/dg/model-monitor-data-quality.html
upvoted 10 times
...
loict
Most Recent 7 months, 2 weeks ago
Selected Answer: A
A. YES - Model Monitor can validate distribution of input data B. NO - a model quality baseline is for model performance eg. precision, F1 score, etc. C. NO - Model Monitor is the right tool D. NO - Model Monitor is the right tool
upvoted 1 times
...
kaike_reis
8 months, 2 weeks ago
Selected Answer: A
it's a problem of monitoring data distributions.
upvoted 1 times
...
Mickey321
8 months, 3 weeks ago
Selected Answer: A
The reason for this choice is that Amazon SageMaker Model Monitor is a feature of Amazon SageMaker that allows you to monitor and analyze your machine learning models in production. Model Monitor can automatically detect data drift and other data quality issues by comparing your live data with a baseline dataset that you provide1. Model Monitor can also emit metrics and alerts when it detects violations of the constraints that you define or that it suggests based on the baseline2.
upvoted 1 times
...
AjoseO
1 year, 2 months ago
Selected Answer: A
A is correct. The best solution to meet the requirements is to use Amazon SageMaker Model Monitor to create a data quality baseline. The ML team can set up a data quality baseline to detect when the input data to the model has drifted significantly from the historical distribution of the data. When data drift occurs, the Model Monitor emits a metric that can trigger an alarm in Amazon CloudWatch. The ML team can use this alarm to investigate and take corrective action. Option B is incorrect because model quality baseline monitors model performance, not the input data quality. Option C is incorrect because Amazon SageMaker Debugger is used to debug machine learning models and to identify problems with model training, not data quality. Option D is incorrect because Amazon CloudWatch does not provide any features to monitor data drift in the input data used for the machine learning model.
upvoted 3 times
...
drcok87
1 year, 2 months ago
Selected Answer: A
Data monitor https://docs.aws.amazon.com/sagemaker/latest/dg/model-monitor-data-quality.html Properties of independent variables changes due to seasonality, customer preferences Model monitor https://docs.aws.amazon.com/sagemaker/latest/dg/model-monitor-model-quality.html Concept of what is spam email, changes over time "The company's ML team determines that the inaccuracies are occurring because of a change in the value distributions of the model features.” They know model features that is data for model input is changing so we monitor data https://pair-code.github.io/what-if-tool/learn/tutorials/features-overview/ a
upvoted 1 times
...
drcok87
1 year, 2 months ago
Data monitor https://docs.aws.amazon.com/sagemaker/latest/dg/model-monitor-data-quality.html Properties of independent variables changes due to seasonality, customer preferences Model monitor https://docs.aws.amazon.com/sagemaker/latest/dg/model-monitor-model-quality.html Concept of what is spam email, changes over time "The company's ML team determines that the inaccuracies are occurring because of a change in the value distributions of the model features.” They know model features that is data for model input is changing so we monitor data https://pair-code.github.io/what-if-tool/learn/tutorials/features-overview/ a
upvoted 1 times
...
dunhill
1 year, 4 months ago
I think the answer is B. Data quality can be monitored via model monitor model quality baseline.
upvoted 1 times
...
VinceCar
1 year, 4 months ago
Selected Answer: B
B. Since it's "a change in the value distributions of the model features".
upvoted 1 times
kaike_reis
8 months, 2 weeks ago
model features = data
upvoted 1 times
...
...
tsangckl
1 year, 5 months ago
What is the difference of ans A and B?
upvoted 1 times
pass3in3mon
1 year, 4 months ago
model quality baseline vs data quality baseline
upvoted 1 times
...
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...
exam
Someone Bought Contributor Access for:
SY0-701
London, 1 minute ago