Exam AWS Certified Machine Learning - Specialty topic 1 question 328 discussion

A machine learning (ML) specialist at a manufacturing company uses Amazon SageMaker DeepAR to forecast input materials and energy requirements for the company. Most of the data in the training dataset is missing values for the target variable. The company stores the training dataset as JSON files.

The ML specialist must develop a solution by using Amazon SageMaker DeepAR to account for the missing values in the training dataset.

Which approach will meet these requirements with the LEAST development effort?

  • A. Impute the missing values by using the linear regression method. Use the entire dataset and the imputed values to train the DeepAR model.
  • B. Replace the missing values with not a number (NaN). Use the entire dataset and the encoded missing values to train the DeepAR model.
  • C. Impute the missing values by using a forward fill. Use the entire dataset and the imputed values to train the DeepAR model.
  • D. Impute the missing values by using the mean value. Use the entire dataset and the imputed values to train the DeepAR model.
Suggested Answer: B

Comments

MultiCloudIronMan
6 months, 1 week ago
Selected Answer: B
Best to allow DeepAR to work out the missing values during training. This may allow it to adopt several approaches to find the one with the best fit.
upvoted 1 times
...
AIWave
1 year, 1 month ago
Selected Answer: B
DeepAR is a supervised RNN-based algorithm that can handle missing values directly within the model. Instead of pre-processing the data to impute missing values externally, DeepAR can work directly with missing values encoded as NaN.
upvoted 2 times
...
KAST_424
1 year, 1 month ago
Selected Answer: B
target—An array of floating-point values or integers that represent the time series. You can encode missing values as null literals, or as "NaN" strings in JSON, or as nan floating-point values in Parquet.
upvoted 2 times
...
Peter_Hsieh
1 year, 1 month ago
Selected Answer: B
https://docs.aws.amazon.com/sagemaker/latest/dg/deepar.html
upvoted 3 times
...
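For illustration, the encoding quoted above from the DeepAR documentation can be sketched in a few lines of Python. This is a minimal sketch, not an official AWS utility: the helper name `to_deepar_record` and the sample values are hypothetical, and it assumes missing observations arrive as `None` or a float NaN. It writes one JSON Lines record with the `start` and `target` fields and replaces each missing value with the string "NaN".

```python
import json
import math
from typing import Optional, Sequence


def to_deepar_record(start: str, values: Sequence[Optional[float]]) -> str:
    """Build one JSON Lines record, encoding missing targets as "NaN" strings.

    Hypothetical helper for illustration only.
    """
    target = [
        # None and float NaN are both treated as missing observations.
        "NaN" if v is None or (isinstance(v, float) and math.isnan(v)) else v
        for v in values
    ]
    return json.dumps({"start": start, "target": target})


# Hypothetical series with two missing observations.
line = to_deepar_record("2024-01-01 00:00:00", [12.5, None, float("nan"), 14.0])
print(line)
# {"start": "2024-01-01 00:00:00", "target": [12.5, "NaN", "NaN", 14.0]}
```

The explicit string replacement matters because `json.dumps` would otherwise emit a bare `NaN` token for a float NaN, which is not one of the encodings the quoted documentation lists for JSON input.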