Exam AWS Certified Machine Learning - Specialty All Questions

View all questions & answers for the AWS Certified Machine Learning - Specialty exam

Exam AWS Certified Machine Learning - Specialty topic 1 question 30 discussion

Exam question from Amazon's AWS Certified Machine Learning - Specialty

Question #: 30
Topic #: 1

[All AWS Certified Machine Learning - Specialty Questions]

When submitting Amazon SageMaker training jobs using one of the built-in algorithms, which common parameters MUST be specified? (Choose three.)

A. The training channel identifying the location of training data on an Amazon S3 bucket.
B. The validation channel identifying the location of validation data on an Amazon S3 bucket.
C. The IAM role that Amazon SageMaker can assume to perform tasks on behalf of the users.
D. Hyperparameters in a JSON array as documented for the algorithm used.
E. The Amazon EC2 instance class specifying whether training will be run using CPU or GPU.
F. The output path specifying where on an Amazon S3 bucket the trained model will persist.

Show Suggested Answer

Suggested Answer: ACF 🗳️

by DonaldCMLIN at Nov. 16, 2019, 7:05 a.m.

Disclaimers:

- ExamTopics website is not related to, affiliated with, endorsed or authorized by Amazon.
- Trademarks, certification & product names are used for reference only and belong to Amazon.

Comments

Submit Cancel

DonaldCMLIN

Highly Voted 3 years, 9 months ago

THE ANSWER SHOUD BE CEF IAM ROLE, INSTANCE TYPE, OUTPUT PATH

upvoted 29 times

hamimelon

2 years, 6 months ago

Why not A? You don't need to tell Sagemaker where the training data is located?

upvoted 3 times

You need to specify the InputDataConfig, but it does not need to be "S3" I think the reason why A and B are wrong, not because data location is not required, but because it doesn't need to be S3, it can be Amazon S3, EFS, or FSx location

upvoted 1 times

...

HaiHN

3 years, 8 months ago

Should be C, E, F From the SageMaker notebook example: https://github.com/aws/amazon-sagemaker-examples/blob/master/introduction_to_amazon_algorithms/semantic_segmentation_pascalvoc/semantic_segmentation_pascalvoc.ipynb # Create the sagemaker estimator object. ss_model = sagemaker.estimator.Estimator(training_image, role, train_instance_count = 1, train_instance_type = 'ml.p3.2xlarge', train_volume_size = 50, train_max_run = 360000, output_path = s3_output_location, base_job_name = 'ss-notebook-demo', sagemaker_session = sess)

upvoted 12 times

uninit

2 years, 5 months ago

It says InstanceClass - CPU/GPU in the question, not InstanceType

upvoted 6 times

...

mirik

2 years ago

instance type has default value.

upvoted 3 times

...

VB

Highly Voted 3 years, 8 months ago

From here https://docs.aws.amazon.com/zh_tw/sagemaker/latest/dg/API_CreateTrainingJob.html .. the only "Required: Yes" attributes are: 1. AlgorithmSpecification (in this TrainingInputMode is Required - i.e. File or Pipe) 2. OutputDataConfig (in this S3OutputPath is Required - where the model artifacts are stored) 3. ResourceConfig (in this EC2 InstanceType and VolumeSizeInGB are required) 4. RoleArn (..The Amazon Resource Name (ARN) of an IAM role that Amazon SageMaker can assume to perform tasks on your behalf...the caller of this API must have the iam:PassRole permission.) 5. StoppingCondition 6. TrainingJobName (The name of the training job. The name must be unique within an AWS Region in an AWS account.) From the given options in the questions.. we have 2, 3, and 4 above. so, the answer is CEF.

upvoted 27 times

cloud_trail

3 years, 8 months ago

This is the best explanation that CEF is the right answer, IMO. The document at that url is very informative. It also specifically states that InputDataConfig is NOT required. Having said that, I have no idea how the model will train if it doesn't know where to find the training data, but that is what the document says. If someone can explain that, I'd like to hear the explanation.

upvoted 7 times

cloud_trail

3 years, 8 months ago

If I see this question on the actual exam, I'm going with AEF. The model absolutely must know where the training data is. I have seen other documentation that does confirm that you need the location of the input data, the compute instance and location to output the model artifacts.

upvoted 3 times

CloudGuru_ZA

3 years, 8 months ago

but you also need to specify the service role sagemaker should use otherwise it will not be able to perform actions on your behalf like provisioning the training instances.

upvoted 2 times

...

rafaelo

3 years, 6 months ago

Perfect explanation. It is CEF

upvoted 1 times

...

JK1977

2 years, 1 month ago

The question is asking about built in algorithms. It should be ADE. See https://docs.aws.amazon.com/zh_tw/sagemaker/latest/dg/API_CreateTrainingJob.html

upvoted 1 times

...

OAmine

1 year, 9 months ago

for "3. ResourceConfig", only VolumeSizeInGB is required. So, it's not about the instance type. Check: https://docs.aws.amazon.com/zh_tw/sagemaker/latest/APIReference/API_ResourceConfig.html

upvoted 1 times

...

JonSno

Most Recent 4 months, 2 weeks ago

Selected Answer: ACF

Reason: When submitting Amazon SageMaker training jobs using built-in algorithms, the following parameters must be specified: Training Data Location (A) SageMaker requires the training dataset's location in Amazon S3. Provided as a channel input in the training job. IAM Role (C) SageMaker needs IAM permissions to access data from S3 and execute tasks on behalf of the user. Model Output Path (F) Specifies the S3 bucket location where the trained model artifacts will be stored.

upvoted 2 times

...

AbhayD

5 months, 2 weeks ago

Selected Answer: ACF

Instance type is required but not specific class CPU/GPU. Sagamkaer can handle that.

upvoted 1 times

...

MultiCloudIronMan

8 months, 1 week ago

Selected Answer: ACF

These parameters ensure that the training job has access to the necessary data, permissions, and storage locations to function correctly.

upvoted 1 times

...

MultiCloudIronMan

8 months, 1 week ago

Selected Answer: ACF

Options B, D, and E are important but not always mandatory for every training job. For example, validation data (Option B) is not always required, and hyperparameters (Option D) and instance types (Option E) can have default values or be optional depending on the specific algorithm and setup.

upvoted 1 times

...

amlgeek

9 months ago

import boto3 import sagemaker sess = sagemaker.Session() # Example for the linear learner linear = sagemaker.estimator.Estimator( container, role, # role (c) instance_count=1, instance_type="ml.c4.xlarge", # instance type (e) output_path=output_location, # output path (f) sagemaker_session=sess, )

upvoted 1 times

...

kiran15789

10 months, 2 weeks ago

Selected Answer: CEF

Going with cef

upvoted 1 times

...

ML_2

10 months, 3 weeks ago

Selected Answer: CEF

ANSWER IS CEF Here from Amazon docs InputDataConfig An array of Channel objects. Each channel is a named input source. InputDataConfig describes the input data and its location. Required: No OutputDataConfig Specifies the path to the S3 location where you want to store model artifacts. SageMaker creates subfolders for the artifacts. Required: Yes ResourceConfig - Identifies the resources, ML compute instances, and ML storage volumes to deploy for model training. In distributed training, you specify more than one instance. Required: Yes

upvoted 1 times

...

RathanKalluri

11 months, 3 weeks ago

CEF https://docs.aws.amazon.com/sagemaker/latest/APIReference/API_CreateTrainingJob.html#API_CreateTrainingJob_RequestParameters

upvoted 1 times

...

ninomfr64

1 year ago

Based on https://docs.aws.amazon.com/sagemaker/latest/APIReference/API_CreateTrainingJob.html Required parameters are: - AlgorithmSpecification (registry path of the Docker image with the training algorithm) - OutputDataConfig (path to the S3 location where you want to store model artifacts) - ResourceConfig (resources, including the ML compute instances and ML storage volumes, to use for model training) - RoleArn - StoppingCondition (time limit for training job) - TrainingJobName Thus, the answer is: C E F wording for option E is inaccurate "EC2 instance class specifying whether training will be run using CPU or GPU" but they do it on purpose

upvoted 1 times

...

rookiee1111

1 year, 2 months ago

Selected Answer: ACF

The input channel and output channel are mandatory, as the training job needs to know where to get the input data from and where to publish the model artifact. IAM role is also needed, for AWS services. others are not mandatory, validation channel is not mandatory for instance in case of unsupervised learning, likewise hyper params can be be auto tuned for as well as the ec2 instance types can be default ones that will be picked

upvoted 2 times

...

Denise123

1 year, 2 months ago

As they narrowed it to S3, A is incorrect BUT when submitting Amazon SageMaker training jobs using one of the built-in algorithms, it is a MUST to identify the location of training data. While Amazon S3 is commonly used for storing training data, other sources like Docker containers, DynamoDB, or local disks of training instances can also be used. Therefore, specifying the location of training data is essential for SageMaker to know where to access the data during training. So the right answer is CEF for me for this case... However if A was saying identify the location of training data, I think option A would be included in the MUST parameter.

upvoted 1 times

...

sachin80

1 year, 2 months ago

InputDataConffig is optional in create_training_job.Please check thte parameters that are required. So answer is CEF: https://docs.aws.amazon.com/sagemaker/latest/APIReference/API_CreateTrainingJob.html

upvoted 1 times

...

sachin80

1 year, 2 months ago

InputDataConffig is optional in create_training_job.Please check thte parameters that are required. So answer is SEF: https://docs.aws.amazon.com/sagemaker/latest/APIReference/API_CreateTrainingJob.html

upvoted 1 times

...

vkbajoria

1 year, 2 months ago

Selected Answer: CEF

Input is required only when calling Fit method. When initializing the Estimator, we do not need input

upvoted 1 times

...

rav009

1 year, 3 months ago

Selected Answer: ACF

I open the sagemaker and tested. A C F B is not needed for non-supervised algorithm.

upvoted 2 times

...

Load full discussion...

Exam AWS Certified Machine Learning - Specialty All Questions

View all questions & answers for the AWS Certified Machine Learning - Specialty exam

Exam AWS Certified Machine Learning - Specialty topic 1 question 30 discussion

Comments

DonaldCMLIN

hamimelon

ZSun

HaiHN

uninit

mirik

VB

cloud_trail

cloud_trail

CloudGuru_ZA

rafaelo

JK1977

OAmine

JonSno

AbhayD

MultiCloudIronMan

MultiCloudIronMan

amlgeek

kiran15789

ML_2

RathanKalluri

ninomfr64

rookiee1111

Denise123

sachin80

sachin80

vkbajoria

rav009