Welcome to ExamTopics
ExamTopics Logo
- Expert Verified, Online, Free.

Unlimited Access

Get Unlimited Contributor Access to the all ExamTopics Exams!
Take advantage of PDF Files for 1000+ Exams along with community discussions and pass IT Certification Exams Easily.

Exam AWS Certified Machine Learning - Specialty topic 1 question 30 discussion

When submitting Amazon SageMaker training jobs using one of the built-in algorithms, which common parameters MUST be specified? (Choose three.)

  • A. The training channel identifying the location of training data on an Amazon S3 bucket.
  • B. The validation channel identifying the location of validation data on an Amazon S3 bucket.
  • C. The IAM role that Amazon SageMaker can assume to perform tasks on behalf of the users.
  • D. Hyperparameters in a JSON array as documented for the algorithm used.
  • E. The Amazon EC2 instance class specifying whether training will be run using CPU or GPU.
  • F. The output path specifying where on an Amazon S3 bucket the trained model will persist.
Show Suggested Answer Hide Answer
Suggested Answer: AEF 🗳️

Comments

Chosen Answer:
This is a voting comment (?) , you can switch to a simple comment.
Switch to a voting comment New
DonaldCMLIN
Highly Voted 2 years, 7 months ago
THE ANSWER SHOUD BE CEF IAM ROLE, INSTANCE TYPE, OUTPUT PATH
upvoted 28 times
hamimelon
1 year, 4 months ago
Why not A? You don't need to tell Sagemaker where the training data is located?
upvoted 1 times
ZSun
1 year, 1 month ago
You need to specify the InputDataConfig, but it does not need to be "S3" I think the reason why A and B are wrong, not because data location is not required, but because it doesn't need to be S3, it can be Amazon S3, EFS, or FSx location
upvoted 1 times
...
...
HaiHN
2 years, 6 months ago
Should be C, E, F From the SageMaker notebook example: https://github.com/aws/amazon-sagemaker-examples/blob/master/introduction_to_amazon_algorithms/semantic_segmentation_pascalvoc/semantic_segmentation_pascalvoc.ipynb # Create the sagemaker estimator object. ss_model = sagemaker.estimator.Estimator(training_image, role, train_instance_count = 1, train_instance_type = 'ml.p3.2xlarge', train_volume_size = 50, train_max_run = 360000, output_path = s3_output_location, base_job_name = 'ss-notebook-demo', sagemaker_session = sess)
upvoted 11 times
uninit
1 year, 3 months ago
It says InstanceClass - CPU/GPU in the question, not InstanceType
upvoted 4 times
...
mirik
10 months, 3 weeks ago
instance type has default value.
upvoted 3 times
...
...
...
VB
Highly Voted 2 years, 7 months ago
From here https://docs.aws.amazon.com/zh_tw/sagemaker/latest/dg/API_CreateTrainingJob.html .. the only "Required: Yes" attributes are: 1. AlgorithmSpecification (in this TrainingInputMode is Required - i.e. File or Pipe) 2. OutputDataConfig (in this S3OutputPath is Required - where the model artifacts are stored) 3. ResourceConfig (in this EC2 InstanceType and VolumeSizeInGB are required) 4. RoleArn (..The Amazon Resource Name (ARN) of an IAM role that Amazon SageMaker can assume to perform tasks on your behalf...the caller of this API must have the iam:PassRole permission.) 5. StoppingCondition 6. TrainingJobName (The name of the training job. The name must be unique within an AWS Region in an AWS account.) From the given options in the questions.. we have 2, 3, and 4 above. so, the answer is CEF.
upvoted 24 times
cloud_trail
2 years, 6 months ago
This is the best explanation that CEF is the right answer, IMO. The document at that url is very informative. It also specifically states that InputDataConfig is NOT required. Having said that, I have no idea how the model will train if it doesn't know where to find the training data, but that is what the document says. If someone can explain that, I'd like to hear the explanation.
upvoted 7 times
cloud_trail
2 years, 6 months ago
If I see this question on the actual exam, I'm going with AEF. The model absolutely must know where the training data is. I have seen other documentation that does confirm that you need the location of the input data, the compute instance and location to output the model artifacts.
upvoted 3 times
CloudGuru_ZA
2 years, 6 months ago
but you also need to specify the service role sagemaker should use otherwise it will not be able to perform actions on your behalf like provisioning the training instances.
upvoted 2 times
...
...
...
rafaelo
2 years, 4 months ago
Perfect explanation. It is CEF
upvoted 1 times
...
JK1977
11 months, 3 weeks ago
The question is asking about built in algorithms. It should be ADE. See https://docs.aws.amazon.com/zh_tw/sagemaker/latest/dg/API_CreateTrainingJob.html
upvoted 1 times
...
OAmine
7 months, 3 weeks ago
for "3. ResourceConfig", only VolumeSizeInGB is required. So, it's not about the instance type. Check: https://docs.aws.amazon.com/zh_tw/sagemaker/latest/APIReference/API_ResourceConfig.html
upvoted 1 times
...
...
rookiee1111
Most Recent 2 weeks, 1 day ago
Selected Answer: ACF
The input channel and output channel are mandatory, as the training job needs to know where to get the input data from and where to publish the model artifact. IAM role is also needed, for AWS services. others are not mandatory, validation channel is not mandatory for instance in case of unsupervised learning, likewise hyper params can be be auto tuned for as well as the ec2 instance types can be default ones that will be picked
upvoted 1 times
...
Denise123
2 weeks, 4 days ago
As they narrowed it to S3, A is incorrect BUT when submitting Amazon SageMaker training jobs using one of the built-in algorithms, it is a MUST to identify the location of training data. While Amazon S3 is commonly used for storing training data, other sources like Docker containers, DynamoDB, or local disks of training instances can also be used. Therefore, specifying the location of training data is essential for SageMaker to know where to access the data during training. So the right answer is CEF for me for this case... However if A was saying identify the location of training data, I think option A would be included in the MUST parameter.
upvoted 1 times
...
sachin80
2 weeks, 4 days ago
InputDataConffig is optional in create_training_job.Please check thte parameters that are required. So answer is CEF: https://docs.aws.amazon.com/sagemaker/latest/APIReference/API_CreateTrainingJob.html
upvoted 1 times
...
sachin80
2 weeks, 4 days ago
InputDataConffig is optional in create_training_job.Please check thte parameters that are required. So answer is SEF: https://docs.aws.amazon.com/sagemaker/latest/APIReference/API_CreateTrainingJob.html
upvoted 1 times
...
vkbajoria
1 month ago
Selected Answer: CEF
Input is required only when calling Fit method. When initializing the Estimator, we do not need input
upvoted 1 times
...
rav009
1 month, 2 weeks ago
Selected Answer: ACF
I open the sagemaker and tested. A C F B is not needed for non-supervised algorithm.
upvoted 2 times
...
vkbajoria
2 months, 3 weeks ago
C, E, F The trick was the training channel, but all the data channel are passed during when actually training the model using fit method
upvoted 1 times
...
VR10
2 months, 3 weeks ago
E is not important, some models could simply work on the default of CPU. A is a must and E is a must too. C is important for permission handling on S3 etc. It has to be A, C, F
upvoted 2 times
VR10
2 months, 3 weeks ago
Correction, having gone thru the doc more closely, there is no default for instance type. So the choices should be A, C, E.
upvoted 1 times
...
...
Alice1234
3 months, 1 week ago
A. The training channel identifying the location of training data on an Amazon S3 bucket: This is where SageMaker will get the input data for training the model. C. The IAM role that Amazon SageMaker can assume to perform tasks on behalf of the users: This role provides SageMaker the necessary permissions to access AWS resources. F. The output path specifying where on an Amazon S3 bucket the trained model will persist: After training, the model artifacts need to be saved in a specified S3 bucket location.
upvoted 1 times
...
Swagata23
4 months, 1 week ago
Selected Answer: CEF
Please go through the lab https://catalog.us-east-1.prod.workshops.aws/workshops/63069e26-921c-4ce1-9cc7-dd882ff62575/en-US/lab2
upvoted 1 times
...
phdykd
4 months, 1 week ago
ACF is answer
upvoted 1 times
...
Neet1983
4 months, 2 weeks ago
Selected Answer: ACF
A. The training channel identifying the location of training data on an Amazon S3 bucket: This is essential because SageMaker needs to know where to find the data for training. C. The IAM role that Amazon SageMaker can assume to perform tasks on behalf of the users: SageMaker requires permissions to access resources on behalf of the user, and this is provided by specifying an IAM role with the necessary policies attached. F. The output path specifying where on an Amazon S3 bucket the trained model will persist: After the model is trained, SageMaker needs to save the output, which includes the model artifacts, to a specified S3 location.
upvoted 3 times
...
mcwithimp
4 months, 3 weeks ago
Selected Answer: CEF
https://docs.aws.amazon.com/zh_tw/sagemaker/latest/dg/API_CreateTrainingJob.html The answer should be CEF The only attributes strictly required("Required: Yes" ) are: TrainingJobName AlgorithmSpecification OutputDataConfig ResourceConfig RoleArn StoppingCondition So, why is 'InputDataConfig' strictly required? When 'InputDataConfig' is not needed: Algorithm with Pre-loaded Data where the data is already embedded or hardcoded within the training script or Docker container or pre-defined datasets available within SageMaker Algorithm Generating (training) Data such as synthetic data generation or reinforcement learning scenarios
upvoted 1 times
...
rdoty
4 months, 4 weeks ago
Can confirm, the answer is CEF... I just went into SageMaker console, and tried to create a training job... You must have an IAM Role for permissions. Leaving the instance class/instance type blank is not possible it is always autofilled with a default instance, therefore REQUIRED. Output path for the model after training is also required, you cannot create a training job without these fields... therefore the answer is CEF - IAM Role - Instance Class/Type - Output path for completed model
upvoted 2 times
...
vikaspd
5 months, 2 weeks ago
Selected Answer: CEF
Required Parameters: Sagemaker Required Parameters Algorothm Specs - path to docker image Output Data Config -  (path to S3) Resource Confg - Instance type and storage volume Role ARN Stopping Conditions Training job Name
upvoted 1 times
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...