Unlimited Access

Get Unlimited Contributor Access to the all ExamTopics Exams!
Take advantage of PDF Files for 1000+ Exams along with community discussions and pass IT Certification Exams Easily.

Get Unlimited Access

Amazon Discussions

Exam AWS Certified Machine Learning - Specialty topic 1 question 26 discussion

Exam question from Amazon's AWS Certified Machine Learning - Specialty

Question #: 26
Topic #: 1

[All AWS Certified Machine Learning - Specialty Questions]

A Machine Learning Specialist kicks off a hyperparameter tuning job for a tree-based ensemble model using Amazon SageMaker with Area Under the ROC Curve
(AUC) as the objective metric. This workflow will eventually be deployed in a pipeline that retrains and tunes hyperparameters each night to model click-through on data that goes stale every 24 hours.
With the goal of decreasing the amount of time it takes to train these models, and ultimately to decrease costs, the Specialist wants to reconfigure the input hyperparameter range(s).
Which visualization will accomplish this?

A. A histogram showing whether the most important input feature is Gaussian.
B. A scatter plot with points colored by target variable that uses t-Distributed Stochastic Neighbor Embedding (t-SNE) to visualize the large number of input variables in an easier-to-read dimension.
C. A scatter plot showing the performance of the objective metric over each training iteration.
D. A scatter plot showing the correlation between maximum tree depth and the objective metric.

Show Suggested Answer

Suggested Answer: B 🗳️

by heihei at Dec. 13, 2019, 8:57 a.m.

Disclaimers:

- ExamTopics website is not related to, affiliated with, endorsed or authorized by Amazon.
- Trademarks, certification & product names are used for reference only and belong to Amazon.

Comments

Submit Cancel

cloud_trail

Highly Voted 2 years, 7 months ago

This is a very tricky question. The idea is to reconfigure the ranges of the hyperparameters. A refers to a feature, not a hyperparameter. A is out. C refers to training the model, not optimizing the range of hyperparameters. C is out. Now it gets tricky. D will let you find determine what the approximately best tree depth is. That's good. That's what you're trying to do but it's only one of many hyperparameters. It's the best choice so far. B is tricky. t-SNE does help you visualize multidimensional data but option B refers to input variables, not hyperparameters. For this very tricky question, I would do with D. It's the only one that accomplishes the task of limiting the range of a hyperparameter, even if it is only one of them.

upvoted 48 times

cnethers

2 years, 7 months ago

It's good to see someone keeping a thoughtful and curious mind to this question. I too have the same conclusion, not an easy question.

upvoted 3 times

...

ovokpus

1 year, 10 months ago

But, how do you optimize hyperparameters without training experiments? That is why C is the best option. You get a value for each unique combination of hyperparameters.

upvoted 1 times

...

Dr_Kiko

2 years, 6 months ago

B is also wrong as t-SNE picture is not actionable - good visual but ... that's it. try pictures here https://lvdmaaten.github.io/tsne/

upvoted 1 times

...

AddiWei

2 years, 2 months ago

When you are tuning hyperparameters you are literally training multiple models and searching for the best ones.

upvoted 2 times

...

heihei

Highly Voted 2 years, 8 months ago

B doesn't make sense I think it's D

upvoted 14 times

...

VR10

Most Recent 2 months, 4 weeks ago

Option C. See it is doing a scatter plot on the metric for each iteration. Each iteration is running with a certain set of hyper parameters. So if I plot this. and I find which iteration has the best metric, I could simply pick up those set of hyperparameters. D will only led to the tuning of maximum tree depth. I am not sure which option would satisfy the goal to decrease cost but just looking at maximum tree depth doesnt seem right to me. It might be a way to just look at the tree depth and tune just that parameter and since you are only tuning 1 paramter, it may be cheaper, but would that lead to a usable model? I think it should be option C.

upvoted 1 times

...

Regu7

4 months ago

On what basis the correct answers are provided in this platform? Are they assuming this is the correct answer or it is taken from somewhere ?

upvoted 1 times

...

elvin_ml_qayiran25091992razor

6 months, 1 week ago

Selected Answer: D

D IS THE CORRECT

upvoted 1 times

...

Rejju

8 months, 1 week ago

Selected Answer: C

Option D, can also be useful in hyperparameter tuning for tree-based ensemble models, especially if the maximum tree depth is one of the hyperparameters you want to optimize. However, when the goal is to decrease training time and costs by reconfiguring input hyperparameter ranges, a scatter plot showing the performance of the objective metric over each training iteration (Option C) is generally more directly related to the hyperparameter tuning process. It helps you track how the model's performance changes during hyperparameter tuning, which is critical for making decisions about which hyperparameter ranges to explore further. Option D is valuable for understanding the relationship between maximum tree depth and the objective metric, but it might not provide as comprehensive insights into the overall hyperparameter tuning process compared to Option C.

upvoted 1 times

...

loict

8 months, 1 week ago

Selected Answer: D

A. NO - it is about data discovery B. NO - it is about data discovery C. MIGHT - (NO) is a training iteration the overnight training the question is referring to ? (YES) Or each HPO training within each night ? D. YES - the less ambiguous answer

upvoted 1 times

...

DavidRou

8 months, 1 week ago

I think that C should be the right answer. The specialist can monitor how the model works by changing hyperparameters' values in each training iteration.

upvoted 1 times

...

Mickey321

8 months, 3 weeks ago

Selected Answer: D

Option D

upvoted 1 times

...

kaike_reis

9 months, 3 weeks ago

Selected Answer: D

A and B are wrong, because is totally out of question context. C is for monitoring a model, it doesn't help to change your HP range. D is the only answer that applies to the question.

upvoted 3 times

...

Venkatesh_Babu

9 months, 4 weeks ago

Selected Answer: C

I think it should be c

upvoted 1 times

...

CKS1210

11 months ago

Selected Answer: C

By plotting the performance of the objective metric (AUC) over each training iteration, the Specialist can analyze how different hyperparameter configurations affect the model's performance. This visualization helps in understanding which hyperparameter combinations lead to better results and allows the Specialist to identify areas of improvement.

upvoted 1 times

...

mirik

11 months, 2 weeks ago

D: By analyzing this relationship, the Specialist can adjust the range of maximum tree depth values used during hyperparameter tuning to decrease training time and costs.

upvoted 1 times

...

earthMover

12 months ago

Selected Answer: D

D Seems like the best answer. When answer is considered correct who is making that call an is there any justification provided for us to learn from?

upvoted 2 times

...

Valcilio

1 year, 2 months ago

Selected Answer: D

It's about parameters, not about dimensionality.

upvoted 2 times

...

hug_c0sm0s

1 year, 2 months ago

Selected Answer: D

BING chat chooses this answer, and provides an explanation:The correct answer is D. A scatter plot showing the correlation between maximum tree depth and the objective metric. This visualization will help the Machine Learning Specialist to understand how changes in maximum tree depth affect the performance of the model with respect to the objective metric (AUC). By analyzing this relationship, the Specialist can adjust the range of maximum tree depth values used during hyperparameter tuning to decrease training time and costs.

upvoted 1 times

...

bakarys

1 year, 2 months ago

Selected Answer: C

- C This visualization will allow the Machine Learning Specialist to track the performance of the model over time as different hyperparameter configurations are tried. By analyzing this plot, the Specialist can identify which hyperparameters are leading to better model performance and adjust the input hyperparameter range(s) accordingly, with the goal of decreasing the time it takes to train the models and decrease costs.

upvoted 1 times

cpal012

1 year, 1 month ago

Agree, this is my thought process too. I hope the exam is no where near this nebulous

upvoted 1 times

...

Load full discussion...

Unlimited Access

Exam AWS Certified Machine Learning - Specialty topic 1 question 26 discussion

Comments

cloud_trail

cnethers

ovokpus

Dr_Kiko

AddiWei

heihei

VR10

Regu7

elvin_ml_qayiran25091992razor

Rejju

loict

DavidRou

Mickey321

kaike_reis

Venkatesh_Babu

CKS1210

mirik

earthMover

Valcilio

hug_c0sm0s

bakarys

cpal012

Get IT Certification

New Version GCP Professional Cloud Architect Certificate & Helpful Information

The 5 Most In-Demand Project Management Certifications of 2019