Welcome to ExamTopics
ExamTopics Logo
- Expert Verified, Online, Free.

Unlimited Access

Get Unlimited Contributor Access to the all ExamTopics Exams!
Take advantage of PDF Files for 1000+ Exams along with community discussions and pass IT Certification Exams Easily.

Exam AWS Certified Machine Learning - Specialty topic 1 question 26 discussion

A Machine Learning Specialist kicks off a hyperparameter tuning job for a tree-based ensemble model using Amazon SageMaker with Area Under the ROC Curve
(AUC) as the objective metric. This workflow will eventually be deployed in a pipeline that retrains and tunes hyperparameters each night to model click-through on data that goes stale every 24 hours.
With the goal of decreasing the amount of time it takes to train these models, and ultimately to decrease costs, the Specialist wants to reconfigure the input hyperparameter range(s).
Which visualization will accomplish this?

  • A. A histogram showing whether the most important input feature is Gaussian.
  • B. A scatter plot with points colored by target variable that uses t-Distributed Stochastic Neighbor Embedding (t-SNE) to visualize the large number of input variables in an easier-to-read dimension.
  • C. A scatter plot showing the performance of the objective metric over each training iteration.
  • D. A scatter plot showing the correlation between maximum tree depth and the objective metric.
Show Suggested Answer Hide Answer
Suggested Answer: B 🗳️

Comments

Chosen Answer:
This is a voting comment (?) , you can switch to a simple comment.
Switch to a voting comment New
cloud_trail
Highly Voted 2 years, 7 months ago
This is a very tricky question. The idea is to reconfigure the ranges of the hyperparameters. A refers to a feature, not a hyperparameter. A is out. C refers to training the model, not optimizing the range of hyperparameters. C is out. Now it gets tricky. D will let you find determine what the approximately best tree depth is. That's good. That's what you're trying to do but it's only one of many hyperparameters. It's the best choice so far. B is tricky. t-SNE does help you visualize multidimensional data but option B refers to input variables, not hyperparameters. For this very tricky question, I would do with D. It's the only one that accomplishes the task of limiting the range of a hyperparameter, even if it is only one of them.
upvoted 48 times
cnethers
2 years, 7 months ago
It's good to see someone keeping a thoughtful and curious mind to this question. I too have the same conclusion, not an easy question.
upvoted 3 times
...
ovokpus
1 year, 10 months ago
But, how do you optimize hyperparameters without training experiments? That is why C is the best option. You get a value for each unique combination of hyperparameters.
upvoted 1 times
...
Dr_Kiko
2 years, 6 months ago
B is also wrong as t-SNE picture is not actionable - good visual but ... that's it. try pictures here https://lvdmaaten.github.io/tsne/
upvoted 1 times
...
AddiWei
2 years, 2 months ago
When you are tuning hyperparameters you are literally training multiple models and searching for the best ones.
upvoted 2 times
...
...
heihei
Highly Voted 2 years, 8 months ago
B doesn't make sense I think it's D
upvoted 14 times
...
VR10
Most Recent 2 months, 4 weeks ago
Option C. See it is doing a scatter plot on the metric for each iteration. Each iteration is running with a certain set of hyper parameters. So if I plot this. and I find which iteration has the best metric, I could simply pick up those set of hyperparameters. D will only led to the tuning of maximum tree depth. I am not sure which option would satisfy the goal to decrease cost but just looking at maximum tree depth doesnt seem right to me. It might be a way to just look at the tree depth and tune just that parameter and since you are only tuning 1 paramter, it may be cheaper, but would that lead to a usable model? I think it should be option C.
upvoted 1 times
...
Regu7
4 months ago
On what basis the correct answers are provided in this platform? Are they assuming this is the correct answer or it is taken from somewhere ?
upvoted 1 times
...
elvin_ml_qayiran25091992razor
6 months, 1 week ago
Selected Answer: D
D IS THE CORRECT
upvoted 1 times
...
Rejju
8 months, 1 week ago
Selected Answer: C
Option D, can also be useful in hyperparameter tuning for tree-based ensemble models, especially if the maximum tree depth is one of the hyperparameters you want to optimize. However, when the goal is to decrease training time and costs by reconfiguring input hyperparameter ranges, a scatter plot showing the performance of the objective metric over each training iteration (Option C) is generally more directly related to the hyperparameter tuning process. It helps you track how the model's performance changes during hyperparameter tuning, which is critical for making decisions about which hyperparameter ranges to explore further. Option D is valuable for understanding the relationship between maximum tree depth and the objective metric, but it might not provide as comprehensive insights into the overall hyperparameter tuning process compared to Option C.
upvoted 1 times
...
loict
8 months, 1 week ago
Selected Answer: D
A. NO - it is about data discovery B. NO - it is about data discovery C. MIGHT - (NO) is a training iteration the overnight training the question is referring to ? (YES) Or each HPO training within each night ? D. YES - the less ambiguous answer
upvoted 1 times
...
DavidRou
8 months, 1 week ago
I think that C should be the right answer. The specialist can monitor how the model works by changing hyperparameters' values in each training iteration.
upvoted 1 times
...
Mickey321
8 months, 3 weeks ago
Selected Answer: D
Option D
upvoted 1 times
...
kaike_reis
9 months, 3 weeks ago
Selected Answer: D
A and B are wrong, because is totally out of question context. C is for monitoring a model, it doesn't help to change your HP range. D is the only answer that applies to the question.
upvoted 3 times
...
Venkatesh_Babu
9 months, 4 weeks ago
Selected Answer: C
I think it should be c
upvoted 1 times
...
CKS1210
11 months ago
Selected Answer: C
By plotting the performance of the objective metric (AUC) over each training iteration, the Specialist can analyze how different hyperparameter configurations affect the model's performance. This visualization helps in understanding which hyperparameter combinations lead to better results and allows the Specialist to identify areas of improvement.
upvoted 1 times
...
mirik
11 months, 2 weeks ago
D: By analyzing this relationship, the Specialist can adjust the range of maximum tree depth values used during hyperparameter tuning to decrease training time and costs.
upvoted 1 times
...
earthMover
12 months ago
Selected Answer: D
D Seems like the best answer. When answer is considered correct who is making that call an is there any justification provided for us to learn from?
upvoted 2 times
...
Valcilio
1 year, 2 months ago
Selected Answer: D
It's about parameters, not about dimensionality.
upvoted 2 times
...
hug_c0sm0s
1 year, 2 months ago
Selected Answer: D
BING chat chooses this answer, and provides an explanation:The correct answer is D. A scatter plot showing the correlation between maximum tree depth and the objective metric. This visualization will help the Machine Learning Specialist to understand how changes in maximum tree depth affect the performance of the model with respect to the objective metric (AUC). By analyzing this relationship, the Specialist can adjust the range of maximum tree depth values used during hyperparameter tuning to decrease training time and costs.
upvoted 1 times
...
bakarys
1 year, 2 months ago
Selected Answer: C
- C This visualization will allow the Machine Learning Specialist to track the performance of the model over time as different hyperparameter configurations are tried. By analyzing this plot, the Specialist can identify which hyperparameters are leading to better model performance and adjust the input hyperparameter range(s) accordingly, with the goal of decreasing the time it takes to train the models and decrease costs.
upvoted 1 times
cpal012
1 year, 1 month ago
Agree, this is my thought process too. I hope the exam is no where near this nebulous
upvoted 1 times
...
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...