exam questions

Exam AWS Certified Machine Learning - Specialty All Questions

View all questions & answers for the AWS Certified Machine Learning - Specialty exam

Exam AWS Certified Machine Learning - Specialty topic 1 question 72 discussion

A Data Scientist is building a model to predict customer churn using a dataset of 100 continuous numerical features. The Marketing team has not provided any insight about which features are relevant for churn prediction. The Marketing team wants to interpret the model and see the direct impact of relevant features on the model outcome. While training a logistic regression model, the Data Scientist observes that there is a wide gap between the training and validation set accuracy.
Which methods can the Data Scientist use to improve the model performance and satisfy the Marketing team's needs? (Choose two.)

  • A. Add L1 regularization to the classifier
  • B. Add features to the dataset
  • C. Perform recursive feature elimination
  • D. Perform t-distributed stochastic neighbor embedding (t-SNE)
  • E. Perform linear discriminant analysis
Show Suggested Answer Hide Answer
Suggested Answer: AC 🗳️

Comments

Chosen Answer:
This is a voting comment (?). It is better to Upvote an existing comment if you don't have anything to add.
Switch to a voting comment New
bluer1
Highly Voted 1 year, 12 months ago
AC - correct answer
upvoted 13 times
...
lynn22
Most Recent 4 months, 2 weeks ago
Selected Answer: AE
I think ACE are all correct
upvoted 1 times
...
loict
7 months, 1 week ago
Selected Answer: AC
A. YES - standard for overfitting B. NO - we have already too much overfitting C. YES - feature elimination can reduce model complexity and thus overfitting D. NO - that does dimensionnality reduction to 2D or 3D, for visualization; we want more than a few features E. NO - LDA is an alternative to logistic regression; it may not address overfitting
upvoted 4 times
...
Mickey321
8 months ago
Selected Answer: AC
A due to fitting C Recursive feature elimination (RFE) is a wrapper method that iteratively removes features based on their importance scores from a classifier. RFE starts with all features and then eliminates the least important ones until a desired number of features is reached. This can help to reduce the dimensionality of the dataset and improve the model performance by removing irrelevant or redundant features. The Marketing team can then interpret the model by looking at the remaining features and their importance scores.
upvoted 1 times
...
kaike_reis
8 months, 3 weeks ago
Selected Answer: AC
AC are the correct
upvoted 1 times
...
earthMover
11 months ago
Selected Answer: AC
How can we add features to the dataset provided.... we can't make them up from thin air. Hopefully the moderators can provide some insight on this. I was thinking of paying for this site but the answers are all over the place.
upvoted 1 times
...
bakarys
1 year, 2 months ago
Selected Answer: AC
A. Add L1 regularization to the classifier and C. Perform recursive feature elimination are the methods that can be used to improve the model performance and satisfy the Marketing team's needs. Explanation: A. Adding L1 regularization to the logistic regression classifier can help to improve the model performance and reduce overfitting. This can also help to highlight the relevant features for churn prediction as L1 regularization can shrink the coefficients of irrelevant features to zero. C. Recursive feature elimination can be used to select the most relevant features for the model. This can help to improve the model performance and highlight the relevant features for churn prediction.
upvoted 3 times
...
AjoseO
1 year, 2 months ago
Selected Answer: AC
A. Adding L1 regularization can help to reduce overfitting by shrinking the coefficients of less important features towards zero, which can improve the model's generalization performance on the validation set. C. Recursive feature elimination is a feature selection technique that removes the least important feature at each iteration and trains the model on the remaining features until a desired number of features is reached. This method can be used to identify the most relevant features for the prediction task and reduce the dimensionality of the dataset, leading to improved model performance and interpretability for the Marketing team.
upvoted 2 times
...
wisoxe8356
1 year, 4 months ago
AC - Key: logistic regression model = non linear in terms of Odds and Probability, however it is linear in terms of Log Odds. Key: Large gap between training & validation = overfitting => 5 techniques to prevent overfitting: 1. Simplifying the model | 2. Early stopping 3. Use data argumentation | 4. Use regularization | 5. Use dropouts A - yes to avoid overfitting (although i am thinking it is talking about regressor) Not B - add feature will lead to overfitting C - feature elimination - prevent overfitting Not D - t-SNE is a nonlinear dimensionality reduction technique Not E - find feature correlation only - Linear discriminant analysis (LDA), normal discriminant analysis (NDA), or discriminant function analysis is a generalization of Fisher's linear discriminant, a method used in statistics and other fields, to find a linear combination of features that characterizes or separates two or more classes of objects or events.
upvoted 4 times
...
itallomd
1 year, 5 months ago
L1 won't do naturally the feature elimination? I guess AB
upvoted 1 times
...
Atreides457
1 year, 7 months ago
why not A & D? or C & D? does not t-SNE grant the marketing team's wish for visualization of relationships? or are we to presume that A&C are best as C (recursive feature elimination) grants us some visualization of feature importance.
upvoted 2 times
...
tgaos
1 year, 10 months ago
Selected Answer: AC
AC is correct
upvoted 3 times
...
NeverMinda
1 year, 10 months ago
Selected Answer: AC
overfitting: add regularization, remove features
upvoted 4 times
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...
exam
Someone Bought Contributor Access for:
SY0-701
London, 1 minute ago