Welcome to ExamTopics
ExamTopics Logo
- Expert Verified, Online, Free.
exam questions

Exam AWS Certified Machine Learning - Specialty All Questions

View all questions & answers for the AWS Certified Machine Learning - Specialty exam

Exam AWS Certified Machine Learning - Specialty topic 1 question 286 discussion

A digital media company wants to build a customer churn prediction model by using tabular data. The model should clearly indicate whether a customer will stop using the company's services. The company wants to clean the data because the data contains some empty fields, duplicate values, and rare values.

Which solution will meet these requirements with the LEAST development effort?

  • A. Use SageMaker Canvas to automatically clean the data and to prepare a categorical model.
  • B. Use SageMaker Data Wrangler to clean the data. Use the built-in SageMaker XGBoost algorithm to train a classification model.
  • C. Use SageMaker Canvas automatic data cleaning and preparation tools. Use the built-in SageMaker XGBoost algorithm to train a regression model.
  • D. Use SageMaker Data Wrangler to clean the data. Use the SageMaker Autopilot to train a regression model
Show Suggested Answer Hide Answer
Suggested Answer: B 🗳️

Comments

Chosen Answer:
This is a voting comment (?) , you can switch to a simple comment.
Switch to a voting comment New
xiaoeason
Highly Voted 11 months ago
B 1.Data Cleaning: SageMaker Data Wrangler is designed for data preparation tasks, including handling missing values, duplicates, and rare values. It provides a visual interface to clean and transform tabular data efficiently. This addresses the data cleaning requirements mentioned in the question. 2.Model Training: Using the built-in SageMaker XGBoost algorithm is a common and effective choice for classification tasks like customer churn prediction. XGBoost is a powerful and widely used algorithm for binary classification problems.
upvoted 7 times
...
aquanaveen
Highly Voted 11 months ago
Selected Answer: B
B. Use SageMaker Data Wrangler to clean the data. Use the built-in SageMaker XGBoost algorithm to train a classification model. Explanation: SageMaker Data Wrangler: SageMaker Data Wrangler is designed for efficient data cleaning and preparation. It provides a visual interface that simplifies the process of cleaning tabular data, handling missing values, and addressing duplicate or rare values. Data Wrangler can generate the necessary preprocessing code automatically, reducing the development effort. SageMaker XGBoost (for Classification): XGBoost is a popular and powerful algorithm for classification tasks, including customer churn prediction. SageMaker provides a built-in XGBoost algorithm, making it easy to train a classification model without the need for extensive coding.
upvoted 7 times
...
MultiCloudIronMan
Most Recent 1 week, 5 days ago
Selected Answer: A
Option B involves using SageMaker Data Wrangler to clean the data and the built-in SageMaker XGBoost algorithm to train a classification model. While this is a valid approach, it requires more manual intervention and development effort compared to using SageMaker Canvas.
upvoted 1 times
...
SamHan
1 month ago
Selected Answer: A
A is correct
upvoted 1 times
...
pandkast
4 months, 3 weeks ago
Selected Answer: B
SageMaker Canvas is an excellent tool for those without ML expertise to build models, but it may not provide the detailed control needed for data cleaning and may not be as robust as Data Wrangler for complex cleaning tasks.
upvoted 2 times
...
Peter_Hsieh
6 months, 1 week ago
Selected Answer: A
https://aws.amazon.com/tw/about-aws/whats-new/2022/05/amazon-sagemaker-canvas-adds-new-data-capabilities-usability-updates/
upvoted 1 times
...
JonSno
6 months, 1 week ago
Answer A- Sagemaker Canvas + categorical model Reason : SageMaker Canvas: SageMaker Canvas is a no-code machine learning tool that allows users to perform data preparation, feature engineering, and model training with minimal technical expertise. It automatically handles tasks like data cleaning, including the removal of duplicates, filling missing values, and managing rare categories. Categorical Model: A categorical (classification) model is the correct type for churn prediction, as it aims to classify whether a customer will stop using the service (churn) or not. SageMaker Canvas provides user-friendly tools to build and evaluate this type of model.
upvoted 1 times
...
F1Fan
7 months, 2 weeks ago
While Amazon SageMaker Canvas can perform automatic data cleaning and preparation, it has certain limitations when it comes to handling complex data cleaning tasks. SageMaker Canvas is designed for building machine learning models with minimal code and effort, primarily targeting business analysts and non-technical users. It provides a guided user interface and automates many steps in the machine learning pipeline, including data cleaning and preparation. However, SageMaker Canvas has a set of built-in data cleaning and preparation operations, which may not be sufficient for handling all types of data quality issues or complex data transformations. If the data requires more advanced cleaning techniques or custom transformations, SageMaker Data Wrangler (option B) would be a better choice.
upvoted 1 times
...
vkbajoria
8 months, 2 weeks ago
Selected Answer: A
A is correct Canvas can do without writing single line of code
upvoted 3 times
...
Stokvisss
8 months, 2 weeks ago
Selected Answer: A
This can be done without code using SageMaker Canvas: https://aws.amazon.com/blogs/machine-learning/predicting-customer-churn-with-no-code-machine-learning-using-amazon-sagemaker-canvas/ Hence, A is right.
upvoted 1 times
...
kyuhuck
9 months, 1 week ago
Selected Answer: A
The best solution, meeting the requirements with the least development effort and correctly addressing the problem nature, is: A. Use SageMaker Canvas to automatically clean the data and to prepare a categorical model. This option leverages the simplicity and automatic features of SageMaker Canvas, ensuring minimal development effort while accurately targeting the need for a classification model in customer churn prediction.
upvoted 1 times
...
taustin2
10 months, 3 weeks ago
Selected Answer: A
See: https://aws.amazon.com/blogs/machine-learning/predicting-customer-churn-with-no-code-machine-learning-using-amazon-sagemaker-canvas/ Canvas also does no-code data cleaning and preparation. So, least development effort is Canvas.
upvoted 1 times
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...