Exam AWS Certified Machine Learning Engineer - Associate MLA-C01 topic 1 question 12 discussion

Exam question from Amazon's AWS Certified Machine Learning Engineer - Associate MLA-C01

Question #: 12
Topic #: 1

[All AWS Certified Machine Learning Engineer - Associate MLA-C01 Questions]

Case study -
An ML engineer is developing a fraud detection model on AWS. The training dataset includes transaction logs, customer profiles, and tables from an on-premises MySQL database. The transaction logs and customer profiles are stored in Amazon S3.
The dataset has a class imbalance that affects the learning of the model's algorithm. Additionally, many of the features have interdependencies. The algorithm is not capturing all the desired underlying patterns in the data.
The training dataset includes categorical data and numerical data. The ML engineer must prepare the training dataset to maximize the accuracy of the model.
Which action will meet this requirement with the LEAST operational overhead?

A. Use AWS Glue to transform the categorical data into numerical data.
B. Use AWS Glue to transform the numerical data into categorical data.
C. Use Amazon SageMaker Data Wrangler to transform the categorical data into numerical data.
D. Use Amazon SageMaker Data Wrangler to transform the numerical data into categorical data.

Show Suggested Answer

Suggested Answer: C 🗳️

by GiorgioGss at Nov. 27, 2024, 3:41 p.m.

Disclaimers:

- ExamTopics website is not related to, affiliated with, endorsed or authorized by Amazon.
- Trademarks, certification & product names are used for reference only and belong to Amazon.

Comments

Submit Cancel

ninomfr64

3 months, 3 weeks ago

Selected Answer: C

You need to transform category to numeric as ML model works with numbers, thus it is either A or C. Data Wrangler provides a builtin transformation to encode categorical data - https://docs.aws.amazon.com/sagemaker/latest/dg/data-wrangler-transform.html#data-wrangler-transform-cat-encode while Glue doesn't provide a managed transformation for encoding data - https://docs.aws.amazon.com/glue/latest/dg/edit-jobs-transforms.html

upvoted 1 times

...

Pofmagic

3 months, 4 weeks ago

Selected Answer: C

Data Wrangler can be used for encoding categorical data, i.e. the process of creating a numerical representation for categories. Categorical encoding encodes categorical data that is in string format into arrays of integers. Data Wrangler supports ordinal and a one-hot encoding, also similarity encoding (more advanced). https://docs.aws.amazon.com/sagemaker/latest/dg/data-wrangler-transform.html#data-wrangler-transform-cat-encode AWS Glue also has Data science recipe steps for One Hot Encoding and Categorical Mapping. https://docs.aws.amazon.com/databrew/latest/dg/recipe-actions.data-science.html However Data Wrangler is more user-friendly with visual and natural language interfaces for less operational overhead

upvoted 1 times

...

GiorgioGss

4 months, 4 weeks ago

Selected Answer: C

https://docs.aws.amazon.com/sagemaker/latest/dg/data-wrangler-transform.html

upvoted 3 times

...

Exam AWS Certified Machine Learning Engineer - Associate MLA-C01 All Questions

View all questions & answers for the AWS Certified Machine Learning Engineer - Associate MLA-C01 exam

Exam AWS Certified Machine Learning Engineer - Associate MLA-C01 topic 1 question 12 discussion

Comments

ninomfr64

Pofmagic

GiorgioGss

SY0-701