exam questions

Exam AWS Certified Machine Learning Engineer - Associate MLA-C01 All Questions

View all questions & answers for the AWS Certified Machine Learning Engineer - Associate MLA-C01 exam

Exam AWS Certified Machine Learning Engineer - Associate MLA-C01 topic 1 question 54 discussion

A company uses Amazon SageMaker for its ML workloads. The company's ML engineer receives a 50 MB Apache Parquet data file to build a fraud detection model. The file includes several correlated columns that are not required.
What should the ML engineer do to drop the unnecessary columns in the file with the LEAST effort?

  • A. Download the file to a local workstation. Perform one-hot encoding by using a custom Python script.
  • B. Create an Apache Spark job that uses a custom processing script on Amazon EMR.
  • C. Create a SageMaker processing job by calling the SageMaker Python SDK.
  • D. Create a data flow in SageMaker Data Wrangler. Configure a transform step.
Show Suggested Answer Hide Answer
Suggested Answer: D 🗳️

Comments

Chosen Answer:
This is a voting comment (?). It is better to Upvote an existing comment if you don't have anything to add.
Switch to a voting comment New
Saransundar
1 month, 2 weeks ago
Selected Answer: D
Parquet data file → SageMaker Data Wrangler → Explore data → Transform → Drop unnecessary columns → Clean and preprocess data → Export to S3 → Fraud detection model
upvoted 1 times
...
GiorgioGss
1 month, 3 weeks ago
Selected Answer: D
https://docs.aws.amazon.com/sagemaker/latest/dg/data-wrangler-transform.html
upvoted 2 times
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...
exam
Someone Bought Contributor Access for:
SY0-701
London, 1 minute ago