exam questions

Exam AWS Certified Machine Learning Engineer - Associate MLA-C01 All Questions

View all questions & answers for the AWS Certified Machine Learning Engineer - Associate MLA-C01 exam

Exam AWS Certified Machine Learning Engineer - Associate MLA-C01 topic 1 question 20 discussion

A company has a large, unstructured dataset. The dataset includes many duplicate records across several key attributes.
Which solution on AWS will detect duplicates in the dataset with the LEAST code development?

  • A. Use Amazon Mechanical Turk jobs to detect duplicates.
  • B. Use Amazon QuickSight ML Insights to build a custom deduplication model.
  • C. Use Amazon SageMaker Data Wrangler to pre-process and detect duplicates.
  • D. Use the AWS Glue FindMatches transform to detect duplicates.
Show Suggested Answer Hide Answer
Suggested Answer: D 🗳️

Comments

Chosen Answer:
This is a voting comment (?). It is better to Upvote an existing comment if you don't have anything to add.
Switch to a voting comment New
Saransundar
Highly Voted 1 month ago
Selected Answer: D
AWS Glue FindMatches is specifically designed to identify duplicate or matching records in datasets without requiring labeled training data. It uses machine learning to find fuzzy matches and allows customization to fine-tune the matching process, making it ideal for this scenario.
upvoted 5 times
...
feelgoodfactor
Most Recent 3 weeks, 2 days ago
Selected Answer: D
The AWS Glue FindMatches transform is the most appropriate solution because it is specifically designed to detect duplicates, requires minimal development effort, and scales efficiently for large datasets.
upvoted 3 times
...
nakidal495
1 month, 1 week ago
Selected Answer: A
I'm not sure but I think this is the correct answer.
upvoted 1 times
...
GiorgioGss
1 month, 1 week ago
Selected Answer: D
https://aws.amazon.com/about-aws/whats-new/2021/11/aws-glue-findmatches-new-data-existing-dataset/ "allows you to identify duplicate or matching records in your dataset"
upvoted 4 times
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...
exam
Someone Bought Contributor Access for:
SY0-701
London, 1 minute ago