exam questions

Exam AWS Certified Machine Learning - Specialty All Questions

View all questions & answers for the AWS Certified Machine Learning - Specialty exam

Exam AWS Certified Machine Learning - Specialty topic 1 question 133 discussion

A company is converting a large number of unstructured paper receipts into images. The company wants to create a model based on natural language processing
(NLP) to find relevant entities such as date, location, and notes, as well as some custom entities such as receipt numbers.
The company is using optical character recognition (OCR) to extract text for data labeling. However, documents are in different structures and formats, and the company is facing challenges with setting up the manual workflows for each document type. Additionally, the company trained a named entity recognition (NER) model for custom entity detection using a small sample size. This model has a very low confidence score and will require retraining with a large dataset.
Which solution for text extraction and entity detection will require the LEAST amount of effort?

  • A. Extract text from receipt images by using Amazon Textract. Use the Amazon SageMaker BlazingText algorithm to train on the text for entities and custom entities.
  • B. Extract text from receipt images by using a deep learning OCR model from the AWS Marketplace. Use the NER deep learning model to extract entities.
  • C. Extract text from receipt images by using Amazon Textract. Use Amazon Comprehend for entity detection, and use Amazon Comprehend custom entity recognition for custom entity detection.
  • D. Extract text from receipt images by using a deep learning OCR model from the AWS Marketplace. Use Amazon Comprehend for entity detection, and use Amazon Comprehend custom entity recognition for custom entity detection.
Show Suggested Answer Hide Answer
Suggested Answer: C 🗳️

Comments

Chosen Answer:
This is a voting comment (?). It is better to Upvote an existing comment if you don't have anything to add.
Switch to a voting comment New
exam_prep
Highly Voted 2 years, 4 months ago
C is the correct answer. You definitely need Amazon Textract service which eliminate options B & D. Between A & C - Comprehend will quicker.
upvoted 15 times
...
vkbajoria
Most Recent 6 months, 3 weeks ago
Selected Answer: C
Textract and Comprehend will do the job
upvoted 1 times
...
james2033
7 months, 2 weeks ago
Selected Answer: C
Keywords 'Amazon Textract' and 'Amazon Comprehend'
upvoted 1 times
...
Mickey321
1 year, 1 month ago
Selected Answer: C
C indeed due to least effort
upvoted 1 times
...
kaike_reis
1 year, 2 months ago
Selected Answer: C
C is correct
upvoted 1 times
...
ADVIT
1 year, 3 months ago
I think C
upvoted 1 times
...
alp_ileri
1 year, 7 months ago
Selected Answer: C
I go for C
upvoted 2 times
...
Valcilio
1 year, 7 months ago
Selected Answer: C
C is the best answer, textract is to extract data from documents and comprehend to understand the filling, objective or origin of a file.
upvoted 1 times
...
damaldon
1 year, 8 months ago
C is correct, you can extract Entity information easily with Comprehend. https://aws.amazon.com/comprehend/features/
upvoted 4 times
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...
exam
Someone Bought Contributor Access for:
SY0-701
London, 1 minute ago