Exam AWS Certified Solutions Architect - Associate SAA-C03 All Questions

View all questions & answers for the AWS Certified Solutions Architect - Associate SAA-C03 exam

Exam AWS Certified Solutions Architect - Associate SAA-C03 topic 1 question 65 discussion

Exam question from Amazon's AWS Certified Solutions Architect - Associate SAA-C03

Question #: 65
Topic #: 1

[All AWS Certified Solutions Architect - Associate SAA-C03 Questions]

A hospital recently deployed a RESTful API with Amazon API Gateway and AWS Lambda. The hospital uses API Gateway and Lambda to upload reports that are in PDF format and JPEG format. The hospital needs to modify the Lambda code to identify protected health information (PHI) in the reports.
Which solution will meet these requirements with the LEAST operational overhead?

A. Use existing Python libraries to extract the text from the reports and to identify the PHI from the extracted text.
B. Use Amazon Textract to extract the text from the reports. Use Amazon SageMaker to identify the PHI from the extracted text.
C. Use Amazon Textract to extract the text from the reports. Use Amazon Comprehend Medical to identify the PHI from the extracted text.
D. Use Amazon Rekognition to extract the text from the reports. Use Amazon Comprehend Medical to identify the PHI from the extracted text.

Show Suggested Answer

Suggested Answer: C 🗳️

by tubtab at Oct. 13, 2022, 10:14 a.m.

Disclaimers:

- ExamTopics website is not related to, affiliated with, endorsed or authorized by Amazon.
- Trademarks, certification & product names are used for reference only and belong to Amazon.

Comments

Submit Cancel

Buruguduystunstugudunstuy

Highly Voted 2 years, 7 months ago

Selected Answer: C

The correct solution is C: Use Amazon Textract to extract the text from the reports. Use Amazon Comprehend Medical to identify the PHI from the extracted text. Option C: Using Amazon Textract to extract the text from the reports, and Amazon Comprehend Medical to identify the PHI from the extracted text, would be the most efficient solution as it would involve the least operational overhead. Textract is specifically designed for extracting text from documents, and Comprehend Medical is a fully managed service that can accurately identify PHI in medical text. This solution would require minimal maintenance and would not incur any additional costs beyond the usage fees for Textract and Comprehend Medical.

upvoted 21 times

Buruguduystunstugudunstuy

2 years, 7 months ago

Option A: Using existing Python libraries to extract the text and identify the PHI from the text would require the hospital to maintain and update the libraries as needed. This would involve operational overhead in terms of keeping the libraries up to date and debugging any issues that may arise. Option B: Using Amazon SageMaker to identify the PHI from the extracted text would involve additional operational overhead in terms of setting up and maintaining a SageMaker model, as well as potentially incurring additional costs for using SageMaker. Option D: Using Amazon Rekognition to extract the text from the reports would not be an effective solution, as Rekognition is primarily designed for image recognition and would not be able to accurately extract text from PDF or JPEG files.

upvoted 8 times

...

Clouddon

Most Recent 1 week, 4 days ago

Selected Answer: C

Use Textract to extract text from medical reports (PDFs, JPEGs) and Comprehend Medical to detect PHI — with minimal code and maximum accuracy.

upvoted 1 times

...

satyaammm

6 months, 2 weeks ago

Selected Answer: C

Textract is more suitable than Rekognition as it is build to scan text documents and Comprehend helps identify PHI.

upvoted 1 times

...

PaulGa

10 months, 1 week ago

Selected Answer: C

Ans C - Textract to 'read' data; Comprehend to assess whether its PHI

upvoted 2 times

...

awsgeek75

1 year, 6 months ago

Selected Answer: C

Textract = Extract text from PDF/iamges Comprehend Medical = PHI ABD are wrong products for this requirement so won't achieve the results

upvoted 3 times

...

djgodzilla

1 year, 7 months ago

Selected Answer: C

Both Rekognition and Textract possess the ability to detect text within images, yet they are optimized for differing applications. Rekognition specializes in identifying text located spatially within an image, for instance, words displayed on street signs, t-shirts, or license plates. Its typical use cases encompass visual search, content filtering, deriving insights from content, among others. However, it's not the ideal choice for images containing more than 100 words, as this exceeds its limitation. On the other hand, Textract is tailored more towards processing documents and PDFs, offering a comprehensive suite for Optical Character Recognition (OCR). It proves useful in scenarios involving financial reports, medical records, receipts, ID documents, and more.

upvoted 4 times

...

Ruffyit

1 year, 8 months ago

upvoted 2 times

...

AWSStudyBuddy

1 year, 9 months ago

Selected Answer: C

• Amazon Textract: This program is made to extract text and data from scanned documents, such as pictures and PDFs. It helps to retain the formatting of the report by automatically extracting text while preserving the document's layout. Identifying and extracting medical information, including protected health information (PHI), from unstructured text is the specialty of Amazon Comprehend Medical. Medical entities that are frequently included in reporting on healthcare, such as ailments, drugs, and more, can be recognized by it.

upvoted 2 times

...

Chiquitabandita

1 year, 10 months ago

with the choices here, I would go with C, but if offered, I would use amazon textract for the text and use Macie to do the scanning of text files, not comprehend.

upvoted 2 times

...

Guru4Cloud

1 year, 11 months ago

Selected Answer: C

Here's why: Amazon Textract has built-in support to extract text from PDFs and images, eliminating the need to build this yourself with Python libraries. Amazon Comprehend Medical has pre-trained machine learning models to identify PHI entities out-of-the-box, avoiding the need to train your own SageMaker model. Using these fully managed AWS services minimizes operational overhead of maintaining machine learning models yourself.

upvoted 1 times

...

miki111

1 year, 12 months ago

Option C is the right answer.

upvoted 2 times

...

cookieMr

2 years ago

Selected Answer: C

C leverages capabilities of Textract, which is a service that automatically extracts text and data from documents, including PDF and JPEG. By using Textract, hospital can extract text content from reports without need for additional custom code or libraries. Once text is extracted, hospital can then use Comprehend Medical, a natural language processing service specifically designed for medical text, to analyze and identify PHI. It can recognize medical entities such as medical conditions, treatments, and patient information. A. suggests using existing Python libraries, which would require hospital to develop and maintain custom code for text extraction and PHI identification. B and D involve using Textract along with SageMaker or Rekognition, respectively, for PHI identification. While these options could work, they introduce additional complexity by incorporating machine learning models and training.

upvoted 2 times

...