Welcome to ExamTopics
ExamTopics Logo
- Expert Verified, Online, Free.
exam questions

Exam AWS Certified Solutions Architect - Associate SAA-C03 All Questions

View all questions & answers for the AWS Certified Solutions Architect - Associate SAA-C03 exam

Exam AWS Certified Solutions Architect - Associate SAA-C03 topic 1 question 65 discussion

A hospital recently deployed a RESTful API with Amazon API Gateway and AWS Lambda. The hospital uses API Gateway and Lambda to upload reports that are in PDF format and JPEG format. The hospital needs to modify the Lambda code to identify protected health information (PHI) in the reports.
Which solution will meet these requirements with the LEAST operational overhead?

  • A. Use existing Python libraries to extract the text from the reports and to identify the PHI from the extracted text.
  • B. Use Amazon Textract to extract the text from the reports. Use Amazon SageMaker to identify the PHI from the extracted text.
  • C. Use Amazon Textract to extract the text from the reports. Use Amazon Comprehend Medical to identify the PHI from the extracted text.
  • D. Use Amazon Rekognition to extract the text from the reports. Use Amazon Comprehend Medical to identify the PHI from the extracted text.
Show Suggested Answer Hide Answer
Suggested Answer: C 🗳️

Comments

Chosen Answer:
This is a voting comment (?) , you can switch to a simple comment.
Switch to a voting comment New
Buruguduystunstugudunstuy
Highly Voted 1 year, 11 months ago
Selected Answer: C
The correct solution is C: Use Amazon Textract to extract the text from the reports. Use Amazon Comprehend Medical to identify the PHI from the extracted text. Option C: Using Amazon Textract to extract the text from the reports, and Amazon Comprehend Medical to identify the PHI from the extracted text, would be the most efficient solution as it would involve the least operational overhead. Textract is specifically designed for extracting text from documents, and Comprehend Medical is a fully managed service that can accurately identify PHI in medical text. This solution would require minimal maintenance and would not incur any additional costs beyond the usage fees for Textract and Comprehend Medical.
upvoted 19 times
Buruguduystunstugudunstuy
1 year, 11 months ago
Option A: Using existing Python libraries to extract the text and identify the PHI from the text would require the hospital to maintain and update the libraries as needed. This would involve operational overhead in terms of keeping the libraries up to date and debugging any issues that may arise. Option B: Using Amazon SageMaker to identify the PHI from the extracted text would involve additional operational overhead in terms of setting up and maintaining a SageMaker model, as well as potentially incurring additional costs for using SageMaker. Option D: Using Amazon Rekognition to extract the text from the reports would not be an effective solution, as Rekognition is primarily designed for image recognition and would not be able to accurately extract text from PDF or JPEG files.
upvoted 8 times
...
...
PaulGa
Most Recent 2 months, 1 week ago
Selected Answer: C
Ans C - Textract to 'read' data; Comprehend to assess whether its PHI
upvoted 2 times
...
awsgeek75
10 months, 1 week ago
Selected Answer: C
Textract = Extract text from PDF/iamges Comprehend Medical = PHI ABD are wrong products for this requirement so won't achieve the results
upvoted 3 times
...
djgodzilla
11 months, 1 week ago
Selected Answer: C
Both Rekognition and Textract possess the ability to detect text within images, yet they are optimized for differing applications. Rekognition specializes in identifying text located spatially within an image, for instance, words displayed on street signs, t-shirts, or license plates. Its typical use cases encompass visual search, content filtering, deriving insights from content, among others. However, it's not the ideal choice for images containing more than 100 words, as this exceeds its limitation. On the other hand, Textract is tailored more towards processing documents and PDFs, offering a comprehensive suite for Optical Character Recognition (OCR). It proves useful in scenarios involving financial reports, medical records, receipts, ID documents, and more.
upvoted 4 times
...
Ruffyit
1 year ago
The correct solution is C: Use Amazon Textract to extract the text from the reports. Use Amazon Comprehend Medical to identify the PHI from the extracted text. Option C: Using Amazon Textract to extract the text from the reports, and Amazon Comprehend Medical to identify the PHI from the extracted text, would be the most efficient solution as it would involve the least operational overhead. Textract is specifically designed for extracting text from documents, and Comprehend Medical is a fully managed service that can accurately identify PHI in medical text. This solution would require minimal maintenance and would not incur any additional costs beyond the usage fees for Textract and Comprehend Medical.
upvoted 2 times
...
AWSStudyBuddy
1 year, 1 month ago
Selected Answer: C
• Amazon Textract: This program is made to extract text and data from scanned documents, such as pictures and PDFs. It helps to retain the formatting of the report by automatically extracting text while preserving the document's layout. Identifying and extracting medical information, including protected health information (PHI), from unstructured text is the specialty of Amazon Comprehend Medical. Medical entities that are frequently included in reporting on healthcare, such as ailments, drugs, and more, can be recognized by it.
upvoted 2 times
...
Chiquitabandita
1 year, 2 months ago
with the choices here, I would go with C, but if offered, I would use amazon textract for the text and use Macie to do the scanning of text files, not comprehend.
upvoted 2 times
...
Guru4Cloud
1 year, 3 months ago
Selected Answer: C
Here's why: Amazon Textract has built-in support to extract text from PDFs and images, eliminating the need to build this yourself with Python libraries. Amazon Comprehend Medical has pre-trained machine learning models to identify PHI entities out-of-the-box, avoiding the need to train your own SageMaker model. Using these fully managed AWS services minimizes operational overhead of maintaining machine learning models yourself.
upvoted 1 times
...
miki111
1 year, 4 months ago
Option C is the right answer.
upvoted 2 times
...
cookieMr
1 year, 5 months ago
Selected Answer: C
C leverages capabilities of Textract, which is a service that automatically extracts text and data from documents, including PDF and JPEG. By using Textract, hospital can extract text content from reports without need for additional custom code or libraries. Once text is extracted, hospital can then use Comprehend Medical, a natural language processing service specifically designed for medical text, to analyze and identify PHI. It can recognize medical entities such as medical conditions, treatments, and patient information. A. suggests using existing Python libraries, which would require hospital to develop and maintain custom code for text extraction and PHI identification. B and D involve using Textract along with SageMaker or Rekognition, respectively, for PHI identification. While these options could work, they introduce additional complexity by incorporating machine learning models and training.
upvoted 2 times
...
channn
1 year, 7 months ago
Key word: hospital!
upvoted 1 times
...
alexiscloud
1 year, 7 months ago
Answer C:
upvoted 1 times
...
Chirantan
1 year, 11 months ago
Selected Answer: C Amazon Textract is a machine learning (ML) service that automatically extracts text, handwriting, and data from scanned documents.
upvoted 3 times
...
career360guru
1 year, 11 months ago
Selected Answer: C
Option C
upvoted 1 times
...
SONA_M_
1 year, 11 months ago
WHY OPTION D IS WRONG
upvoted 1 times
s_fun
1 year, 10 months ago
D is wrong only because Amazon Rekognition doesn't read text, only explicit image contents.
upvoted 3 times
...
mj61
1 year, 10 months ago
B/C you use TextTract to extract text not Rekognition.
upvoted 1 times
...
...
k1kavi1
1 year, 11 months ago
Selected Answer: C
Agreed
upvoted 1 times
...
Rameez1
1 year, 11 months ago
C is correct Textract- for extracting the text and Comprehend to identify the medical info https://aws.amazon.com/comprehend/medical/
upvoted 3 times
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...