Exam AWS Certified Solutions Architect - Associate SAA-C03 topic 1 question 788 discussion

Question #: 788
Topic #: 1

A company has stored 10 TB of log files in Apache Parquet format in an Amazon S3 bucket. The company occasionally needs to use SQL to analyze the log files.

Which solution will meet these requirements MOST cost-effectively?

A. Create an Amazon Aurora MySQL database. Migrate the data from the S3 bucket into Aurora by using AWS Database Migration Service (AWS DMS). Issue SQL statements to the Aurora database.
B. Create an Amazon Redshift cluster. Use Redshift Spectrum to run SQL statements directly on the data in the S3 bucket.
C. Create an AWS Glue crawler to store and retrieve table metadata from the S3 bucket. Use Amazon Athena to run SQL statements directly on the data in the S3 bucket.
D. Create an Amazon EMR cluster. Use Apache Spark SQL to run SQL statements directly on the data in the S3 bucket.

Suggested Answer: C

by Andy_09 at Feb. 6, 2024, 8:11 a.m.

Comments

LeonSauveterre

6 months, 1 week ago

Selected Answer: C

A - Aurora is cool, but migrating 10 TB of data into it incurs significant cost and operational overhead.
B - Redshift Spectrum can query data directly in S3 without loading it into Redshift, but it still needs a Redshift cluster, and that cost is really high for infrequent use.
C - Athena is serverless and charges only for the data scanned by queries. A Glue crawler automatically extracts metadata and schema information from the Parquet files, so there is no need to migrate anything.
D - An EMR cluster for occasional SQL? Just by the look of it I know I'll go bankrupt if I choose that.

upvoted 1 time
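
The pay-per-scan argument for option C in the comment above can be made concrete with a little arithmetic. The sketch below is only an illustration: the price of 5 USD per TB scanned, the 10 MB per-query minimum, and the binary definition of a TB are assumptions that should be checked against current Athena pricing for your region, and `athena_query_cost` and `monthly_cost` are hypothetical helper names, not part of any AWS SDK.

```python
# Rough cost model for occasional Athena queries over a dataset in S3.
# All constants are ASSUMPTIONS for illustration; verify against current pricing.
ASSUMED_USD_PER_TB = 5.0             # assumed per-TB-scanned price
MIN_BYTES_PER_QUERY = 10 * 1024**2   # assumed 10 MB minimum billed per query
BYTES_PER_TB = 1024**4               # assumed binary terabyte


def athena_query_cost(bytes_scanned, usd_per_tb=ASSUMED_USD_PER_TB):
    """Estimated cost in USD of one query that scans `bytes_scanned` bytes."""
    billable = max(bytes_scanned, MIN_BYTES_PER_QUERY)
    return billable / BYTES_PER_TB * usd_per_tb


def monthly_cost(queries_per_month, fraction_scanned, dataset_tb=10.0,
                 usd_per_tb=ASSUMED_USD_PER_TB):
    """Estimated monthly cost when each query scans a fraction of the dataset.

    Parquet is columnar, so a query that selects a few columns typically
    scans well under 100% of the stored bytes; `fraction_scanned` models that.
    """
    bytes_per_query = dataset_tb * BYTES_PER_TB * fraction_scanned
    return queries_per_month * athena_query_cost(bytes_per_query, usd_per_tb)


if __name__ == "__main__":
    # Two queries a month, each scanning half of the 10 TB of logs.
    print(f"{monthly_cost(2, 0.5):.2f} USD per month (assumed pricing)")
```

Under these assumptions the bill scales with how often and how much you query, and drops to zero in months with no queries, which is why a cluster that bills while idle (options B and D) loses for occasional analysis.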

...