Exam AWS Certified Solutions Architect - Associate SAA-C03 All Questions

View all questions & answers for the AWS Certified Solutions Architect - Associate SAA-C03 exam

Exam AWS Certified Solutions Architect - Associate SAA-C03 topic 1 question 156 discussion

Exam question from Amazon's AWS Certified Solutions Architect - Associate SAA-C03

Question #: 156
Topic #: 1

[All AWS Certified Solutions Architect - Associate SAA-C03 Questions]

A company produces batch data that comes from different databases. The company also produces live stream data from network sensors and application APIs. The company needs to consolidate all the data into one place for business analytics. The company needs to process the incoming data and then stage the data in different Amazon S3 buckets. Teams will later run one-time queries and import the data into a business intelligence tool to show key performance indicators (KPIs).
Which combination of steps will meet these requirements with the LEAST operational overhead? (Choose two.)

A. Use Amazon Athena for one-time queries. Use Amazon QuickSight to create dashboards for KPIs.
B. Use Amazon Kinesis Data Analytics for one-time queries. Use Amazon QuickSight to create dashboards for KPIs.
C. Create custom AWS Lambda functions to move the individual records from the databases to an Amazon Redshift cluster.
D. Use an AWS Glue extract, transform, and load (ETL) job to convert the data into JSON format. Load the data into multiple Amazon OpenSearch Service (Amazon Elasticsearch Service) clusters.
E. Use blueprints in AWS Lake Formation to identify the data that can be ingested into a data lake. Use AWS Glue to crawl the source, extract the data, and load the data into Amazon S3 in Apache Parquet format.

Show Suggested Answer

Suggested Answer: AE 🗳️

by Wazhija at Oct. 18, 2022, 7:28 a.m.

Disclaimers:

- ExamTopics website is not related to, affiliated with, endorsed or authorized by Amazon.
- Trademarks, certification & product names are used for reference only and belong to Amazon.

Comments

Submit Cancel

Wazhija

Highly Voted 2 years, 6 months ago

Selected Answer: AE

I believe AE makes the most sense

upvoted 17 times

...

Six_Fingered_Jose

Highly Voted 2 years, 5 months ago

Selected Answer: AE

yeah AE makes sense, only E is working with S3 here and questions wants them to be in S3

upvoted 12 times

...

Dharmarajan

Most Recent 2 months, 1 week ago

Selected Answer: AE

A&E - least operational overhead - E over D

upvoted 1 times

...

sdelena

2 months, 2 weeks ago

Selected Answer: AE

A,E is ok

upvoted 1 times

...

PaulGa

6 months ago

Selected Answer: AD

Ans A, D - A everyone seems to agree; I choose D over E because Parquet is aimed at columnar data - and that is not specified and may restrict query type access

upvoted 2 times

...

jaradat02

8 months, 3 weeks ago

Selected Answer: AE

AE satisfies the requirements that demand that the data should be stored in s3 and a one-time analytic will run on it.

upvoted 3 times

...

lofzee

10 months, 3 weeks ago

Selected Answer: AE

C and D = too much overhead B = incorrect because Athena is used for one time queries. That leaves A and E

upvoted 3 times

...

awsgeek75

1 year, 3 months ago

Selected Answer: AE

A is a given due to Athena and QuickSight option. Between C and E, the AWS Lake Formation is a more managed solution so it should have less operational overhead that writing Custom AWS Lambda. AE should be preferred over AC.

upvoted 4 times

awsgeek75

1 year, 3 months ago

E is only confusing because of Apache Parquet format (like a grid?) what's the point of that in the context of this quesiton?

upvoted 5 times

...

Guru4Cloud

1 year, 8 months ago

Selected Answer: AE

The reasons are: AWS Lake Formation and Glue provide automated data lake creation with minimal coding. Glue crawlers identify sources and ETL jobs load to S3. Athena allows ad-hoc queries directly on S3 data with no infrastructure to manage. QuickSight provides easy cloud BI for dashboards. Options C and D require significant custom coding for ETL and queries. Redshift and OpenSearch would require additional setup and management overhead.

upvoted 8 times

...

Mia2009687

1 year, 9 months ago

Selected Answer: AE

It combines data from database and stream data, so data lake needs to be used. And it wants to do one time query, so Athena is better.

upvoted 4 times

...

TTaws

1 year, 9 months ago

@Golcha once the data comes from different sources then you use GLUE

upvoted 3 times

...

Jeeva28

1 year, 10 months ago

Selected Answer: AC

Less Overhead with option AC .No need to manage

upvoted 1 times

pentium75

1 year, 3 months ago

But C moves the data to Redshift while the question says you want it in S3 (and Athena from answer A also needs it in S3).

upvoted 4 times

...

Golcha

2 years ago

Selected Answer: AC

No specific use case for GLUE

upvoted 1 times

TTaws

1 year, 9 months ago

once the data comes from different sources then you use GLUE

upvoted 3 times

...

pentium75

1 year, 3 months ago

C moves the data to Redshift while the question says you want it in S3 (and Athena from answer A also needs it in S3).

upvoted 2 times

...

TECHNOWARRIOR

2 years ago

The Apache Parquet format is a performance-oriented, column-based data format designed for storage and retrieval. It is generally faster for reads than writes because of its columnar storage layout and a pre-computed schema that is written with the data into the files. AWS Glue’s Parquet writer offers fast write performance and flexibility to handle evolving datasets. You can use AWS Glue to read Parquet files from Amazon S3 and from streaming sources as well as write Parquet files to Amazon S3. When using AWS Glue to build a data lake foundation, it automatically crawls your Amazon S3 data, identifies data formats, and then suggests schemas for use with other AWS analytic services[1][2][3][4].

upvoted 6 times

...

TECHNOWARRIOR

2 years ago

ANSWER - AE:Amazon Athena is the best choice for running one-time queries on streaming data. Although Amazon Kinesis Data Analytics provides an easy and familiar standard SQL language to analyze streaming data in real-time, it is designed for continuous queries rather than one-time queries[1]. On the other hand, Amazon Athena is a serverless interactive query service that allows querying data in Amazon S3 using SQL. It is optimized for ad-hoc querying and is ideal for running one-time queries on streaming data[2].AWS Lake Formation uses as a central place to have all your data for analytics purposes (E). Athena integrate perfect with S3 and can makes queries (A).

upvoted 5 times

...

jcramos

2 years ago

Selected Answer: AE

AWS Lake Formation uses as a central place to have all your data for analytics purposes (E). Athena integrate perfect with S3 and can makes queries (A).

upvoted 5 times

jcramos

2 years ago

Why S3 in Apache Parquet? https://aws.amazon.com/about-aws/whats-new/2018/12/amazon-s3-announces-parquet-output-format-for-inventory/

upvoted 2 times

...

JiyuKim

2 years, 2 months ago

Can anyone please explain me why B cannot be an answer?

upvoted 6 times

Shrestwt

1 year, 12 months ago

Kinesis Data Analytics is designed for continuous queries rather than one-time queries.

upvoted 8 times

...

Load full discussion...

Exam AWS Certified Solutions Architect - Associate SAA-C03 All Questions

View all questions & answers for the AWS Certified Solutions Architect - Associate SAA-C03 exam

Exam AWS Certified Solutions Architect - Associate SAA-C03 topic 1 question 156 discussion

Comments

Wazhija

Six_Fingered_Jose

Dharmarajan

sdelena

PaulGa

jaradat02

lofzee

awsgeek75

awsgeek75

Guru4Cloud

Mia2009687

TTaws

Jeeva28

pentium75

Golcha

TTaws

pentium75

TECHNOWARRIOR

TECHNOWARRIOR

jcramos

jcramos

JiyuKim

Shrestwt

SY0-701