Welcome to ExamTopics
ExamTopics Logo
- Expert Verified, Online, Free.

Unlimited Access

Get Unlimited Contributor Access to the all ExamTopics Exams!
Take advantage of PDF Files for 1000+ Exams along with community discussions and pass IT Certification Exams Easily.

Exam AWS Certified Solutions Architect - Associate SAA-C03 topic 1 question 156 discussion

A company produces batch data that comes from different databases. The company also produces live stream data from network sensors and application APIs. The company needs to consolidate all the data into one place for business analytics. The company needs to process the incoming data and then stage the data in different Amazon S3 buckets. Teams will later run one-time queries and import the data into a business intelligence tool to show key performance indicators (KPIs).
Which combination of steps will meet these requirements with the LEAST operational overhead? (Choose two.)

  • A. Use Amazon Athena for one-time queries. Use Amazon QuickSight to create dashboards for KPIs.
  • B. Use Amazon Kinesis Data Analytics for one-time queries. Use Amazon QuickSight to create dashboards for KPIs.
  • C. Create custom AWS Lambda functions to move the individual records from the databases to an Amazon Redshift cluster.
  • D. Use an AWS Glue extract, transform, and load (ETL) job to convert the data into JSON format. Load the data into multiple Amazon OpenSearch Service (Amazon Elasticsearch Service) clusters.
  • E. Use blueprints in AWS Lake Formation to identify the data that can be ingested into a data lake. Use AWS Glue to crawl the source, extract the data, and load the data into Amazon S3 in Apache Parquet format.
Show Suggested Answer Hide Answer
Suggested Answer: AC 🗳️

Comments

Chosen Answer:
This is a voting comment (?) , you can switch to a simple comment.
Switch to a voting comment New
Wazhija
Highly Voted 1 year, 8 months ago
Selected Answer: AE
I believe AE makes the most sense
upvoted 16 times
...
Six_Fingered_Jose
Highly Voted 1 year, 8 months ago
Selected Answer: AE
yeah AE makes sense, only E is working with S3 here and questions wants them to be in S3
upvoted 11 times
...
lofzee
Most Recent 1 month, 1 week ago
Selected Answer: AE
C and D = too much overhead B = incorrect because Athena is used for one time queries. That leaves A and E
upvoted 1 times
...
awsgeek75
6 months, 1 week ago
Selected Answer: AE
A is a given due to Athena and QuickSight option. Between C and E, the AWS Lake Formation is a more managed solution so it should have less operational overhead that writing Custom AWS Lambda. AE should be preferred over AC.
upvoted 3 times
awsgeek75
5 months, 3 weeks ago
E is only confusing because of Apache Parquet format (like a grid?) what's the point of that in the context of this quesiton?
upvoted 2 times
...
...
Guru4Cloud
10 months, 3 weeks ago
Selected Answer: AE
The reasons are: AWS Lake Formation and Glue provide automated data lake creation with minimal coding. Glue crawlers identify sources and ETL jobs load to S3. Athena allows ad-hoc queries directly on S3 data with no infrastructure to manage. QuickSight provides easy cloud BI for dashboards. Options C and D require significant custom coding for ETL and queries. Redshift and OpenSearch would require additional setup and management overhead.
upvoted 7 times
...
Mia2009687
1 year ago
Selected Answer: AE
It combines data from database and stream data, so data lake needs to be used. And it wants to do one time query, so Athena is better.
upvoted 2 times
...
TTaws
1 year ago
@Golcha once the data comes from different sources then you use GLUE
upvoted 1 times
...
Jeeva28
1 year, 1 month ago
Selected Answer: AC
Less Overhead with option AC .No need to manage
upvoted 1 times
pentium75
6 months, 1 week ago
But C moves the data to Redshift while the question says you want it in S3 (and Athena from answer A also needs it in S3).
upvoted 3 times
...
...
Golcha
1 year, 2 months ago
Selected Answer: AC
No specific use case for GLUE
upvoted 1 times
TTaws
1 year ago
once the data comes from different sources then you use GLUE
upvoted 2 times
...
pentium75
6 months, 1 week ago
C moves the data to Redshift while the question says you want it in S3 (and Athena from answer A also needs it in S3).
upvoted 2 times
...
...
TECHNOWARRIOR
1 year, 2 months ago
The Apache Parquet format is a performance-oriented, column-based data format designed for storage and retrieval. It is generally faster for reads than writes because of its columnar storage layout and a pre-computed schema that is written with the data into the files. AWS Glue’s Parquet writer offers fast write performance and flexibility to handle evolving datasets. You can use AWS Glue to read Parquet files from Amazon S3 and from streaming sources as well as write Parquet files to Amazon S3. When using AWS Glue to build a data lake foundation, it automatically crawls your Amazon S3 data, identifies data formats, and then suggests schemas for use with other AWS analytic services[1][2][3][4].
upvoted 4 times
...
TECHNOWARRIOR
1 year, 2 months ago
ANSWER - AE:Amazon Athena is the best choice for running one-time queries on streaming data. Although Amazon Kinesis Data Analytics provides an easy and familiar standard SQL language to analyze streaming data in real-time, it is designed for continuous queries rather than one-time queries[1]. On the other hand, Amazon Athena is a serverless interactive query service that allows querying data in Amazon S3 using SQL. It is optimized for ad-hoc querying and is ideal for running one-time queries on streaming data[2].AWS Lake Formation uses as a central place to have all your data for analytics purposes (E). Athena integrate perfect with S3 and can makes queries (A).
upvoted 4 times
...
jcramos
1 year, 3 months ago
Selected Answer: AE
AWS Lake Formation uses as a central place to have all your data for analytics purposes (E). Athena integrate perfect with S3 and can makes queries (A).
upvoted 2 times
jcramos
1 year, 3 months ago
Why S3 in Apache Parquet? https://aws.amazon.com/about-aws/whats-new/2018/12/amazon-s3-announces-parquet-output-format-for-inventory/
upvoted 1 times
...
...
JiyuKim
1 year, 4 months ago
Can anyone please explain me why B cannot be an answer?
upvoted 6 times
Shrestwt
1 year, 2 months ago
Kinesis Data Analytics is designed for continuous queries rather than one-time queries.
upvoted 6 times
...
...
ashishvineetlko
1 year, 5 months ago
can anyone help me in below question 36. A company has a Java application that uses Amazon Simple Queue Service (Amazon SOS) to parse messages. The application cannot parse messages that are large on 256KB size. The company wants to implement a solution to give the application the ability to parse messages as large as 50 MB. Which solution will meet these requirements with the FEWEST changes to the code? a) Use the Amazon SOS Extended Client Library for Java to host messages that are larger than 256 KB in Amazon S3. b) Use Amazon EventBridge to post large messages from the application instead of Aaron SOS c) Change the limit in Amazon SQS to handle messages that are larger than 256 KB d) Store messages that are larger than 256 KB in Amazon Elastic File System (Amazon EFS) Configure Amazon SQS to reference this location in the messages.
upvoted 1 times
skondey
1 year, 4 months ago
I will do "A" as well.
upvoted 1 times
...
ProfXsamson
1 year, 5 months ago
A would probably be the best answer. Sqs extended client library is for Java apps.
upvoted 1 times
...
...
bullrem
1 year, 5 months ago
Selected Answer: DE
I believe DE makes the most sense
upvoted 1 times
pentium75
6 months, 1 week ago
Why use OpenSearch service?
upvoted 2 times
...
...
ShinobiGrappler
1 year, 5 months ago
Selected Answer: AE
stored in s3 -> data lake -> athena (process the SQL parquet format)-> quicksight visualize
upvoted 5 times
...
Zerotn3
1 year, 6 months ago
Selected Answer: BE
While Amazon Athena is a fully managed service that makes it easy to analyze data stored in Amazon S3 using SQL, it is primarily designed for running ad-hoc queries on data stored in Amazon S3. It may not be the best choice for running one-time queries on streaming data, as it is not designed to process data in real-time. Additionally, using Amazon Athena for one-time queries on streaming data could potentially lead to higher operational overhead, as you would need to set up and maintain the necessary infrastructure to stream the data into Amazon S3, and then query the data using Athena. Using Amazon Kinesis Data Analytics, as mentioned in option B, would be a better choice for running one-time queries on streaming data, as it is specifically designed to process data in real-time and can automatically scale to match the incoming data rate.
upvoted 2 times
JayBee65
1 year, 6 months ago
"Company needs to consolidate all the data into one place" -> S3 bucket, which is happening in E, which means Athena would not have an issue, so A is ok.
upvoted 4 times
jainparag1
1 year, 5 months ago
Absolutely, querying data is after staging and so Athena fits perfectly.
upvoted 1 times
...
...
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...
ex Want to SAVE BIG on Certification Exam Prep?
close
ex Unlock All Exams with ExamTopics Pro 75% Off
  • arrow Choose From 1000+ Exams
  • arrow Access to 10 Exams per Month
  • arrow PDF Format Available
  • arrow Inline Discussions
  • arrow No Captcha/Robot Checks
Limited Time Offer
Ends in