exam questions

Exam AWS Certified Data Engineer - Associate DEA-C01 All Questions

View all questions & answers for the AWS Certified Data Engineer - Associate DEA-C01 exam

Exam AWS Certified Data Engineer - Associate DEA-C01 topic 1 question 66 discussion

A financial company wants to use Amazon Athena to run on-demand SQL queries on a petabyte-scale dataset to support a business intelligence (BI) application. An AWS Glue job that runs during non-business hours updates the dataset once every day. The BI application has a standard data refresh frequency of 1 hour to comply with company policies.
A data engineer wants to cost optimize the company's use of Amazon Athena without adding any additional infrastructure costs.
Which solution will meet these requirements with the LEAST operational overhead?

  • A. Configure an Amazon S3 Lifecycle policy to move data to the S3 Glacier Deep Archive storage class after 1 day.
  • B. Use the query result reuse feature of Amazon Athena for the SQL queries.
  • C. Add an Amazon ElastiCache cluster between the BI application and Athena.
  • D. Change the format of the files that are in the dataset to Apache Parquet.
Show Suggested Answer Hide Answer
Suggested Answer: B 🗳️

Comments

Chosen Answer:
This is a voting comment (?). It is better to Upvote an existing comment if you don't have anything to add.
Switch to a voting comment New
rralucard_
Highly Voted 1 year, 2 months ago
Selected Answer: B
https://docs.aws.amazon.com/athena/latest/ug/performance-tuning.html Use the Query Result Reuse Feature of Amazon Athena. This leverages Athena's built-in feature to reduce redundant data scans and thus lowers query costs.
upvoted 6 times
DevoteamAnalytix
11 months, 3 weeks ago
Yes, seems to be B: https://aws.amazon.com/de/blogs/big-data/reduce-cost-and-improve-query-performance-with-amazon-athena-query-result-reuse/
upvoted 1 times
...
...
Ell89
Most Recent 1 month, 4 weeks ago
Selected Answer: D
D query result reuse will benefit the same queries that are being re-run, it wont benefit new queries. parquet will benefit all queries.
upvoted 1 times
...
rsmf
6 months ago
Selected Answer: B
Why not D? The question specifies the option with the least overhead, and it clearly states that the Glue job runs once a day. Since the data for that day will not change, there’s no need for additional overhead.
upvoted 1 times
...
MinTheRanger
8 months, 3 weeks ago
D. Because "query reuse feature" is reliable only when it's identical but here hourly refresh might be on data related to that hour.
upvoted 1 times
...
MinTheRanger
8 months, 3 weeks ago
Why not D?
upvoted 3 times
...
Ousseyni
1 year ago
Selected Answer: B
B. Use the query result reuse feature of Amazon Athena for the SQL queries.
upvoted 2 times
...
FuriouZ
1 year ago
Selected Answer: B
It's B: Glacier adds more retrieval time and the other options cost some money
upvoted 1 times
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...
exam
Someone Bought Contributor Access for:
SY0-701
London, 1 minute ago