exam questions

Exam AWS Certified Data Engineer - Associate DEA-C01 All Questions

View all questions & answers for the AWS Certified Data Engineer - Associate DEA-C01 exam

Exam AWS Certified Data Engineer - Associate DEA-C01 topic 1 question 60 discussion

An airline company is collecting metrics about flight activities for analytics. The company is conducting a proof of concept (POC) test to show how analytics can provide insights that the company can use to increase on-time departures.
The POC test uses objects in Amazon S3 that contain the metrics in .csv format. The POC test uses Amazon Athena to query the data. The data is partitioned in the S3 bucket by date.
As the amount of data increases, the company wants to optimize the storage solution to improve query performance.
Which combination of solutions will meet these requirements? (Choose two.)

  • A. Add a randomized string to the beginning of the keys in Amazon S3 to get more throughput across partitions.
  • B. Use an S3 bucket that is in the same account that uses Athena to query the data.
  • C. Use an S3 bucket that is in the same AWS Region where the company runs Athena queries.
  • D. Preprocess the .csv data to JSON format by fetching only the document keys that the query requires.
  • E. Preprocess the .csv data to Apache Parquet format by fetching only the data blocks that are needed for predicates.
Show Suggested Answer Hide Answer
Suggested Answer: CE 🗳️

Comments

Chosen Answer:
This is a voting comment (?). It is better to Upvote an existing comment if you don't have anything to add.
Switch to a voting comment New
rralucard_
Highly Voted 8 months, 4 weeks ago
Selected Answer: CE
https://docs.aws.amazon.com/athena/latest/ug/performance-tuning.html
upvoted 6 times
...
Ramdi1
Most Recent 1 month, 2 weeks ago
Selected Answer: CE
C - Reduces latency and network costs → When Athena queries S3 data in the same AWS Region, data does not cross AWS Regions, improving performance. Lower query execution time → No inter-region data transfer delays. Cost-Effective → AWS charges for cross-region data transfers, but querying within the same region avoids these costs. E - Parquet is a columnar storage format → Queries can fetch only needed columns, reducing scanning costs.
upvoted 1 times
...
tgv
5 months ago
Selected Answer: CE
I will go with C and E.
upvoted 1 times
...
matasejem
6 months, 2 weeks ago
C is not mentioned anywhere in the https://docs.aws.amazon.com/athena/latest/ug/performance-tuning.html
upvoted 1 times
...
damaldon
7 months, 3 weeks ago
Answer C and E
upvoted 1 times
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...
exam
Someone Bought Contributor Access for:
SY0-701
London, 1 minute ago