Exam AWS Certified Data Engineer - Associate DEA-C01 topic 1 question 11 discussion

A data engineer needs Amazon Athena queries to finish faster. The data engineer notices that all the files the Athena queries use are currently stored in uncompressed .csv format. The data engineer also notices that users perform most queries by selecting a specific column.
Which solution will MOST speed up the Athena query performance?

  • A. Change the data format from .csv to JSON format. Apply Snappy compression.
  • B. Compress the .csv files by using Snappy compression.
  • C. Change the data format from .csv to Apache Parquet. Apply Snappy compression.
  • D. Compress the .csv files by using gzip compression.
Suggested Answer: C

Comments

milofficial
Highly Voted 1 year, 3 months ago
Selected Answer: C
If the exam only had these kinds of questions, everyone would be blessed.
upvoted 11 times
[Removed]
1 year, 3 months ago
Hahahaha! I believe this kind of question is only there for beta calibration purposes. It won't be in the final exam version.
upvoted 1 times
...
...
TonyStark0122
Highly Voted 1 year, 2 months ago
C. Change the data format from .csv to Apache Parquet. Apply Snappy compression.
Explanation: Apache Parquet is a columnar storage format optimized for analytical queries. It is especially efficient when queries select specific columns, because the engine can read only the referenced columns (column pruning) and can use predicate pushdown to skip data that cannot match a filter.
upvoted 6 times
...
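The column pruning argument in the comment above can be illustrated with a toy model. This is only a sketch using the Python standard library, not real Parquet: the sample rows, column names, and the two helper functions are hypothetical, and "bytes scanned" is approximated by the size of the text that would have to be read.

```python
import csv
import io

# Hypothetical sample data; the column names are illustrative only.
ROWS = [
    {"order_id": str(i), "customer": f"customer-{i}", "amount": f"{i * 1.5:.2f}"}
    for i in range(1000)
]


def row_layout_bytes_scanned(rows, column):
    """CSV-style row layout: the whole file is read to extract one column.

    The `column` argument does not change the result, which is the point:
    every row has to be scanned no matter which column is requested.
    """
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=list(rows[0].keys()))
    writer.writeheader()
    writer.writerows(rows)
    return len(buf.getvalue().encode("utf-8"))


def column_layout_bytes_scanned(rows, column):
    """Columnar layout: each column's values are stored together,
    so only the requested column's chunk needs to be read."""
    chunks = {
        name: "\n".join(r[name] for r in rows).encode("utf-8")
        for name in rows[0]
    }
    return len(chunks[column])


row_bytes = row_layout_bytes_scanned(ROWS, "amount")
col_bytes = column_layout_bytes_scanned(ROWS, "amount")
print(f"row layout: {row_bytes} bytes, columnar layout: {col_bytes} bytes")
```

In this model the single-column read touches only a fraction of the data, which is the effect the question is pointing at. Compression such as Snappy reduces the bytes further, but compressing a .csv file alone (options B and D) still forces a scan of every column.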
Scotty_Nguyen
Most Recent 1 month ago
Selected Answer: C
C is correct
upvoted 1 times
...
GabrielSGoncalves
9 months ago
Selected Answer: C
C is the way to do it, based on the best practices recommended by AWS (https://aws.amazon.com/pt/blogs/big-data/top-10-performance-tuning-tips-for-amazon-athena/)
upvoted 1 times
...
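One common way to apply option C is an Athena CREATE TABLE AS SELECT (CTAS) statement that rewrites the CSV-backed table as Snappy-compressed Parquet. The sketch below only builds the query text; the helper name, table names, and S3 location are hypothetical, and the `write_compression` property is assumed to be supported by the Athena engine version in use (check the Athena CTAS documentation for the exact property names before relying on it).

```python
def build_parquet_ctas(source_table, target_table, s3_location):
    """Return an Athena CTAS statement that copies `source_table`
    into a new Parquet table compressed with Snappy.

    All arguments are placeholders supplied by the caller; nothing
    is executed here, the function only assembles the SQL text.
    """
    return (
        f"CREATE TABLE {target_table}\n"
        "WITH (\n"
        "  format = 'PARQUET',\n"
        "  write_compression = 'SNAPPY',\n"
        f"  external_location = '{s3_location}'\n"
        ")\n"
        f"AS SELECT * FROM {source_table}"
    )


# Hypothetical names, for illustration only.
query = build_parquet_ctas(
    "sales_csv", "sales_parquet", "s3://example-bucket/sales_parquet/"
)
print(query)
```

The resulting statement could then be run from the Athena console or submitted through an SDK. After that, queries that select a single column would target the new Parquet table instead of the original CSV table.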
hnk
11 months, 2 weeks ago
Selected Answer: C
C is correct
upvoted 1 times
...
k350Secops
11 months, 3 weeks ago
Selected Answer: C
Switching to Apache Parquet format with Snappy compression offers the most significant improvement in Athena query performance, especially for queries that select specific columns.
upvoted 1 times
...
d8945a1
11 months, 3 weeks ago
Selected Answer: C
Parquet is a columnar storage format, and the question specifies that users perform most queries by selecting a specific column.
upvoted 1 times
...
wa212
1 year ago
Selected Answer: C
https://aws.amazon.com/jp/blogs/news/top-10-performance-tuning-tips-for-amazon-athena/
upvoted 2 times
...
Alcee
1 year, 2 months ago
C easy
upvoted 1 times
...