exam questions

Exam AWS Certified Data Engineer - Associate DEA-C01 All Questions

View all questions & answers for the AWS Certified Data Engineer - Associate DEA-C01 exam

Exam AWS Certified Data Engineer - Associate DEA-C01 topic 1 question 67 discussion

A company's data engineer needs to optimize the performance of table SQL queries. The company stores data in an Amazon Redshift cluster. The data engineer cannot increase the size of the cluster because of budget constraints.
The company stores the data in multiple tables and loads the data by using the EVEN distribution style. Some tables are hundreds of gigabytes in size. Other tables are less than 10 MB in size.
Which solution will meet these requirements?

  • A. Keep using the EVEN distribution style for all tables. Specify primary and foreign keys for all tables.
  • B. Use the ALL distribution style for large tables. Specify primary and foreign keys for all tables.
  • C. Use the ALL distribution style for rarely updated small tables. Specify primary and foreign keys for all tables.
  • D. Specify a combination of distribution, sort, and partition keys for all tables.
Show Suggested Answer Hide Answer
Suggested Answer: C 🗳️

Comments

Chosen Answer:
This is a voting comment (?). It is better to Upvote an existing comment if you don't have anything to add.
Switch to a voting comment New
rralucard_
Highly Voted 1 year, 2 months ago
Selected Answer: C
Use the ALL Distribution Style for Rarely Updated Small Tables. This approach optimizes the performance of joins involving these smaller tables and is a common best practice in Redshift data warehousing. For the larger tables, maintaining the EVEN distribution style or considering a KEY-based distribution (if there are common join columns) could be more appropriate.
upvoted 8 times
...
Tester_TKK
Most Recent 1 week, 2 days ago
Selected Answer: C
D is wrong. There is no partition key in Redshift
upvoted 1 times
...
jk15997
5 months, 2 weeks ago
why not D?
upvoted 3 times
...
pypelyncar
10 months, 3 weeks ago
Selected Answer: C
For small tables (less than 10 MB in size) that are rarely updated, using the ALL distribution style can provide better query performance. With the ALL distribution style, each compute node stores a copy of the entire table, eliminating the need for data redistribution or shuffling during certain queries. This can significantly improve query performance, especially for joins and aggregations involving small tables.
upvoted 3 times
...
DevoteamAnalytix
11 months, 4 weeks ago
Selected Answer: C
"ALL distribution is appropriate only for relatively slow moving tables; that is, tables that are not updated frequently or extensively." (https://docs.aws.amazon.com/redshift/latest/dg/c_choosing_dist_sort.html)
upvoted 2 times
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...
exam
Someone Bought Contributor Access for:
SY0-701
London, 1 minute ago