Exam AWS Certified Data Engineer - Associate DEA-C01 topic 1 question 153 discussion

Exam question from Amazon's AWS Certified Data Engineer - Associate DEA-C01

Question #: 153
Topic #: 1

A retail company uses an Amazon Redshift data warehouse and an Amazon S3 bucket. The company ingests retail order data into the S3 bucket every day.

The company stores all order data at a single path within the S3 bucket. The data has more than 100 columns. The company ingests the order data from a third-party application that generates more than 30 files in CSV format every day. Each CSV file is between 50 and 70 MB in size.

The company uses Amazon Redshift Spectrum to run queries that select sets of columns. Users aggregate metrics based on daily orders. Recently, users have reported that the performance of the queries has degraded. A data engineer must resolve the performance issues for the queries.

Which combination of steps will meet this requirement with LEAST developmental effort? (Choose two.)

A. Configure the third-party application to create the files in a columnar format.
B. Develop an AWS Glue ETL job to convert the multiple daily CSV files to one file for each day.
C. Partition the order data in the S3 bucket based on order date.
D. Configure the third-party application to create the files in JSON format.
E. Load the JSON data into the Amazon Redshift table in a SUPER type column.

Suggested Answer: AC
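
The effect of a columnar format on the workload in the question can be estimated with some rough arithmetic. The sketch below uses the file counts and sizes from the question (treating "more than 30 files" as exactly 30) and assumes, purely for illustration, that all columns are the same width, that nothing is compressed, and that a query selects 10 of the 100 columns. None of these assumptions comes from the question, so the result is an order-of-magnitude guide only.

```python
def daily_mb(files: int, mb_per_file: int) -> int:
    """Total data landed per day, in MB."""
    return files * mb_per_file


def columnar_scan_mb(total_mb: float, selected_cols: int, total_cols: int) -> float:
    """Approximate MB read when only the selected columns are scanned.

    Assumes equal-width, uncompressed columns (a simplification).
    """
    return total_mb * selected_cols / total_cols


# Figures from the question: 30 files per day, 50 to 70 MB each.
low = daily_mb(30, 50)
high = daily_mb(30, 70)

# With row-oriented CSV, a query has to read whole rows, so it reads
# the full daily volume even when it needs only a few columns.
print(f"CSV scan per day: {low} to {high} MB")

# Hypothetical query selecting 10 of 100 columns from a columnar file.
print(f"Columnar scan per day: {columnar_scan_mb(low, 10, 100)} "
      f"to {columnar_scan_mb(high, 10, 100)} MB")
```

Under these assumptions a columnar layout reads roughly a tenth of the data per day; real Parquet or ORC files are usually compressed as well, so the actual figure would differ.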

by emupsx1 at Nov. 25, 2024, 12:30 a.m.

Disclaimers:

- ExamTopics website is not related to, affiliated with, endorsed or authorized by Amazon.
- Trademarks, certification & product names are used for reference only and belong to Amazon.

Comments

Ell89

2 months ago

Selected Answer: AC

Using Parquet or ORC is efficient, and so is partitioning by order date, because each query then scans a smaller range of data.

upvoted 1 times
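
The partitioning idea in the comment above can be sketched in a few lines. This is a minimal illustration, assuming a Hive-style `key=value` prefix layout (a common convention for partitioned data in S3); the bucket name, the `orders` prefix, the `order_date` key, and the file names are all hypothetical. The `prune` helper only mimics, on a list of keys, what partition pruning achieves: a query filtered on one date never touches objects under other dates.

```python
from datetime import date


def partition_prefix(base: str, order_date: date) -> str:
    """Build a Hive-style prefix such as .../order_date=2024-11-25/."""
    return f"{base.rstrip('/')}/order_date={order_date.isoformat()}/"


def prune(keys: list[str], wanted: date) -> list[str]:
    """Keep only the object keys that sit under the wanted date's partition."""
    marker = f"/order_date={wanted.isoformat()}/"
    return [k for k in keys if marker in k]


BASE = "s3://example-bucket/orders"  # hypothetical bucket and prefix

keys = [
    partition_prefix(BASE, date(2024, 11, 24)) + "part-0001.parquet",
    partition_prefix(BASE, date(2024, 11, 25)) + "part-0001.parquet",
    partition_prefix(BASE, date(2024, 11, 25)) + "part-0002.parquet",
]

# A daily aggregate for 2024-11-25 only needs the last two objects.
for key in prune(keys, date(2024, 11, 25)):
    print(key)
```

With all files at a single path, as in the question, there is no such prefix to filter on, so every daily query has to consider every file.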

italiancloud2025

2 months, 1 week ago

Selected Answer: BC

No, because option A means modifying the third-party application so that it generates files in a columnar format, which may be more complex or not feasible, whereas option B uses a Glue job to consolidate the CSV files without touching the source. Option C is still essential for partitioning by date and optimizing the queries.

upvoted 1 times

emupsx1

5 months ago

Selected Answer: AC

https://docs.aws.amazon.com/redshift/latest/dg/r_SUPER_type.html

upvoted 1 times
