exam questions

Exam AWS Certified Data Engineer - Associate DEA-C01 All Questions

View all questions & answers for the AWS Certified Data Engineer - Associate DEA-C01 exam

Exam AWS Certified Data Engineer - Associate DEA-C01 topic 1 question 114 discussion

A company plans to use Amazon Kinesis Data Firehose to store data in Amazon S3. The source data consists of 2 MB .csv files. The company must convert the .csv files to JSON format. The company must store the files in Apache Parquet format.

Which solution will meet these requirements with the LEAST development effort?

  • A. Use Kinesis Data Firehose to convert the .csv files to JSON. Use an AWS Lambda function to store the files in Parquet format.
  • B. Use Kinesis Data Firehose to convert the .csv files to JSON and to store the files in Parquet format.
  • C. Use Kinesis Data Firehose to invoke an AWS Lambda function that transforms the .csv files to JSON and stores the files in Parquet format.
  • D. Use Kinesis Data Firehose to invoke an AWS Lambda function that transforms the .csv files to JSON. Use Kinesis Data Firehose to store the files in Parquet format.
Show Suggested Answer Hide Answer
Suggested Answer: D 🗳️

Comments

Chosen Answer:
This is a voting comment (?). It is better to Upvote an existing comment if you don't have anything to add.
Switch to a voting comment New
qwertyuio
Highly Voted 8 months ago
Selected Answer: D
https://docs.aws.amazon.com/firehose/latest/dev/record-format-conversion.html
upvoted 8 times
LR2023
8 months ago
why do you need lambda in the middle, per you link Amazon Data Firehose can convert the format of your input data from JSON to Apache Parquet or Apache ORC before storing the data in Amazon S3...my choice is B
upvoted 2 times
mzansikiller
7 months ago
https://docs.aws.amazon.com/firehose/latest/dev/data-transformation.html https://docs.aws.amazon.com/firehose/latest/dev/record-format-conversion.html there is need is need to invoke Lambda
upvoted 2 times
Eleftheriia
4 months, 3 weeks ago
It is stated that " If you want to convert an input format other than JSON, such as comma-separated values (CSV) or structured text, you can use AWS Lambda to transform it to JSON first"
upvoted 2 times
...
...
...
...
mzansikiller
Highly Voted 7 months, 4 weeks ago
Answer D https://docs.aws.amazon.com/firehose/latest/dev/record-format-conversion.html Amazon Data Firehose can convert the format of your input data from JSON to Apache Parquet or Apache ORC before storing the data in Amazon S3. Parquet and ORC are columnar data formats that save space and enable faster queries compared to row-oriented formats like JSON. If you want to convert an input format other than JSON, such as comma-separated values (CSV) or structured text, you can use AWS Lambda to transform it to JSON first. For more information, see Transform data in Amazon Data Firehose.
upvoted 6 times
...
saurwt
Most Recent 1 day, 3 hours ago
Selected Answer: D
Amazon Kinesis Data Firehose does not natively support CSV to JSON conversion. However, it does support JSON to Parquet conversion. Given that, the best approach with the least development effort is: D. Use Kinesis Data Firehose to invoke an AWS Lambda function that transforms the .csv files to JSON. Use Kinesis Data Firehose to store the files in Parquet format.
upvoted 1 times
...
Ramdi1
6 days, 1 hour ago
Selected Answer: D
Kinesis Data Firehose natively supports data format conversion to Parquet, reducing development effort. AWS Lambda is needed only for the CSV to JSON conversion, as Firehose does not support direct CSV to JSON transformation. Firehose then automatically converts JSON to Parquet and stores it in S3, minimizing custom code.
upvoted 1 times
...
Salam9
1 month, 2 weeks ago
Selected Answer: B
https://aws.amazon.com/ar/about-aws/whats-new/2016/12/amazon-kinesis-firehose-can-now-prepare-and-transform-streaming-data-before-loading-it-to-data-stores/
upvoted 1 times
...
kailu
2 months, 1 week ago
Selected Answer: C
Lambda handles both the CSV-to-JSON and JSON-to-Parquet transformations before Firehose stores the data in Amazon S3
upvoted 1 times
...
zoneout
2 months, 3 weeks ago
Selected Answer: D
If you want to convert an input format other than JSON, such as comma-separated values (CSV) or structured text, you can use AWS Lambda to transform it to JSON first and then you can use Amazon Data Firehose can convert the format of your input data from JSON to Apache Parquet or Apache ORC.
upvoted 1 times
...
kailu
2 months, 3 weeks ago
Selected Answer: C
I would go with C. D is close but Kinesis Data Firehose does not really store files in Parquet format.
upvoted 1 times
...
michele_scar
4 months ago
Selected Answer: D
https://docs.aws.amazon.com/firehose/latest/dev/record-format-conversion.html You need firstly a JSON (using Lambda) to be able using Kinesis to store it in Parquet
upvoted 1 times
...
rsmf
4 months, 3 weeks ago
Selected Answer: D
Firehose can't convert csv to json. So, that's D
upvoted 2 times
...
PashoQ
6 months ago
Selected Answer: D
If you want to convert an input format other than JSON, such as comma-separated values (CSV) or structured text, you can use AWS Lambda to transform it to JSON first. For more information
upvoted 2 times
...
mzansikiller
7 months ago
Selected Answer: D
Amazon Data Firehose can convert the format of your input data from JSON to Apache Parquet or Apache ORC before storing the data in Amazon S3. Parquet and ORC are columnar data formats that save space and enable faster queries compared to row-oriented formats like JSON. If you want to convert an input format other than JSON, such as comma-separated values (CSV) or structured text, you can use AWS Lambda to transform it to JSON first. For more information, see Transform source data in Amazon Data Firehose. Answer D
upvoted 3 times
...
Shanmahi
7 months, 1 week ago
Selected Answer: B
Kinesis Data Firehose: It has built-in support for data transformation and format conversion. It can directly convert incoming data from .csv to JSON format and then further convert the data to Apache Parquet format before storing it in Amazon S3. Minimal Development Effort: This option requires the least development effort because Kinesis Data Firehose handles both the transformation (from .csv to JSON) and the format conversion (to Parquet) natively. No additional AWS Lambda functions or custom code are needed.
upvoted 2 times
...
MinTheRanger
7 months, 2 weeks ago
Selected Answer: B
B. Why? Amazon Data Firehose can convert the format of your input data from JSON to Apache Parquet or Apache ORC before storing the data in Amazon S3. https://docs.aws.amazon.com/firehose/latest/dev/record-format-conversion.html With that LEAST development effort, why do we need to use Lambda additionally? :D
upvoted 4 times
mzansikiller
7 months ago
read to understand: Amazon Data Firehose can convert the format of your input data from JSON to Apache Parquet or Apache ORC before storing the data in Amazon S3. Parquet and ORC are columnar data formats that save space and enable faster queries compared to row-oriented formats like JSON. If you want to convert an input format other than JSON, such as comma-separated values (CSV) or structured text, you can use AWS Lambda to transform it to JSON first. For more information, see Transform source data in Amazon Data Firehose.
upvoted 1 times
...
...
valuedate
7 months, 3 weeks ago
Option D - Need to convert the inout data from .csv to JSON first. Firehose can't do that without the help of a lambda function in this case. After firehose can convert to .parquet and deliver it to s3
upvoted 3 times
...
HunkyBunky
8 months, 2 weeks ago
Selected Answer: B
B - least development efforts
upvoted 2 times
...
Alagong
8 months, 2 weeks ago
Selected Answer: B
By using the built-in transformation and format conversion features of Kinesis Data Firehose, you achieve the desired result with minimal custom development, thereby meeting the requirements efficiently and cost-effectively.
upvoted 4 times
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...
exam
Someone Bought Contributor Access for:
SY0-701
London, 1 minute ago