Exam AWS Certified Data Engineer - Associate DEA-C01 topic 1 question 114 discussion

A company plans to use Amazon Kinesis Data Firehose to store data in Amazon S3. The source data consists of 2 MB .csv files. The company must convert the .csv files to JSON format. The company must store the files in Apache Parquet format.

Which solution will meet these requirements with the LEAST development effort?

  • A. Use Kinesis Data Firehose to convert the .csv files to JSON. Use an AWS Lambda function to store the files in Parquet format.
  • B. Use Kinesis Data Firehose to convert the .csv files to JSON and to store the files in Parquet format.
  • C. Use Kinesis Data Firehose to invoke an AWS Lambda function that transforms the .csv files to JSON and stores the files in Parquet format.
  • D. Use Kinesis Data Firehose to invoke an AWS Lambda function that transforms the .csv files to JSON. Use Kinesis Data Firehose to store the files in Parquet format.
Suggested Answer: D

Comments

qwertyuio
Highly Voted 6 months, 3 weeks ago
Selected Answer: D
https://docs.aws.amazon.com/firehose/latest/dev/record-format-conversion.html
upvoted 7 times
LR2023
6 months, 3 weeks ago
Why do you need Lambda in the middle? Per your link, Amazon Data Firehose can convert the format of your input data from JSON to Apache Parquet or Apache ORC before storing the data in Amazon S3. My choice is B.
upvoted 2 times
mzansikiller
5 months, 2 weeks ago
https://docs.aws.amazon.com/firehose/latest/dev/data-transformation.html https://docs.aws.amazon.com/firehose/latest/dev/record-format-conversion.html There is a need to invoke Lambda.
upvoted 2 times
Eleftheriia
3 months, 2 weeks ago
The documentation states: "If you want to convert an input format other than JSON, such as comma-separated values (CSV) or structured text, you can use AWS Lambda to transform it to JSON first."
upvoted 1 times
...
mzansikiller
Highly Voted 6 months, 2 weeks ago
Answer D https://docs.aws.amazon.com/firehose/latest/dev/record-format-conversion.html Amazon Data Firehose can convert the format of your input data from JSON to Apache Parquet or Apache ORC before storing the data in Amazon S3. Parquet and ORC are columnar data formats that save space and enable faster queries compared to row-oriented formats like JSON. If you want to convert an input format other than JSON, such as comma-separated values (CSV) or structured text, you can use AWS Lambda to transform it to JSON first. For more information, see Transform data in Amazon Data Firehose.
upvoted 5 times
...
Salam9
Most Recent 1 week, 4 days ago
Selected Answer: B
https://aws.amazon.com/ar/about-aws/whats-new/2016/12/amazon-kinesis-firehose-can-now-prepare-and-transform-streaming-data-before-loading-it-to-data-stores/
upvoted 1 times
...
kailu
3 weeks, 6 days ago
Selected Answer: C
Lambda handles both the CSV-to-JSON and JSON-to-Parquet transformations before Firehose stores the data in Amazon S3
upvoted 1 times
...
zoneout
1 month, 1 week ago
Selected Answer: D
If you want to convert an input format other than JSON, such as comma-separated values (CSV) or structured text, you can use AWS Lambda to transform it to JSON first. Amazon Data Firehose can then convert the format of your input data from JSON to Apache Parquet or Apache ORC.
upvoted 1 times
...
kailu
1 month, 2 weeks ago
Selected Answer: C
I would go with C. D is close but Kinesis Data Firehose does not really store files in Parquet format.
upvoted 1 times
...
michele_scar
2 months, 3 weeks ago
Selected Answer: D
https://docs.aws.amazon.com/firehose/latest/dev/record-format-conversion.html You first need JSON (produced by Lambda) before Kinesis Data Firehose can store the data in Parquet.
upvoted 1 times
...
rsmf
3 months, 2 weeks ago
Selected Answer: D
Firehose can't convert CSV to JSON, so the answer is D.
upvoted 2 times
...
PashoQ
4 months, 3 weeks ago
Selected Answer: D
If you want to convert an input format other than JSON, such as comma-separated values (CSV) or structured text, you can use AWS Lambda to transform it to JSON first. For more information
upvoted 2 times
...
mzansikiller
5 months, 2 weeks ago
Selected Answer: D
Amazon Data Firehose can convert the format of your input data from JSON to Apache Parquet or Apache ORC before storing the data in Amazon S3. Parquet and ORC are columnar data formats that save space and enable faster queries compared to row-oriented formats like JSON. If you want to convert an input format other than JSON, such as comma-separated values (CSV) or structured text, you can use AWS Lambda to transform it to JSON first. For more information, see Transform source data in Amazon Data Firehose. Answer D
upvoted 3 times
...
Shanmahi
5 months, 4 weeks ago
Selected Answer: B
Kinesis Data Firehose: It has built-in support for data transformation and format conversion. It can directly convert incoming data from .csv to JSON format and then further convert the data to Apache Parquet format before storing it in Amazon S3. Minimal Development Effort: This option requires the least development effort because Kinesis Data Firehose handles both the transformation (from .csv to JSON) and the format conversion (to Parquet) natively. No additional AWS Lambda functions or custom code are needed.
upvoted 2 times
...
MinTheRanger
6 months, 1 week ago
Selected Answer: B
B. Why? Amazon Data Firehose can convert the format of your input data from JSON to Apache Parquet or Apache ORC before storing the data in Amazon S3. https://docs.aws.amazon.com/firehose/latest/dev/record-format-conversion.html With that LEAST development effort, why do we need to use Lambda additionally? :D
upvoted 4 times
mzansikiller
5 months, 2 weeks ago
read to understand: Amazon Data Firehose can convert the format of your input data from JSON to Apache Parquet or Apache ORC before storing the data in Amazon S3. Parquet and ORC are columnar data formats that save space and enable faster queries compared to row-oriented formats like JSON. If you want to convert an input format other than JSON, such as comma-separated values (CSV) or structured text, you can use AWS Lambda to transform it to JSON first. For more information, see Transform source data in Amazon Data Firehose.
upvoted 1 times
...
valuedate
6 months, 2 weeks ago
Option D - The input data must be converted from .csv to JSON first. Firehose can't do that without the help of a Lambda function in this case. After that, Firehose can convert the JSON to Parquet and deliver it to S3.
upvoted 3 times
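The two-stage setup described in this comment might be configured along these lines. This is a sketch under assumptions: the field names are as I recall them from the Firehose CreateDeliveryStream API and should be checked against the current documentation, every ARN and name passed in is a hypothetical placeholder, and the function only builds a dictionary without calling AWS. Format conversion also relies on a schema held in an AWS Glue table, which is why a database and table name appear.

```python
def build_extended_s3_config(bucket_arn, role_arn, lambda_arn,
                             glue_database, glue_table, region):
    """Build a dict shaped like ExtendedS3DestinationConfiguration (no AWS call)."""
    return {
        "RoleARN": role_arn,
        "BucketARN": bucket_arn,
        # Assumption: format conversion needs a buffer size of at least 64 MB.
        "BufferingHints": {"SizeInMBs": 64, "IntervalInSeconds": 300},
        # Stage 1: Lambda turns each CSV record into JSON.
        "ProcessingConfiguration": {
            "Enabled": True,
            "Processors": [{
                "Type": "Lambda",
                "Parameters": [
                    {"ParameterName": "LambdaArn", "ParameterValue": lambda_arn},
                ],
            }],
        },
        # Stage 2: Firehose converts the JSON to Parquet using the Glue schema.
        "DataFormatConversionConfiguration": {
            "Enabled": True,
            "SchemaConfiguration": {
                "RoleARN": role_arn,
                "DatabaseName": glue_database,
                "TableName": glue_table,
                "Region": region,
            },
            "InputFormatConfiguration": {"Deserializer": {"OpenXJsonSerDe": {}}},
            "OutputFormatConfiguration": {"Serializer": {"ParquetSerDe": {}}},
        },
    }


# Placeholder values for illustration only.
config = build_extended_s3_config(
    bucket_arn="arn:aws:s3:::example-bucket",
    role_arn="arn:aws:iam::123456789012:role/example-firehose-role",
    lambda_arn="arn:aws:lambda:us-east-1:123456789012:function:example-csv-to-json",
    glue_database="example_db",
    glue_table="example_table",
    region="us-east-1",
)
```

A dictionary like this would be passed as ExtendedS3DestinationConfiguration to boto3's firehose create_delivery_stream call. The only custom code is the CSV-to-JSON Lambda function; the Parquet step is configuration.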
...
HunkyBunky
7 months, 1 week ago
Selected Answer: B
B - least development effort
upvoted 2 times
...
Alagong
7 months, 1 week ago
Selected Answer: B
By using the built-in transformation and format conversion features of Kinesis Data Firehose, you achieve the desired result with minimal custom development, thereby meeting the requirements efficiently and cost-effectively.
upvoted 4 times
...
Bmaster
7 months, 1 week ago
D is good https://docs.aws.amazon.com/firehose/latest/dev/record-format-conversion.html
upvoted 1 times
Bmaster
7 months, 1 week ago
" If you want to convert an input format other than JSON, such as comma-separated values (CSV) or structured text, you can use AWS Lambda to transform it to JSON first. For more information"
upvoted 3 times
...