Exam AWS Certified Data Engineer - Associate DEA-C01 topic 1 question 22 discussion

A company maintains multiple extract, transform, and load (ETL) workflows that ingest data from the company's operational databases into an Amazon S3 based data lake. The ETL workflows use AWS Glue and Amazon EMR to process data.
The company wants to improve the existing architecture to provide automated orchestration and to require minimal manual effort.
Which solution will meet these requirements with the LEAST operational overhead?

  • A. AWS Glue workflows
  • B. AWS Step Functions tasks
  • C. AWS Lambda functions
  • D. Amazon Managed Workflows for Apache Airflow (Amazon MWAA) workflows
Suggested Answer: B

Comments

valuedate
Highly Voted 10 months, 2 weeks ago
Selected Answer: B
Glue workflows only orchestrate crawlers and Glue jobs.
upvoted 15 times
...
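To illustrate the point above, here is a minimal sketch of how Glue workflow triggers are typically described. It only builds the request dictionaries; the workflow, job, and crawler names are hypothetical, and the helper `build_trigger` is an illustrative function, not part of any AWS library. The dictionary keys follow the shape that boto3's Glue `create_trigger` call accepts, where each action targets either a job or a crawler.

```python
from typing import Optional


def build_trigger(name: str, workflow_name: str, action: dict,
                  after_job: Optional[str] = None) -> dict:
    """Build a request dict for one Glue workflow trigger.

    `action` must target exactly one Glue job or crawler, e.g.
    {"JobName": "transform-job"} or {"CrawlerName": "raw-crawler"}.
    """
    targets = {"JobName", "CrawlerName"} & set(action)
    if len(targets) != 1:
        raise ValueError("a trigger action must name one Glue job or one crawler")

    trigger = {"Name": name, "WorkflowName": workflow_name, "Actions": [action]}
    if after_job is None:
        # Entry point of the workflow, started manually or through the API.
        trigger["Type"] = "ON_DEMAND"
    else:
        # Fires only after the named upstream Glue job has succeeded.
        trigger["Type"] = "CONDITIONAL"
        trigger["StartOnCreation"] = True
        trigger["Predicate"] = {
            "Conditions": [
                {"LogicalOperator": "EQUALS", "JobName": after_job, "State": "SUCCEEDED"}
            ]
        }
    return trigger


# Hypothetical names, for illustration only.
triggers = [
    build_trigger("start-extract", "etl-workflow", {"JobName": "extract-job"}),
    build_trigger("after-extract", "etl-workflow", {"JobName": "transform-job"},
                  after_job="extract-job"),
    build_trigger("after-transform", "etl-workflow", {"CrawlerName": "lake-crawler"},
                  after_job="transform-job"),
]
```

Each dictionary could then be passed to a boto3 Glue client as `glue.create_trigger(**trigger)`. Note that an action has no field for an EMR cluster or step, which is the limitation this comment refers to.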
DevoteamAnalytix
Highly Voted 11 months ago
Selected Answer: B
For me it's B because I could not find a way for Glue to trigger or orchestrate EMR processes out of the box. With Step Functions there is a way: https://aws.amazon.com/blogs/big-data/orchestrate-amazon-emr-serverless-jobs-with-aws-step-functions/
upvoted 7 times
...
Rpathak4
Most Recent 1 week, 5 days ago
Selected Answer: A
Why not the other options?
B. AWS Step Functions: more flexible, but requires manual setup of states and transitions for Glue and EMR. Higher operational overhead than Glue workflows.
C. AWS Lambda: not ideal for long-running ETL workflows. Best suited for lightweight data transformations or event-driven tasks.
D. Amazon MWAA (Apache Airflow): more control, but requires cluster management and custom DAGs. Higher maintenance than Glue workflows.
upvoted 1 times
...
Palee
2 weeks, 2 days ago
Selected Answer: B
The company wants to improve the existing architecture so A cannot be the right choice
upvoted 1 times
...
plutonash
2 months, 3 weeks ago
Selected Answer: B
It is tempting to choose A for minimum effort, but only Step Functions can trigger work on both EMR and Glue jobs.
upvoted 1 times
...
ttpro1995
3 months, 1 week ago
Selected Answer: B
We have both Glue jobs and EMR jobs, so we need Step Functions to connect them. Airflow can do it too, but requires more dev work.
upvoted 2 times
...
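As a rough sketch of what connecting the two services could look like, the snippet below builds an Amazon States Language definition with one Glue task followed by one EMR step. It assumes an already running EMR cluster; the job name, cluster ID, and S3 script path are hypothetical placeholders, and `build_state_machine` is an illustrative helper. The two `Resource` values are the Step Functions service integrations for Glue (`startJobRun.sync`) and EMR (`addStep.sync`), which wait for the job or step to finish before moving on.

```python
import json


def build_state_machine(glue_job_name: str, emr_cluster_id: str,
                        spark_script_s3: str) -> dict:
    """Return a state machine definition: run a Glue job, then an EMR Spark step."""
    return {
        "Comment": "Hypothetical ETL orchestration: Glue job followed by an EMR step",
        "StartAt": "RunGlueJob",
        "States": {
            "RunGlueJob": {
                "Type": "Task",
                # .sync makes Step Functions wait until the Glue job run completes.
                "Resource": "arn:aws:states:::glue:startJobRun.sync",
                "Parameters": {"JobName": glue_job_name},
                "Retry": [
                    {
                        "ErrorEquals": ["States.TaskFailed"],
                        "IntervalSeconds": 60,
                        "MaxAttempts": 2,
                        "BackoffRate": 2.0,
                    }
                ],
                "Next": "RunEmrStep",
            },
            "RunEmrStep": {
                "Type": "Task",
                # Adds a step to an existing cluster and waits for it to finish.
                "Resource": "arn:aws:states:::elasticmapreduce:addStep.sync",
                "Parameters": {
                    "ClusterId": emr_cluster_id,
                    "Step": {
                        "Name": "spark-transform",
                        "ActionOnFailure": "CONTINUE",
                        "HadoopJarStep": {
                            "Jar": "command-runner.jar",
                            "Args": ["spark-submit", "--deploy-mode", "cluster",
                                     spark_script_s3],
                        },
                    },
                },
                "End": True,
            },
        },
    }


# Placeholder values, for illustration only.
definition = build_state_machine(
    "extract-job", "j-EXAMPLECLUSTER", "s3://example-bucket/scripts/transform.py"
)
print(json.dumps(definition, indent=2))
```

The resulting JSON could be supplied as the definition when creating a state machine. Retries, error handling, and scheduling are declared in the same document, which is why several commenters consider this the lower-overhead way to cover both Glue and EMR.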
Adrifersilva
6 months ago
Selected Answer: A
Glue workflows are part of the Glue ecosystem, so they provide seamless integration with minimal changes.
upvoted 1 times
...
Shatheesh
6 months ago
Answer A, Glue workflows
upvoted 1 times
...
Shanmahi
7 months, 1 week ago
Selected Answer: A
Glue workflows are a managed service and the best fit when the goal is the least operational overhead.
upvoted 1 times
...
V0811
8 months ago
Selected Answer: A
AWS Glue workflows are specifically designed for orchestrating ETL jobs in AWS Glue. They allow you to define and manage complex workflows that include multiple jobs and triggers, all within the AWS Glue environment.
Integration: AWS Glue workflows integrate seamlessly with other AWS Glue components, making it easier to manage ETL processes without external orchestration tools.
Minimal operational overhead: Since AWS Glue is a fully managed service, using Glue workflows reduces the operational overhead compared to managing separate orchestrators or building custom solutions.
While D. Amazon Managed Workflows for Apache Airflow (Amazon MWAA) is also a good choice for more complex orchestration, it may involve more management overhead than the more straightforward AWS Glue workflows. Thus, AWS Glue workflows provide the least operational overhead in this scenario.
upvoted 1 times
...
HunkyBunky
9 months ago
Selected Answer: B
B - because AWS Glue can't trigger EMR
upvoted 1 times
...
FunkyFresco
10 months, 1 week ago
Selected Answer: B
EMR in Glue workflows? I don't think so.
upvoted 3 times
...
VerRi
10 months, 2 weeks ago
Selected Answer: B
There is no way for Glue Workflow to trigger EMR
upvoted 4 times
...
acoshi
11 months, 1 week ago
Selected Answer: A
https://aws.amazon.com/blogs/big-data/orchestrate-an-etl-pipeline-using-aws-glue-workflows-triggers-and-crawlers-with-custom-classifiers/
upvoted 2 times
...
lucas_rfsb
1 year ago
Selected Answer: A
Since it seems to me that this pipeline is complex, with multiple workflows, I would go for Glue workflows.
upvoted 6 times
...
jasango
1 year ago
I'm going with D) Amazon MWAA, because Glue workflows only support Glue jobs, and Step Functions could work but they are not data workflows. Amazon MWAA workflows are data workflows and are integrated with both Glue and EMR: https://aws.amazon.com/blogs/big-data/simplify-aws-glue-job-orchestration-and-monitoring-with-amazon-mwaa/
upvoted 3 times
...
certplan
1 year ago
Here's an example of how you can use AWS Glue to initiate an EMR (Elastic MapReduce) job. Let's assume you have an AWS Glue job that performs ETL tasks on data stored in Amazon S3. You want to leverage EMR for a specific task within this job, such as running a complex Spark job.
1. Define a Glue job: Create an AWS Glue job using the AWS Glue console, SDK, or CLI. Define the input and output data sources, as well as the transformations you want to apply.
2. Incorporate an EMR step: Within the Glue job script, include a section where you define an EMR step. An EMR step is a unit of work that performs a specific task on an EMR cluster.
Code follows in the next entry...
upvoted 2 times
...
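The commenter's own code is not part of this thread excerpt. As a separate, hypothetical sketch of step 2, the snippet below shows one way a Python script might define an EMR step and submit it with boto3's `add_job_flow_steps`. The cluster ID, script path, and region are placeholders, both helper functions are illustrative names, and it assumes the job's IAM role is allowed to add steps to the cluster. This is custom code inside a job, not a built-in Glue workflow feature.

```python
from typing import Sequence


def build_spark_step(name: str, script_s3_uri: str,
                     script_args: Sequence[str] = ()) -> dict:
    """Describe an EMR step that runs a Spark script through command-runner.jar."""
    return {
        "Name": name,
        "ActionOnFailure": "CONTINUE",
        "HadoopJarStep": {
            "Jar": "command-runner.jar",
            "Args": ["spark-submit", "--deploy-mode", "cluster",
                     script_s3_uri, *script_args],
        },
    }


def submit_step(cluster_id: str, step: dict, region: str = "us-east-1") -> str:
    """Submit the step to an existing EMR cluster and return the new step ID."""
    import boto3  # imported lazily so the step builder works without AWS access

    emr = boto3.client("emr", region_name=region)
    response = emr.add_job_flow_steps(JobFlowId=cluster_id, Steps=[step])
    return response["StepIds"][0]


# Placeholder values, for illustration only.
step = build_spark_step(
    "complex-spark-task",
    "s3://example-bucket/scripts/heavy_transform.py",
    ["--input", "s3://example-bucket/raw/"],
)
# step_id = submit_step("j-EXAMPLECLUSTER", step)  # needs AWS credentials and a cluster
```

With this approach the Glue job only submits the step; waiting for completion and handling failures would still have to be coded by hand.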