An operations team notices that a few AWS Glue jobs for a given ETL application are failing. The AWS Glue jobs read a large number of small JSON files from an
Amazon S3 bucket and write the data to a different S3 bucket in Apache Parquet format with no major transformations. Upon initial investigation, a data engineer notices the following error message in the History tab on the AWS Glue console: `Command Failed with Exit Code 1.`
Upon further investigation, the data engineer notices that the driver memory profile of the failed jobs crosses the safe threshold of 50% usage quickly and reaches
90`"95% soon after. The average memory usage across all executors continues to be less than 4%.
The data engineer also notices the following error while examining the related Amazon CloudWatch Logs.
What should the data engineer do to solve the failure in the MOST cost-effective way?
jyrajan69
Highly Voted 3 years, 6 months agolakeswimmer
3 years, 4 months agocloudlearnerhere
Highly Voted 2 years, 5 months agocloudlearnerhere
2 years, 5 months agocloudlearnerhere
2 years, 5 months agoMLCL
Most Recent 1 year, 8 months agopk349
1 year, 11 months agoMang2000
2 years, 2 months ago[Removed]
2 years, 4 months agohe11ow0rId
2 years, 7 months agorocky48
2 years, 9 months agosamsanta2012
2 years, 10 months agoCloudTimes
2 years, 10 months agoBik000
2 years, 11 months agoBik000
2 years, 11 months agoMWL
2 years, 11 months agojrheen
2 years, 11 months agoyusnardo
3 years, 1 month agoRSSRAO
3 years, 2 months agolakeswimmer
3 years, 4 months ago