DEA-C01 Data Operations and Support • Set 12
DEA-C01 Data Operations and Support Practice Test 12 — 15 questions with explanations. Free, no signup.
A data engineer is running an AWS Glue ETL job that converts CSV files to Parquet. The job fails with the error shown in the exhibit. The input files are about 500 MB each. The job uses 5 workers of type G.1X (16 GB memory each). What is the MOST likely cause?
Refer to the exhibit. CloudWatch Logs log group: /aws/glue/jobs/error Log stream: job-run-12345 Log event: 2024-01-15T10:30:45.000Z ERROR [Executor task launch worker for task 0.0 in stage 0.0 (TID 0)] : java.lang.OutOfMemoryError: Java heap space 2024-01-15T10:30:45.001Z ERROR [Executor task launch worker for task 0.0 in stage 0.0 (TID 0)] : at org.apache.spark.sql.execution.datasources.FileFormatWriter$WriteTask.execute(FileFormatWriter.scala:247)