Question 1,095 of 1,786
Data Operations and SupporthardMultiple SelectObjective-mapped

Quick Answer

The answer is to add the Hadoop AWS JAR to the job’s extra JARs or use a Glue version that includes the S3A filesystem library, such as Glue 3.0 or later. This error occurs because the S3AFileSystem class, part of the Hadoop AWS library, is missing from the classpath when the job tries to read from S3 using the S3A protocol. On the AWS Certified Data Engineer Associate DEA-C01 exam, this question tests your understanding of dependency management in AWS Glue and the difference between classpath issues and configuration or permission errors. A common trap is to confuse this with an IAM or networking problem, but the ClassNotFoundException clearly points to a missing library, not access rights. Remember the memory tip: “Missing class? Add the JAR or upgrade the Glue version.”

DEA-C01 Data Operations and Support Practice Question

This DEA-C01 practice question tests your understanding of data operations and support. The scenario asks you to isolate a root cause — eliminate options that address a different problem before choosing. After answering, compare your reasoning against the explanation and wrong-answer breakdown below. Once you have made your selection, read the full explanation to reinforce the concept and understand why each distractor is designed to mislead on exam day.

A data engineer is troubleshooting a failed AWS Glue ETL job that reads from an S3 bucket. The job logs show the following error: 'java.lang.RuntimeException: java.lang.ClassNotFoundException: Class org.apache.hadoop.fs.s3a.S3AFileSystem not found'. Which TWO actions will resolve this issue?

Question 1hardmulti select
Full question →

Answer choices

Why each option matters

Answer the question above first, then reveal the full breakdown to understand why each option is right or wrong.

Correct answer & explanation

Include the hadoop-aws jar as an extra jar in the Glue job configuration.

Options A and D are correct. The error indicates the S3A filesystem class is missing, which is part of the Hadoop AWS library. Adding the jar to the job's extra jars (A) or using a Glue version that includes the library (D) fixes it. Option B is wrong because changing S3 access mode to EMRFS is for EMR, not Glue. Option C is wrong because the error is a classpath issue, not an IAM issue. Option E is wrong because enabling S3 endpoint is a networking issue.

Key principle: NAT direction and interface roles matter as much as the IP address mapping. Inside/outside designation controls which traffic is translated.

Answer analysis

Option-by-option breakdown

For each option: why learners choose it and why it is or isn't the right answer here.

  • Enable VPC S3 endpoint for the Glue job.

    Why it's wrong here

    Networking is not the issue.

  • Include the hadoop-aws jar as an extra jar in the Glue job configuration.

    Why this is correct

    Adds the missing class to the classpath.

    Related concept

    Static NAT maps one inside address to one outside address.

  • Update the IAM role to allow access to S3.

    Why it's wrong here

    Permissions are not the issue.

  • Use a Glue version that includes the S3A filesystem library (e.g., Glue 3.0 or later).

    Why this is correct

    Newer Glue versions include the library.

    Related concept

    Static NAT maps one inside address to one outside address.

  • Change the S3 access mode from S3A to EMRFS.

    Why it's wrong here

    EMRFS is specific to EMR, not Glue.

Common exam traps

Common exam trap: NAT rules depend on direction and matching traffic

NAT is not only about the public address. The inside/outside interface roles and the ACL or rule that matches traffic are just as important.

Detailed technical explanation

How to think about this question

NAT questions usually test address translation, overload/PAT behaviour, static mappings and whether the right traffic is being translated. Read the interface direction and address terms carefully.

KKey Concepts to Remember

  • Static NAT maps one inside address to one outside address.
  • PAT allows many inside hosts to share one public address using ports.
  • Inside local and inside global describe the private and translated addresses.
  • NAT ACLs identify traffic for translation, not always security filtering.

TExam Day Tips

  • Identify inside and outside interfaces first.
  • Check whether the scenario needs static NAT, dynamic NAT or PAT.
  • Do not confuse NAT matching ACLs with normal packet-filtering intent.

Key takeaway

NAT direction and interface roles matter as much as the IP address mapping. Inside/outside designation controls which traffic is translated.

Real-world example

How this comes up in practice

A media company stores terabytes of video archives that are accessed once a year for audit purposes. Moving these objects to a cold storage tier (Azure Archive, S3 Glacier, or Google Nearline) costs a fraction of hot storage. Questions like this test whether you understand storage tiers, access frequency tradeoffs, and retrieval latency requirements.

What to study next

Got this wrong? Here's your next step.

Review the four NAT address types (inside local, inside global, outside local, outside global), PAT port overload, and static vs dynamic NAT use cases. Then practise related DEA-C01 NAT questions on configuration and troubleshooting.

Related practice questions

Related DEA-C01 practice-question pages

Use these pages to review the topic behind this question. This is how one missed question becomes focused revision.

Practice this exam

Start a free DEA-C01 practice session

Short sessions build daily habit. Longer sessions build exam-day stamina. Try a timed session to simulate real conditions.

FAQ

Questions learners often ask

What does this DEA-C01 question test?

Data Operations and Support — This question tests Data Operations and Support — Static NAT maps one inside address to one outside address..

What is the correct answer to this question?

The correct answer is: Include the hadoop-aws jar as an extra jar in the Glue job configuration. — Options A and D are correct. The error indicates the S3A filesystem class is missing, which is part of the Hadoop AWS library. Adding the jar to the job's extra jars (A) or using a Glue version that includes the library (D) fixes it. Option B is wrong because changing S3 access mode to EMRFS is for EMR, not Glue. Option C is wrong because the error is a classpath issue, not an IAM issue. Option E is wrong because enabling S3 endpoint is a networking issue.

What should I do if I get this DEA-C01 question wrong?

Review the four NAT address types (inside local, inside global, outside local, outside global), PAT port overload, and static vs dynamic NAT use cases. Then practise related DEA-C01 NAT questions on configuration and troubleshooting.

What is the key concept behind this question?

Static NAT maps one inside address to one outside address.

About these practice questions

Courseiva creates original exam-style practice questions with explanations and wrong-answer analysis. It does not publish real exam questions, exam dumps, or protected exam content. Learn why practice questions differ from exam dumps →

How Courseiva writes practice questions · Editorial policy

Same concept, more angles

2 more ways this is tested on DEA-C01

These questions test the same concept from different angles. Work through them to make sure you can recognise it however the exam phrases it.

Variation 1. A data engineer is troubleshooting a failed AWS Glue job that reads from an Apache Hive metastore in an Amazon EMR cluster. The error message indicates 'ClassNotFoundException: org.apache.hadoop.hive.ql.metadata.HiveException'. The Glue job uses a custom Python shell script. What is the most likely cause of this error?

hard
  • A.Check the network connectivity between Glue and the EMR cluster.
  • B.Include the Hive JAR files in the 'Python library path' or use a Glue version with Hive support.
  • C.Modify the Python script to import the Hive libraries manually.
  • D.Update the IAM role to allow 'hive:Describe*' actions.

Why B: Option C is correct because the ClassNotFoundException for Hive classes indicates that the required Hive JARs are not available in the Glue job's classpath. The Glue job needs to include Hive jars either via a library path or by using a Glue version that supports Hive connectivity. Option A is incorrect because the Python script itself does not need to be modified to include imports; the JARs must be provided. Option B is incorrect because network connectivity would not cause a class not found error. Option D is incorrect because the error is not about IAM permissions.

Variation 2. A data engineer is troubleshooting a failed AWS Glue job that reads from an Amazon RDS for MySQL table. The error message indicates 'java.sql.SQLException: No suitable driver'. What is the most likely cause?

medium
  • A.The MySQL JDBC driver JAR is not included in the Glue job's dependencies.
  • B.The Glue job is using the wrong JDBC driver class name.
  • C.The Glue job's VPC subnet does not have a route to the RDS instance.
  • D.The RDS instance is not publicly accessible.

Why A: Option A is correct because the MySQL JDBC driver must be included in the Glue job's dependent JARs or as a Python module. Option B is incorrect because the driver class name is correct; the driver JAR is missing. Option C is incorrect because the error is about driver, not connection. Option D is incorrect because subnet routing does not affect driver loading.

Last reviewed: Jun 20, 2026

Question Discussion

Share a tip, memory trick, or ask about the reasoning behind this question. Do not post real exam questions, leaked content, braindumps, or copyrighted exam material. Comments are moderated and may be removed without notice.

Loading comments…

Sign in to join the discussion.

This DEA-C01 practice question is part of Courseiva's free Amazon Web Services certification practice question bank. Courseiva provides original exam-style practice questions with explanations, topic-based practice, mock exams, readiness tracking, and study analytics to help learners prepare for the DEA-C01 exam.