How should I use these Data Ingestion and Transformation practice questions?

Read each scenario carefully and choose your answer before revealing the explanation. Then check why your choice was right or wrong. Repeat until the reasoning feels automatic.

Can I practise just Data Ingestion and Transformation questions in a focused session?

Yes — use the session launcher on this page to start a 10-, 20-, 30- or 50-question session drawn entirely from the Data Ingestion and Transformation domain.

DEA-C01 · topic practice

Data Ingestion and Transformation practice questions

Practise AWS Certified Data Engineer Associate DEA-C01 Data Ingestion and Transformation practice questions — original exam-style scenarios with answer choices, explanations, and analysis of common mistakes.

Courseiva uses original exam-style practice questions designed for learning and revision. The goal is to understand the concepts, recognise exam patterns, and improve through explanations — not memorise copied exam dumps.

Reviewed byJohnson Ajibi· MSc IT Security

20 questionsDomain: Data Ingestion and Transformation

Practice 10 questions Browse domain →

What the exam tests

What to know about Data Ingestion and Transformation

Data Ingestion and Transformation questions test whether you can apply the concept in context, not just recognise a definition.

How the topic appears in realistic exam-style scenarios.

Which detail in the question changes the correct answer.

How to eliminate plausible but wrong options.

How to connect the question back to the wider exam objective.

Watch out for

Common Data Ingestion and Transformation exam traps

▸Answering from memory before reading the full scenario.
▸Missing a constraint such as cost, availability, security, scope or command context.
▸Choosing a broad answer when the question asks for the most specific fix.
▸Ignoring why the wrong options are tempting.

Practice set

Data Ingestion and Transformation questions

20 questions · select your answer, then reveal the explanation

Question 1easymultiple choice

Read the full Data Ingestion and Transformation explanation →

A data engineer needs to ingest streaming data from an IoT fleet into Amazon S3 for near-real-time analytics. The data volume is approximately 5 GB per hour, and each event is less than 1 KB. Which AWS service should be used as the ingestion endpoint?

Trap 1: AWS DataSync

For large data transfers between storage systems.

Trap 2: Amazon AppFlow

For SaaS data ingestion.

Trap 3: Amazon Kinesis Data Streams

Not the direct IoT ingestion service.

Study all Data Ingestion and Transformation common traps →

A
AWS IoT Core
Designed for IoT device ingestion.
B
AWS DataSync
Why wrong: For large data transfers between storage systems.
C
Amazon AppFlow
Why wrong: For SaaS data ingestion.
D
Amazon Kinesis Data Streams
Why wrong: Not the direct IoT ingestion service.

Data Ingestion and Transformation practice questions

What to know about Data Ingestion and Transformation

Common Data Ingestion and Transformation exam traps

Data Ingestion and Transformation questions

A data engineer needs to ingest streaming data from an IoT fleet into Amazon S3 for near-real-time analytics. The data volume is approximately 5 GB per hour, and each event is less than 1 KB. Which AWS service should be used as the ingestion endpoint?

A data engineering team needs to transform CSV files stored in Amazon S3 into Parquet format using AWS Glue. The files are partitioned by date and are updated hourly. Which AWS Glue feature should be used to automatically detect the schema and partition structure?

A data engineer needs to ingest data from multiple SaaS applications (Salesforce, Marketo) into Amazon S3 for a data lake. The data volumes are moderate and the sync needs to be scheduled daily. Which AWS service is most appropriate for this task?

A data engineer needs to transfer 50 TB of historical data from an on-premises HDFS cluster to Amazon S3. The network bandwidth is limited to 100 Mbps. The transfer must be completed within one week. Which service should be used?

A data engineer needs to ingest JSON data from an on-premises relational database into Amazon S3 every hour. Which AWS service should be used to set up a scheduled, incremental data transfer?

A company is using AWS Glue to process data from Amazon S3. The Glue job reads CSV files and writes Parquet files to a different S3 bucket. The job occasionally fails with 'java.lang.OutOfMemoryError: Java heap space'. The data size varies. Which change should the engineer make to avoid this error?

A data engineer is designing a serverless data ingestion pipeline that uses Amazon Kinesis Data Firehose to deliver data to Amazon S3. The data must be transformed using AWS Lambda before being written to S3. Which two steps are required to enable this transformation? (Select TWO.)

A company uses AWS Glue to process streaming data from Amazon Kinesis Data Streams. The job reads JSON records and writes Parquet to Amazon S3. Recently, the job started failing with 'Out of Memory' errors. Which change is MOST likely to resolve the issue?

Track your progress over time

Start a Data Ingestion and Transformation only practice session

Related DEA-C01 topic practice pages

Data Ingestion and Transformation practice questions

Data Operations and Support practice questions

Data Security and Governance practice questions

Data Store Management practice questions

DEA-C01 fundamentals practice questions

DEA-C01 scenario practice questions

DEA-C01 troubleshooting practice questions

Frequently asked questions

Track your progress

Study resources

Exam traps to avoid