A data engineer is designing a data pipeline that uses Amazon Kinesis Data Streams to ingest real-time transaction data. The data must be processed in near real-time and stored in Amazon S3 for long-term analytics. The engineer wants to ensure data durability and exactly-once processing semantics. Which TWO actions should the engineer take? (Choose two.)
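The KPL retries failed writes automatically, which protects against data loss but can introduce duplicate records; it has no built-in exactly-once mode.
Why this answer
Correct options: C and D. The Kinesis Producer Library (KPL) batches and retries writes so that records are durably committed to the stream, and Kinesis Data Firehose delivers them to S3. Both provide at-least-once delivery rather than exactly-once, so a retry on either side can produce duplicates. Exactly-once results therefore depend on deduplication or idempotent writes downstream, for example by embedding a unique ID in each record.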
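Option A (Amazon SQS) is a separate messaging service, not a feature of Kinesis. Option B (AWS Glue) does not provide exactly-once guarantees for streaming data. Option E (AWS Lambda) can process stream records, but because it retries failed batches it does not guarantee exactly-once semantics without additional idempotency logic.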
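
The "additional logic" mentioned for Lambda usually means making the consumer idempotent. The following is a minimal sketch of that idea, not an AWS API: it assumes each record carries a unique `transaction_id` field (a hypothetical name), and it uses an in-memory set and dict as stand-ins for a durable deduplication store and a keyed sink.

```python
import hashlib
import json


def dedup_key(record: dict) -> str:
    """Return a stable key: the (assumed) transaction_id if present, else a content hash."""
    if "transaction_id" in record:
        return str(record["transaction_id"])
    # Fall back to hashing the canonical JSON form so field order does not matter.
    payload = json.dumps(record, sort_keys=True).encode("utf-8")
    return hashlib.sha256(payload).hexdigest()


def process_batch(records, seen_keys, sink):
    """Write each record to sink at most once, skipping keys already in seen_keys.

    Returns the number of records actually written.
    """
    written = 0
    for record in records:
        key = dedup_key(record)
        if key in seen_keys:
            continue  # duplicate caused by a producer or consumer retry
        sink[key] = record  # keyed write: repeating it overwrites instead of appending
        seen_keys.add(key)
        written += 1
    return written


if __name__ == "__main__":
    seen, sink = set(), {}
    batch = [
        {"transaction_id": "t1", "amount": 10},
        {"transaction_id": "t1", "amount": 10},  # retried duplicate
        {"transaction_id": "t2", "amount": 5},
    ]
    print(process_batch(batch, seen, sink))  # first delivery of the batch
    print(process_batch(batch, seen, sink))  # redelivery of the same batch
```

In a real pipeline the set would need to be a durable store shared across consumer invocations, and the sink key could be used as the S3 object key so that a repeated write replaces the same object; both are design assumptions rather than features of Kinesis.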