How should I use these Design Resilient Architectures practice questions?

Read each scenario carefully and choose your answer before revealing the explanation. Then check why your choice was right or wrong. Repeat until the reasoning feels automatic.

Can I practise just Design Resilient Architectures questions in a focused session?

Yes — use the session launcher on this page to start a 10-, 20-, 30- or 50-question session drawn entirely from the Design Resilient Architectures domain.

SAA-C03 · topic practice

Design Resilient Architectures practice questions

Use this page to practise high availability and resilience questions. The SAA-C03 exam tests your ability to match an architecture pattern to an RTO/RPO requirement — know the cost and recovery time of each pattern.

Courseiva uses original exam-style practice questions designed for learning and revision. The goal is to understand the concepts, recognise exam patterns, and improve through explanations — not memorise copied exam dumps.

Reviewed byJohnson Ajibi· MSc IT Security

20 questionsDomain: Design Resilient Architectures

Practice 10 questions Browse domain →

What the exam tests

What to know about Design Resilient Architectures

High availability and resilience questions test multi-AZ vs multi-Region patterns, Auto Scaling, load balancing and the right service for a given recovery time objective.

Multi-AZ vs multi-Region deployment trade-offs.

Auto Scaling policies and when to scale horizontally vs vertically.

Elastic Load Balancing: ALB, NLB, CLB and their use cases.

RTO and RPO targets matched to the correct AWS architecture.

Watch out for

Common Design Resilient Architectures exam traps

▸Multi-AZ protects against AZ failure; multi-Region protects against Region failure.
▸Auto Scaling does not guarantee zero downtime without a load balancer.
▸ALB operates at Layer 7; NLB operates at Layer 4.
▸Pilot light is cheaper than warm standby but has longer recovery time.

Practice set

Design Resilient Architectures questions

20 questions · select your answer, then reveal the explanation

Question 1mediummultiple choice

Read the full Design Resilient Architectures explanation →

An order-processing service consumes messages from an Amazon SQS Standard queue using a custom worker. During traffic spikes, the worker occasionally times out after performing some work but before acknowledging the message, so SQS redelivers it and it may be processed again.

You also observe that a small set of “poison” messages always fail validation.

What change most directly improves resilience by (1) preventing poison messages from retrying indefinitely and (2) avoiding duplicate side effects caused by legitimate retries?

Trap 1: Increase the SQS visibility timeout and, when validation fails,…

Increasing visibility reduces redelivery temporarily, but it does not implement a poison-message quarantine strategy. Deleting invalid messages immediately removes evidence and prevents systematic handling (for example, inspection or correction) of the poison messages.

Trap 2: Move to SNS topics with subscriptions and rely on SNS to provide…

SNS does not provide exactly-once delivery guarantees. Duplicate deliveries can still occur due to retries and downstream failures, so you still need an idempotency strategy to protect side effects.

Trap 3: Change the queue to FIFO and enable content-based deduplication,…

FIFO with content-based deduplication may reduce some duplicates, but it does not guarantee protection against duplicate side effects when the consumer times out or fails after partially processing. Poison-message retry loops still need a DLQ/redrive approach, and idempotency is still required to make processing safe under retries.

Study all Design Resilient Architectures common traps →

A
Increase the SQS visibility timeout and, when validation fails, call DeleteMessage in the consumer to remove the message immediately.
Why wrong: Increasing visibility reduces redelivery temporarily, but it does not implement a poison-message quarantine strategy. Deleting invalid messages immediately removes evidence and prevents systematic handling (for example, inspection or correction) of the poison messages.
B
Move to SNS topics with subscriptions and rely on SNS to provide exactly-once delivery to eliminate duplicates automatically.
Why wrong: SNS does not provide exactly-once delivery guarantees. Duplicate deliveries can still occur due to retries and downstream failures, so you still need an idempotency strategy to protect side effects.
C
Configure a dead-letter queue (DLQ) with a redrive policy that moves messages after maxReceiveCount, and implement idempotent processing in the consumer using an idempotency key.
SQS Standard is at-least-once delivery, so timeouts can cause redelivery and duplicates. A DLQ with a redrive policy prevents poison messages from retrying forever by moving them after repeated failures. Idempotent processing (for example, storing a processed marker in a database with conditional logic keyed by an idempotency key) prevents duplicate side effects when retries occur for valid messages.
D
Change the queue to FIFO and enable content-based deduplication, leaving the consumer logic unchanged.
Why wrong: FIFO with content-based deduplication may reduce some duplicates, but it does not guarantee protection against duplicate side effects when the consumer times out or fails after partially processing. Poison-message retry loops still need a DLQ/redrive approach, and idempotency is still required to make processing safe under retries.

Design Resilient Architectures practice questions

What to know about Design Resilient Architectures

Common Design Resilient Architectures exam traps

Design Resilient Architectures questions

Based on the exhibit, the application sees several minutes of connection errors during an Aurora failover. What is the best change to reduce failover impact?

Exhibit

Based on the exhibit, DNS still sends traffic to the primary Region even though Route 53 health checks show the primary endpoint is unhealthy. What is the best change to make failover work as intended?

Exhibit

Based on the exhibit, the web application must remain available even if one Availability Zone fails. What is the best change to improve resilience with the least redesign?

Exhibit

Based on the exhibit, the database must continue serving if the current Availability Zone fails. What should you change?

Exhibit

Based on the exhibit, the application tier is not replacing unhealthy instances even though the Auto Scaling group spans two Availability Zones. What change most directly improves automatic recovery when the application process fails?

Based on the exhibit, the team must restore an Amazon RDS for PostgreSQL database to the exact state just before a bad delete happened. What is the best recovery approach?

Exhibit

Based on the exhibit, the company wants DNS traffic to fail over automatically from the primary Region to a secondary Region when the primary endpoint is unhealthy. Which Route 53 change is best?

Exhibit

Based on the exhibit, downstream payment timeouts cause EventBridge deliveries to back up and some events are retried until they age out. What change best improves resilience and preserves events during downstream outages?

Exhibit

Based on the exhibit, the web tier becomes unavailable if us-west-2a has an outage. What is the best change to improve resilience with the least redesign?

Exhibit

Based on the exhibit, the database is manually promoted during an Availability Zone failure and the application outage lasts longer than the target. What change best improves resilience with the least operational intervention?

Exhibit

Track your progress over time

Start a Design Resilient Architectures only practice session

Related SAA-C03 topic practice pages

Design Secure Architectures practice questions

Design Resilient Architectures practice questions

Design High-Performing Architectures practice questions

Design Cost-Optimized Architectures practice questions

SAA-C03 VPC practice questions

SAA-C03 S3 lifecycle policy questions

SAA-C03 RDS Multi-AZ questions

SAA-C03 IAM policy practice questions

SAA-C03 Route 53 failover questions

SAA-C03 CloudFront practice questions

SAA-C03 NAT gateway questions

SAA-C03 VPC endpoint questions

Frequently asked questions

Track your progress

Study resources

Exam traps to avoid