Question 45 of 1,040
Design Resilient ArchitecturesmediumMultiple ChoiceObjective-mapped

Quick Answer

The answer is to increase the ASG health check grace period and use a rolling instance refresh. This is correct because the grace period prevents the Auto Scaling group from prematurely terminating new instances that are still booting and initializing their application, which is exactly when they return HTTP 5xx errors. Without a sufficient grace period, the ASG sees the failed health checks and immediately replaces the instance, creating a cycle of churn and downtime. On the SAA-C03 exam, this scenario tests your understanding of how health check grace periods decouple instance startup from health check evaluation, and the common trap is to only adjust the ALB health check interval or threshold, which does not stop the ASG from terminating the instance first. A rolling instance refresh complements this by replacing instances in batches, ensuring existing healthy instances continue serving traffic while new ones warm up. Memory tip: think of the grace period as a “no-termination buffer” for new instances—give them time to breathe before the ASG judges them.

SAA-C03 Design Resilient Architectures Practice Question

This SAA-C03 practice question tests your understanding of design resilient architectures. Read the scenario carefully and evaluate each option against the stated constraints before committing to an answer. After answering, compare your reasoning against the explanation and wrong-answer breakdown below. Once you have made your selection, read the full explanation to reinforce the concept and understand why each distractor is designed to mislead on exam day.

A web application runs on an EC2 Auto Scaling group (ASG) behind an Application Load Balancer (ALB). The ASG spans three Availability Zones. After a deployment, new instances frequently fail the ALB target group health checks with HTTP 5xx responses and are quickly terminated by the ASG. What change most improves resiliency during deployments with minimal downtime by preventing premature removal of instances that are still starting?

Question 1mediummultiple choice
Read the full NAT/PAT explanation →

Answer choices

Why each option matters

Answer the question above first, then reveal the full breakdown to understand why each option is right or wrong.

Correct answer & explanation

Use a longer ASG health check grace period and deploy new instances using controlled replacement (for example, rolling instance refresh) so existing healthy instances continue serving while new ones warm up.

Option B is correct because increasing the ASG health check grace period gives new instances more time to complete their startup and pass the ALB health checks before the ASG marks them unhealthy. A rolling instance refresh replaces instances in a controlled manner, ensuring that existing healthy instances continue serving traffic while new instances warm up, minimizing downtime and preventing premature termination.

Key principle: Answer the scenario, not the keyword: identify the specific constraint before choosing the most familiar-sounding option.

Answer analysis

Option-by-option breakdown

For each option: why learners choose it and why it is or isn't the right answer here.

  • Reduce the ASG health check grace period to 0 seconds so issues are detected faster.

    Why it's wrong here

    A shorter grace period increases the chance that instances will be marked unhealthy before the application finishes startup or initialization. That can reduce available capacity and increase downtime during deployments.

  • Use a longer ASG health check grace period and deploy new instances using controlled replacement (for example, rolling instance refresh) so existing healthy instances continue serving while new ones warm up.

    Why this is correct

    A longer ASG health check grace period prevents instances from being evaluated too early during normal startup time. Controlled replacement or rolling instance refresh ensures capacity is maintained while new instances warm up, so the ALB continues routing requests only to healthy targets.

    Related concept

    Read the scenario before looking for a memorised answer.

  • Restrict the ASG to a single Availability Zone so health check evaluation is simpler.

    Why it's wrong here

    Reducing the ASG to one Availability Zone decreases fault tolerance. It also increases the blast radius of deployment errors or AZ issues, worsening resiliency rather than improving it.

  • Disable ALB health checks so the ASG does not terminate instances on HTTP 5xx responses.

    Why it's wrong here

    Disabling health checks removes the safety mechanism that prevents routing traffic to broken instances. The system can become unavailable or degrade because the ALB may route requests to unhealthy targets.

Common exam traps

Common exam trap: answer the scenario, not the keyword

The trap here is that candidates think reducing the grace period or disabling health checks will speed up recovery, when in fact it causes premature termination or serves traffic to unhealthy instances, increasing downtime.

Detailed technical explanation

How to think about this question

The ASG health check grace period (default 300 seconds) delays the start of health checks after an instance enters the InService state. During a deployment, new instances may return HTTP 5xx while their application initializes (e.g., loading caches, connecting to databases). A rolling instance refresh, combined with a sufficient grace period, allows the ALB to complete its own health check interval (default 30 seconds) and unhealthy threshold (default 2 failures) before the ASG acts, preventing a cascade of terminations.

KKey Concepts to Remember

  • Read the scenario before looking for a memorised answer.
  • Find the constraint that changes the correct option.
  • Eliminate answers that are true in general but not in this case.

TExam Day Tips

  • Watch for words such as best, first, most likely and least administrative effort.
  • Review why wrong options are wrong, not only why the correct option is correct.

Key takeaway

Answer the scenario, not the keyword: identify the specific constraint before choosing the most familiar-sounding option.

Real-world example

How this comes up in practice

An e-commerce site experiences heavy traffic on Black Friday and near-zero traffic during off-peak weeks. Rather than provisioning permanent large VMs, the team uses auto-scaling groups that add capacity automatically under load and reduce it overnight. Questions like this test whether you understand elasticity, availability zones, and cloud compute scaling patterns.

What to study next

Got this wrong? Here's your next step.

Identify which exam domain this question belongs to, review the core concept, then practise similar questions from the same domain.

Related practice questions

Related SAA-C03 practice-question pages

Use these pages to review the topic behind this question. This is how one missed question becomes focused revision.

Practice this exam

Start a free SAA-C03 practice session

Short sessions build daily habit. Longer sessions build exam-day stamina. Try a timed session to simulate real conditions.

FAQ

Questions learners often ask

What does this SAA-C03 question test?

Design Resilient Architectures — This question tests Design Resilient Architectures — Read the scenario before looking for a memorised answer..

What is the correct answer to this question?

The correct answer is: Use a longer ASG health check grace period and deploy new instances using controlled replacement (for example, rolling instance refresh) so existing healthy instances continue serving while new ones warm up. — Option B is correct because increasing the ASG health check grace period gives new instances more time to complete their startup and pass the ALB health checks before the ASG marks them unhealthy. A rolling instance refresh replaces instances in a controlled manner, ensuring that existing healthy instances continue serving traffic while new instances warm up, minimizing downtime and preventing premature termination.

What should I do if I get this SAA-C03 question wrong?

Identify which exam domain this question belongs to, review the core concept, then practise similar questions from the same domain.

What is the key concept behind this question?

Read the scenario before looking for a memorised answer.

About these practice questions

Courseiva creates original exam-style practice questions with explanations and wrong-answer analysis. It does not publish real exam questions, exam dumps, or protected exam content. Learn why practice questions differ from exam dumps →

How Courseiva writes practice questions · Editorial policy

Same concept, more angles

2 more ways this is tested on SAA-C03

These questions test the same concept from different angles. Work through them to make sure you can recognise it however the exam phrases it.

Variation 1. A web application runs on an Auto Scaling group (ASG) behind an Application Load Balancer (ALB). After a new release, instances begin failing ALB health checks with errors like 502 while the application is still starting up. CloudWatch shows that the ASG replaces the instances before they finish initializing, so traffic never reaches healthy targets. Which change most directly prevents premature replacement during startup so traffic can resume as soon as the instances are actually healthy?

medium
  • A.Reduce the ALB health check timeout to 1 second so failures are detected faster.
  • B.Increase the Auto Scaling group health check grace period to cover application startup and initialization time.
  • C.Enable connection draining on the ALB target group but set deregistration delay to 0 seconds.
  • D.Switch the ALB target group health checks from HTTP to TCP so the application does not need to return HTTP 200.

Why B: B is correct because the Auto Scaling group health check grace period allows instances a specified amount of time to initialize before the ASG starts checking their health status. By increasing this grace period to cover the application startup time, the ASG will not prematurely replace instances that are still initializing, allowing them to pass the ALB health checks and begin receiving traffic once they are actually healthy.

Variation 2. An Auto Scaling group behind an Application Load Balancer frequently replaces new EC2 instances. The application needs ~6 minutes to warm up after instance launch. However, the ALB target group health checks start immediately and mark the targets unhealthy until the application is ready. Because the targets become unhealthy early, the Auto Scaling group then terminates the instances and launches replacements, creating a repeated unhealthy/termination loop. What configuration change will most directly improve recovery by preventing premature ASG termination while the application is warming up?

medium
  • A.Set a health check grace period on the Auto Scaling group that exceeds the application startup/warm-up time.
  • B.Increase the Auto Scaling group's desired capacity to a higher number than required.
  • C.Disable ALB target group health checks so instances are considered healthy as soon as they register.
  • D.Change the Auto Scaling health check type from ELB to EC2 so the ALB will no longer determine instance health.

Why A: The health check grace period on an Auto Scaling group (ASG) allows a newly launched EC2 instance to bypass health check failures for a specified duration. By setting this grace period to exceed the application's ~6-minute warm-up time, the ASG will not prematurely terminate the instance based on ALB health check results. This directly breaks the unhealthy/termination loop while the application initializes.

Keep practising

More SAA-C03 practice questions

Last reviewed: Jun 11, 2026

Question Discussion

Share a tip, memory trick, or ask about the reasoning behind this question. Do not post real exam questions, leaked content, braindumps, or copyrighted exam material. Comments are moderated and may be removed without notice.

Loading comments…

Sign in to join the discussion.

This SAA-C03 practice question is part of Courseiva's free Amazon Web Services certification practice question bank. Courseiva provides original exam-style practice questions with explanations, topic-based practice, mock exams, readiness tracking, and study analytics to help learners prepare for the SAA-C03 exam.