A web API runs on an Auto Scaling group (ASG) behind an Application Load Balancer (ALB). During traffic spikes, users experience request timeouts even though CPU stays below 40%. After investigation, you find the ASG often has too few healthy targets to handle the current request rate. Which change will best improve responsiveness during spikes?
Answer choices
Why each option matters
Good practice is not just finding the correct option. The wrong answers often show the exact trap the exam wants you to fall into.
Distractor review
Keep the ASG scaling policy based on CPU utilization, but increase the ASG min capacity by 50%.
Raising the minimum capacity can reduce the odds of running out of healthy targets, but the scaling decision is still driven by CPU. In this scenario, CPU is explicitly <40% during the timeouts, so CPU-based scaling may not trigger additional capacity early enough for the request-rate increase. This also permanently increases baseline cost even when traffic is low.
Best answer
Create a target tracking scaling policy using an ALB metric such as RequestCountPerTarget or TargetResponseTime.
Target tracking with an ALB performance metric scales based on the same layer where the problem is observed (requests/latency through the ALB). As traffic spikes, RequestCountPerTarget and/or TargetResponseTime increase; the scaling policy then increases the ASG desired capacity so the ALB has more healthy targets to distribute requests to. That reduces queuing/latency and helps prevent timeouts without waiting for CPU to rise.
Distractor review
Enable EC2 detailed monitoring for one-minute granularity and keep CPU scaling.
More frequent CPU measurements may make scaling react slightly faster, but it does not change the underlying signal. Since CPU remains below 40% during timeouts, the CPU metric still won’t reflect the bottleneck (insufficient healthy targets for the request rate), so scaling may not increase capacity when needed.
Distractor review
Switch to scaling based on the ASG network out bytes metric only, ignoring ALB response metrics.
Network out bytes can correlate with traffic, but it is not a direct measure of user-perceived performance. It can be noisy (for example, from background data transfer) and it may not capture saturation effects that show up as rising ALB latency or request queuing. ALB request/latency metrics provide a closer feedback loop to timeouts.
Common exam trap
Common exam trap: answer the scenario, not the keyword
Many certification questions include familiar terms but test a specific constraint. Read the exact wording before choosing an answer that is generally true but wrong for this case.
Technical deep dive
How to think about this question
This question should be treated as a scenario, not a definition check. Identify the problem, the constraint and the best action. Then compare each option against those facts.
KKey Concepts to Remember
- Read the scenario before looking for a memorised answer.
- Find the constraint that changes the correct option.
- Eliminate answers that are true in general but not in this case.
- Use explanations to understand the rule behind the answer.
TExam Day Tips
- Underline the problem statement mentally.
- Watch for words such as best, first, most likely and least administrative effort.
- Review why wrong options are wrong, not only why the correct option is correct.
Related practice questions
Related SAA-C03 practice-question pages
Use these pages to review the topic behind this question. This is how one missed question becomes focused revision.
SAA-C03 VPC practice questions
Practise SAA-C03 questions linked to SAA-C03 VPC.
SAA-C03 S3 lifecycle policy questions
Practise SAA-C03 questions linked to SAA-C03 S3 lifecycle policy questions.
SAA-C03 RDS Multi-AZ questions
Practise SAA-C03 questions linked to SAA-C03 RDS Multi-AZ questions.
SAA-C03 IAM policy practice questions
Practise SAA-C03 questions linked to SAA-C03 IAM policy.
SAA-C03 Route 53 failover questions
Practise SAA-C03 questions linked to SAA-C03 Route 53 failover questions.
SAA-C03 CloudFront practice questions
Practise SAA-C03 questions linked to SAA-C03 CloudFront.
SAA-C03 NAT gateway questions
Practise SAA-C03 questions linked to SAA-C03 NAT gateway questions.
SAA-C03 VPC endpoint questions
Practise SAA-C03 questions linked to SAA-C03 VPC endpoint questions.
SAA-C03 Auto Scaling practice questions
Practise SAA-C03 questions linked to SAA-C03 Auto Scaling.
SAA-C03 disaster recovery questions
Practise SAA-C03 questions linked to SAA-C03 disaster recovery questions.
SAA-C03 high availability questions
Practise SAA-C03 questions linked to SAA-C03 high availability questions.
SAA-C03 cost optimization questions
Practise SAA-C03 questions linked to SAA-C03 cost optimization questions.
More questions from this exam
Keep practising from the same exam bank, or move into a focused topic page if this question exposed a weak area.
Question 1
A team needs to distribute TCP traffic (not HTTP) across multiple services. The services must see the original client source IP for auditing. Which AWS load balancer is the best fit?
Question 2
A team wants to run containerized services with AWS-managed orchestration and autoscaling. They do NOT require Kubernetes compatibility. Which AWS service choice is most appropriate to meet these goals?
Question 3
A solutions architect is designing an S3 bucket for a IoT ingestion API. The objects must never be publicly accessible, even if a developer later adds an overly broad bucket policy. What should the architect configure? The design must avoid adding custom operational scripts.
Question 4
A solutions architect is designing an S3 bucket for a claims portal. The objects must never be publicly accessible, even if a developer later adds an overly broad bucket policy. What should the architect configure?
Question 5
A team wants to delegate IAM management to developers, but must ensure developers can never grant themselves permissions beyond a specific limit. Which AWS mechanism best matches this requirement?
Question 6
A solutions architect is designing an S3 bucket for a healthcare document service. The objects must never be publicly accessible, even if a developer later adds an overly broad bucket policy. What should the architect configure?
FAQ
Questions learners often ask
What does this SAA-C03 question test?
Read the scenario before looking for a memorised answer.
What is the correct answer to this question?
The correct answer is: Create a target tracking scaling policy using an ALB metric such as RequestCountPerTarget or TargetResponseTime. — Because CPU stays low, the timeouts are most consistent with a capacity/healthy-target problem relative to the request rate (for example, requests are queuing at the ALB because there are not enough healthy targets). A target tracking policy that uses ALB metrics like RequestCountPerTarget or TargetResponseTime scales in response to load and latency experienced through the ALB. When those metrics rise during a spike, the ASG adds instances so the ALB can route requests to more healthy targets, reducing queueing and preventing timeouts. A can raise baseline capacity, but the ASG still scales based on CPU, which is not the bottleneck in the scenario. C improves measurement frequency but still relies on a metric that remains low (<40%), so it may not trigger scale-out during spikes. D relies on network-level signals that do not directly reflect ALB queuing/latency and can miss the performance symptoms that users see.
What should I do if I get this SAA-C03 question wrong?
Then try more questions from the same exam bank and focus on understanding why the wrong options are tempting.
Discussion
Sign in to join the discussion.