Cloud Architecture and Design · easy · Multiple Choice · Objective-mapped

Quick Answer

The correct answer is horizontal auto-scaling based on CPU utilization. This strategy directly implements cloud elasticity: the application adds instances (scales out) when CPU load rises and removes them (scales in) when demand drops, so it absorbs unpredictable traffic spikes while keeping costs low. On the CompTIA Cloud+ CV0-004 exam, this question tests your understanding of scaling policies and the distinction between horizontal and vertical scaling. A common trap is selecting vertical scaling, which adds power to a single instance, is capped by the largest available instance size, and may require downtime to resize. Memory tip: “Horizontal handles the herd.” When traffic is unpredictable, think of adding more servers (horizontal) rather than a bigger server (vertical).

CV0-004 Cloud Architecture and Design Practice Question

This CV0-004 practice question tests your understanding of cloud architecture and design. Read the scenario carefully and evaluate each option against the stated constraints before committing to an answer. Then compare your reasoning against the explanation and wrong-answer breakdown below to see why each distractor is designed to mislead on exam day.

An architect is designing a cloud application that must handle unpredictable spikes in traffic. The application should automatically add resources during peak demand and remove them when demand decreases to minimize costs. Which scaling strategy should be used?

Clue words in this question

Noticing these words before you look at the options changes how you read each choice.

  • Clue: "minimum / minimize"

    Why it matters: Here the phrase is "minimize costs", so the question asks for the lowest spend that still meets demand. Eliminate options that keep capacity running when it is not needed, even if they would technically handle the load.

Answer choices

  • Scheduled scaling based on historical patterns
  • Vertical scaling of existing instances
  • Manual scaling by operations team
  • Horizontal auto-scaling based on CPU utilization

Correct answer & explanation

Horizontal auto-scaling based on CPU utilization

Horizontal auto-scaling based on CPU utilization is the correct strategy because it dynamically adds or removes instances in response to real-time demand, ensuring the application can handle unpredictable traffic spikes while minimizing costs. This approach aligns with cloud elasticity principles, where resources scale out (add instances) during high CPU load and scale in (remove instances) when load decreases, without manual intervention.
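The cost argument in this explanation can be illustrated with a small calculation. The sketch below is not part of the exam content: the hourly demand figures and the function names are made up for illustration, and it assumes an idealized scaler that matches demand exactly in every hour (a real auto-scaler lags behind demand).

```python
# Hypothetical hourly demand, expressed as "instances needed" (illustrative numbers only).
demand = [2, 2, 2, 3, 9, 10, 4, 2]


def static_instance_hours(demand):
    """Fixed fleet sized for the busiest hour and left running every hour."""
    return max(demand) * len(demand)


def elastic_instance_hours(demand, minimum=1):
    """Fleet that follows demand each hour, never dropping below a floor."""
    return sum(max(needed, minimum) for needed in demand)


static_hours = static_instance_hours(demand)
elastic_hours = elastic_instance_hours(demand)
print(static_hours, elastic_hours)  # prints "80 34"
```

Under these assumed numbers, the fleet sized for the peak pays for 80 instance-hours while the elastic fleet pays for 34, which is the cost effect the question's "minimize costs" constraint points at.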

Key principle: Answer the scenario, not the keyword: identify the specific constraint before choosing the most familiar-sounding option.

Answer analysis

Option-by-option breakdown

For each option: why learners choose it and why it is or isn't the right answer here.

  • Scheduled scaling based on historical patterns

    Why it's wrong here

    A schedule only covers demand patterns seen in the past, so it cannot react to a spike that arrives off-schedule.

  • Vertical scaling of existing instances

    Why it's wrong here

    Resizing a single instance is capped by the largest available instance size and may require downtime while the instance is resized.

  • Manual scaling by operations team

    Why it's wrong here

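    An operations team cannot react quickly enough to unpredictable spikes, and the scenario asks for resources to be added and removed automatically.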

  • Horizontal auto-scaling based on CPU utilization

    Why this is correct

    Automatically adds/removes instances as needed.

    Clue confirmation

    The clue word "minimum / minimize" in the question points toward this answer.

    Related concept

    Read the scenario before looking for a memorised answer.
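To make the scheduled-scaling distractor concrete, here is a rough sketch of why a schedule built from historical patterns misses an unexpected spike. The schedule, the demand figures, and the helper name `scheduled_capacity` are all hypothetical and chosen only for illustration.

```python
# Hypothetical schedule: hour of day -> instances provisioned (built from past patterns).
schedule = {9: 6, 10: 6, 11: 6}
DEFAULT_CAPACITY = 2  # assumed baseline outside the scheduled hours

# Hypothetical actual demand: hour of day -> instances needed.
# Hour 3 is a spike the historical schedule did not anticipate.
demand = {3: 8, 9: 5, 10: 6}


def scheduled_capacity(hour, schedule, default=DEFAULT_CAPACITY):
    """Capacity the schedule provides at a given hour."""
    return schedule.get(hour, default)


# Shortfall = instances needed beyond what the schedule provided.
shortfall = {
    hour: max(0, needed - scheduled_capacity(hour, schedule))
    for hour, needed in demand.items()
}
print(shortfall)  # prints "{3: 6, 9: 0, 10: 0}"
```

The scheduled hours are covered, but the off-schedule spike at hour 3 is short by six instances, which is the gap a metric-driven policy is meant to close.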

Common exam traps

Common exam trap: answer the scenario, not the keyword

The trap here is confusing vertical scaling (scaling up) with horizontal scaling (scaling out). Candidates often assume that resizing existing instances is more cost-effective, but vertical scaling has hard limits and cannot match the elasticity required for unpredictable spikes.

Detailed technical explanation

How to think about this question

Under the hood, horizontal auto-scaling watches a metric such as CPU utilization (e.g., average > 70% for 5 minutes). When the threshold is crossed, the auto-scaling group triggers a scale-out event and launches new instances from a pre-configured Amazon Machine Image (AMI) or template. One subtle behavior is the cooldown period (e.g., 300 seconds), which prevents rapid scaling oscillations. In real-world deployments, you might combine CPU with a custom metric such as request queue depth to avoid scaling on transient CPU spikes.
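The threshold-and-cooldown behavior described above can be sketched in a few lines. This is a simplified model, not any cloud provider's API: the class names, the 30% scale-in threshold, and the instance limits are assumptions for illustration, and each CPU value passed in is assumed to already be the averaged reading for the evaluation window.

```python
from dataclasses import dataclass


@dataclass
class ScalingPolicy:
    scale_out_above: float = 70.0  # average CPU % that triggers scale-out (from the text)
    scale_in_below: float = 30.0   # assumed scale-in threshold, not given in the text
    cooldown_seconds: int = 300    # cooldown example from the text
    min_instances: int = 1         # assumed floor
    max_instances: int = 10        # assumed ceiling


class AutoScaler:
    def __init__(self, policy, instances=1):
        self.policy = policy
        self.instances = instances
        self.last_action_at = None  # timestamp (seconds) of the last scaling action

    def evaluate(self, now, avg_cpu):
        """Apply the policy to one averaged CPU sample; return the instance count."""
        p = self.policy
        in_cooldown = (
            self.last_action_at is not None
            and now - self.last_action_at < p.cooldown_seconds
        )
        if in_cooldown:
            return self.instances  # ignore the metric until the cooldown expires
        if avg_cpu > p.scale_out_above and self.instances < p.max_instances:
            self.instances += 1    # scale out by one instance
            self.last_action_at = now
        elif avg_cpu < p.scale_in_below and self.instances > p.min_instances:
            self.instances -= 1    # scale in by one instance
            self.last_action_at = now
        return self.instances


scaler = AutoScaler(ScalingPolicy(), instances=2)
samples = [(0, 85.0), (60, 90.0), (300, 88.0), (600, 20.0), (900, 50.0)]
history = [scaler.evaluate(t, cpu) for t, cpu in samples]
print(history)  # prints "[3, 3, 4, 3, 3]"
```

The high reading at 60 seconds is ignored because it falls inside the cooldown that started at 0 seconds, which is how the cooldown dampens oscillation. Adding one instance per step is a design choice made here for simplicity.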

Key Concepts to Remember

  • Read the scenario before looking for a memorised answer.
  • Find the constraint that changes the correct option.
  • Eliminate answers that are true in general but not in this case.

Exam Day Tips

  • Watch for words such as best, first, most likely and least administrative effort.
  • Review why wrong options are wrong, not only why the correct option is correct.

Key takeaway

Answer the scenario, not the keyword: identify the specific constraint before choosing the most familiar-sounding option.

Real-world example

How this comes up in practice

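As a hypothetical example, an online store normally serves its traffic from two instances. When an unplanned promotion sends traffic surging, average CPU crosses the scale-out threshold and the auto-scaling group launches additional instances. Once the surge passes and CPU falls, the extra instances are removed so the store stops paying for them. Scaling questions test whether you can match the strategy (horizontal, vertical, scheduled, or manual) to the constraint stated in the scenario.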

What to study next

Got this wrong? Here's your next step.

Identify which exam domain this question belongs to, review the core concept, then practise similar questions from the same domain.

FAQ

Questions learners often ask

What does this CV0-004 question test?

It tests the Cloud Architecture and Design domain: specifically, whether you can read the scenario and pick the scaling strategy that fits its constraints rather than a memorised answer.

What is the correct answer to this question?

The correct answer is horizontal auto-scaling based on CPU utilization. It adds instances when CPU load is high and removes them when load decreases, without manual intervention, so the application handles unpredictable traffic spikes while minimizing costs.

Are there clue words in this question I should notice?

Yes. Watch for "minimum / minimize". Here it appears as "minimize costs", so the question asks for the lowest spend that still meets demand. Eliminate options that keep capacity running when it is not needed, even if they would technically handle the load.

About these practice questions

Courseiva creates original exam-style practice questions with explanations and wrong-answer analysis. It does not publish real exam questions, exam dumps, or protected exam content.

Same concept, more angles

1 more way this is tested on CV0-004

These questions test the same concept from different angles. Work through them to make sure you can recognise it however the exam phrases it.

Variation 1. A startup is deploying a web application on a public cloud and expects variable traffic throughout the day. The team wants to minimize costs while ensuring that the application can handle sudden spikes in demand. Which scaling strategy best meets these requirements?

easy
  • A. Auto scaling based on CPU utilization thresholds
  • B. Horizontal scaling using a fixed schedule
  • C. Vertical scaling during off-peak hours
  • D. Manual scaling based on historical data

Why A: Auto scaling based on CPU utilization thresholds is correct because it adjusts the number of compute instances in response to real-time demand, so the application can handle sudden spikes while costs stay low during low-traffic periods. This fits the startup's variable traffic and cost constraints: resources are provisioned only when needed, whereas fixed schedules and manual interventions cannot react to unpredictable spikes.

Last reviewed: Jun 11, 2026

This CV0-004 practice question is part of Courseiva's free CompTIA certification practice question bank. Courseiva provides original exam-style practice questions with explanations, topic-based practice, mock exams, readiness tracking, and study analytics to help learners prepare for the CV0-004 exam.