Question 960 of 997
Develop Azure compute solutionseasyMultiple ChoiceObjective-mapped

Quick Answer

The answer is the HorizontalPodAutoscaler (HPA), which is the correct Kubernetes resource to configure for automatically scaling pod replicas based on HTTP request load in Azure Kubernetes Service (AKS). HPA works by continuously monitoring custom metrics—such as HTTP request rate—along with standard CPU or memory usage, and then adjusting the replica count of a Deployment or ReplicaSet to match a defined target. On the Microsoft Azure Developer Associate AZ-204 exam, this concept tests your understanding of scaling strategies within AKS, often appearing in scenario-based questions where you must distinguish between HPA, Cluster Autoscaler, and manual scaling. A common trap is confusing HPA with the Cluster Autoscaler, but remember: HPA scales pods within a cluster, while Cluster Autoscaler scales the node pool itself. For a quick memory tip, think “HPA handles the pods, CA handles the nodes.”

AZ-204 Develop Azure compute solutions Practice Question

This AZ-204 practice question tests your understanding of develop azure compute solutions. This is a configuration task: choose the command set that satisfies every stated requirement. Small differences — like 'secret' vs 'password' or 'transport input ssh' vs 'all' — change whether the answer is correct. After answering, compare your reasoning against the explanation and wrong-answer breakdown below. Once you have made your selection, read the full explanation to reinforce the concept and understand why each distractor is designed to mislead on exam day.

Your team develops a containerized web app using Azure Kubernetes Service (AKS). You need to ensure that the application can automatically scale based on HTTP request load. Which Kubernetes resource should you configure?

Question 1easymultiple choice
Full question →

Answer choices

Why each option matters

Answer the question above first, then reveal the full breakdown to understand why each option is right or wrong.

Correct answer & explanation

HorizontalPodAutoscaler

The HorizontalPodAutoscaler (HPA) is the correct Kubernetes resource for automatically scaling the number of pod replicas based on observed CPU, memory, or custom metrics like HTTP request rate. In an AKS cluster, HPA adjusts the replica count of a Deployment or ReplicaSet to match the target metric, enabling the application to handle varying HTTP load without manual intervention.

Key principle: Answer the scenario, not the keyword: identify the specific constraint before choosing the most familiar-sounding option.

Answer analysis

Option-by-option breakdown

For each option: why learners choose it and why it is or isn't the right answer here.

  • VerticalPodAutoscaler

    Why it's wrong here

    Adjusts resource requests/limits, not scaling on load.

  • PodDisruptionBudget

    Why it's wrong here

    Ensures minimum available pods during disruptions.

  • HorizontalPodAutoscaler

    Why this is correct

    Correctly scales based on load metrics.

    Related concept

    Read the scenario before looking for a memorised answer.

  • NetworkPolicy

    Why it's wrong here

    Controls pod-to-pod communication.

Common exam traps

Common exam trap: answer the scenario, not the keyword

The trap here is that candidates often confuse HorizontalPodAutoscaler with VerticalPodAutoscaler, mistakenly thinking that adjusting pod resources (CPU/memory) is the correct way to handle HTTP load, when in fact HPA scales the number of pod replicas horizontally to distribute the load.

Detailed technical explanation

How to think about this question

HPA works by querying the Kubernetes Metrics Server (or a custom metrics adapter like Prometheus) at a default interval of 15 seconds, comparing the current metric value (e.g., average CPU utilization across pods) against a target threshold, and calculating the desired replica count using the formula `desiredReplicas = ceil[currentReplicas * (currentMetricValue / targetMetricValue)]`. For HTTP-based scaling, you would configure a custom metric from an ingress controller (e.g., NGINX Ingress) exposing requests-per-second, or use the Kubernetes Event-Driven Autoscaler (KEDA) for more advanced event-driven scaling.

KKey Concepts to Remember

  • Read the scenario before looking for a memorised answer.
  • Find the constraint that changes the correct option.
  • Eliminate answers that are true in general but not in this case.

TExam Day Tips

  • Watch for words such as best, first, most likely and least administrative effort.
  • Review why wrong options are wrong, not only why the correct option is correct.

Key takeaway

Answer the scenario, not the keyword: identify the specific constraint before choosing the most familiar-sounding option.

Real-world example

How this comes up in practice

An e-commerce site experiences heavy traffic on Black Friday and near-zero traffic during off-peak weeks. Rather than provisioning permanent large VMs, the team uses auto-scaling groups that add capacity automatically under load and reduce it overnight. Questions like this test whether you understand elasticity, availability zones, and cloud compute scaling patterns.

What to study next

Got this wrong? Here's your next step.

Identify which exam domain this question belongs to, review the core concept, then practise similar questions from the same domain.

Related practice questions

Related AZ-204 practice-question pages

Use these pages to review the topic behind this question. This is how one missed question becomes focused revision.

Practice this exam

Start a free AZ-204 practice session

Short sessions build daily habit. Longer sessions build exam-day stamina. Try a timed session to simulate real conditions.

FAQ

Questions learners often ask

What does this AZ-204 question test?

Develop Azure compute solutions — This question tests Develop Azure compute solutions — Read the scenario before looking for a memorised answer..

What is the correct answer to this question?

The correct answer is: HorizontalPodAutoscaler — The HorizontalPodAutoscaler (HPA) is the correct Kubernetes resource for automatically scaling the number of pod replicas based on observed CPU, memory, or custom metrics like HTTP request rate. In an AKS cluster, HPA adjusts the replica count of a Deployment or ReplicaSet to match the target metric, enabling the application to handle varying HTTP load without manual intervention.

What should I do if I get this AZ-204 question wrong?

Identify which exam domain this question belongs to, review the core concept, then practise similar questions from the same domain.

What is the key concept behind this question?

Read the scenario before looking for a memorised answer.

About these practice questions

Courseiva creates original exam-style practice questions with explanations and wrong-answer analysis. It does not publish real exam questions, exam dumps, or protected exam content. Learn why practice questions differ from exam dumps →

How Courseiva writes practice questions · Editorial policy

Last reviewed: Jun 24, 2026

Question Discussion

Share a tip, memory trick, or ask about the reasoning behind this question. Do not post real exam questions, leaked content, braindumps, or copyrighted exam material. Comments are moderated and may be removed without notice.

Loading comments…

Sign in to join the discussion.

This AZ-204 practice question is part of Courseiva's free Microsoft certification practice question bank. Courseiva provides original exam-style practice questions with explanations, topic-based practice, mock exams, readiness tracking, and study analytics to help learners prepare for the AZ-204 exam.