- A
Enable container concurrency.
Container concurrency allows multiple requests per container, increasing throughput.
- B
Use Cloud Load Balancing.
Why wrong: Cloud Run already distributes traffic; an external LB is not needed for a single service.
- C
Enable CPU always on allocation.
Why wrong: CPU always on is for background processing, not scaling.
- D
Set max instances to a high value.
High max instances allows scaling out to handle more requests.
- E
Set min instances to zero to save cost.
Why wrong: Zero min instances causes cold starts during spikes.
Quick Answer
The answer is to enable container concurrency and set max instances to a high value. Container concurrency allows a single Cloud Run instance to handle multiple simultaneous requests—up to 1,000—which dramatically boosts throughput during traffic spikes without needing to spin up new containers. Setting max instances high ensures the service can scale out aggressively, creating enough parallel instances to absorb sudden load and prevent request queuing or cold starts. On the Google Professional Cloud Developer exam, this tests your understanding of Cloud Run’s dual scaling dimensions: vertical (concurrency per instance) and horizontal (instance count). A common trap is thinking only one dimension matters—candidates often max instances but forget concurrency, or vice versa. Remember the mnemonic “Concurrency for density, Max for capacity” to recall that concurrency packs requests into fewer containers, while max instances gives you the ceiling to burst outward.
PCD Practice Question: Designing highly scalable, available, and reliable cloud-native applications
This PCD practice question tests your understanding of designing highly scalable, available, and reliable cloud-native applications. This is a configuration task: choose the command set that satisfies every stated requirement. Small differences — like 'secret' vs 'password' or 'transport input ssh' vs 'all' — change whether the answer is correct. After answering, compare your reasoning against the explanation and wrong-answer breakdown below. Once you have made your selection, read the full explanation to reinforce the concept and understand why each distractor is designed to mislead on exam day.
A company is using Cloud Run for a stateless API. They want to ensure that the service can handle sudden traffic spikes. Which two features should they configure?
Answer choices
Why each option matters
Answer the question above first, then reveal the full breakdown to understand why each option is right or wrong.
Correct answer & explanation
Enable container concurrency.
Option A is correct because enabling container concurrency allows a single Cloud Run container instance to handle multiple requests simultaneously, up to the configured concurrency limit (default 80, max 1000). This improves throughput and resource utilization during traffic spikes without requiring additional instances. Option D is correct because setting max instances to a high value ensures the service can scale out to handle sudden load by creating more container instances, up to the configured maximum, preventing cold starts and request queuing.
Key principle: Answer the scenario, not the keyword: identify the specific constraint before choosing the most familiar-sounding option.
Answer analysis
Option-by-option breakdown
For each option: why learners choose it and why it is or isn't the right answer here.
- ✓
Enable container concurrency.
Why this is correct
Container concurrency allows multiple requests per container, increasing throughput.
Related concept
Read the scenario before looking for a memorised answer.
- ✗
Use Cloud Load Balancing.
Why it's wrong here
Cloud Run already distributes traffic; an external LB is not needed for a single service.
- ✗
Enable CPU always on allocation.
Why it's wrong here
CPU always on is for background processing, not scaling.
- ✓
Set max instances to a high value.
Why this is correct
High max instances allows scaling out to handle more requests.
Related concept
Read the scenario before looking for a memorised answer.
- ✗
Set min instances to zero to save cost.
Why it's wrong here
Zero min instances causes cold starts during spikes.
Common exam traps
Common exam trap: answer the scenario, not the keyword
Cisco often tests the misconception that Cloud Load Balancing is required for scaling Cloud Run, but Cloud Run's built-in autoscaling and managed HTTPS load balancer already handle traffic spikes; the trap is that candidates confuse external load balancing with internal scaling mechanisms.
Detailed technical explanation
How to think about this question
Under the hood, Cloud Run uses Knative serving, where container concurrency is implemented via the `containerConcurrency` field in the Knative Service spec, controlling how many requests are routed to a single container instance. Setting max instances to a high value (e.g., 1000) allows the Knative autoscaler to create more pods (instances) based on concurrency metrics, but note that Cloud Run has a soft limit of 1000 instances per region per project, and exceeding this requires a quota increase. In a real-world scenario, a service with concurrency set to 80 and max instances set to 1000 can handle up to 80,000 concurrent requests, but if max instances is left at the default (100), the service would be limited to 8,000 concurrent requests, potentially causing 429 errors during spikes.
KKey Concepts to Remember
- Read the scenario before looking for a memorised answer.
- Find the constraint that changes the correct option.
- Eliminate answers that are true in general but not in this case.
TExam Day Tips
- Watch for words such as best, first, most likely and least administrative effort.
- Review why wrong options are wrong, not only why the correct option is correct.
Key takeaway
Answer the scenario, not the keyword: identify the specific constraint before choosing the most familiar-sounding option.
Real-world example
How this comes up in practice
A startup's cloud architect reviews their monthly bill and notices costs are higher than expected for a long-running batch job. Switching from on-demand instances to Reserved Instances — or using Spot/Preemptible VMs — can reduce compute costs by up to 72 %. Questions like this test whether you understand the tradeoffs between commitment, flexibility, and cost across cloud pricing models.
What to study next
Got this wrong? Here's your next step.
Identify which exam domain this question belongs to, review the core concept, then practise similar questions from the same domain.
- →
Designing highly scalable, available, and reliable cloud-native applications — study guide chapter
Learn the concepts, then practise the questions
- →
Designing highly scalable, available, and reliable cloud-native applications practice questions
Targeted practice on this topic area only
- →
All PCD questions
500 questions across all exam domains
- →
Google Professional Cloud Developer study guide
Full concept coverage aligned to exam objectives
- →
PCD practice test guide
How to use practice tests most effectively before exam day
Related practice questions
Related PCD practice-question pages
Use these pages to review the topic behind this question. This is how one missed question becomes focused revision.
Designing highly scalable, available, and reliable cloud-native applications practice questions
Practise PCD questions linked to Designing highly scalable, available, and reliable cloud-native applications.
Building and testing applications practice questions
Practise PCD questions linked to Building and testing applications.
Deploying applications practice questions
Practise PCD questions linked to Deploying applications.
Integrating Google Cloud services practice questions
Practise PCD questions linked to Integrating Google Cloud services.
Managing application performance monitoring practice questions
Practise PCD questions linked to Managing application performance monitoring.
PCD fundamentals practice questions
Practise PCD questions linked to PCD fundamentals.
PCD scenario practice questions
Practise PCD questions linked to PCD scenario.
PCD troubleshooting practice questions
Practise PCD questions linked to PCD troubleshooting.
Practice this exam
Start a free PCD practice session
Short sessions build daily habit. Longer sessions build exam-day stamina. Try a timed session to simulate real conditions.
FAQ
Questions learners often ask
What does this PCD question test?
Designing highly scalable, available, and reliable cloud-native applications — This question tests Designing highly scalable, available, and reliable cloud-native applications — Read the scenario before looking for a memorised answer..
What is the correct answer to this question?
The correct answer is: Enable container concurrency. — Option A is correct because enabling container concurrency allows a single Cloud Run container instance to handle multiple requests simultaneously, up to the configured concurrency limit (default 80, max 1000). This improves throughput and resource utilization during traffic spikes without requiring additional instances. Option D is correct because setting max instances to a high value ensures the service can scale out to handle sudden load by creating more container instances, up to the configured maximum, preventing cold starts and request queuing.
What should I do if I get this PCD question wrong?
Identify which exam domain this question belongs to, review the core concept, then practise similar questions from the same domain.
What is the key concept behind this question?
Read the scenario before looking for a memorised answer.
About these practice questions
Courseiva creates original exam-style practice questions with explanations and wrong-answer analysis. It does not publish real exam questions, exam dumps, or protected exam content. Learn why practice questions differ from exam dumps →
Last reviewed: Jun 25, 2026
This PCD practice question is part of Courseiva's free Google Cloud certification practice question bank. Courseiva provides original exam-style practice questions with explanations, topic-based practice, mock exams, readiness tracking, and study analytics to help learners prepare for the PCD exam.
Question Discussion
Share a tip, memory trick, or ask about the reasoning behind this question. Do not post real exam questions, leaked content, braindumps, or copyrighted exam material. Comments are moderated and may be removed without notice.
Sign in to join the discussion.