SAA-C03 Design High-Performing Architectures — All Questions With Answers

Question 1 (medium, multi-select)

A Lambda function behind API Gateway has predictable traffic spikes every hour. The function does not need access to resources in a VPC, and p95 latency spikes are caused by cold starts during scale-out. Which two actions are most effective? Select two.

Question 2 (medium, multi-select)

An Aurora PostgreSQL application has an OLTP writer and a reporting dashboard that issues many read-only queries. The writer is healthy, but read latency rises noticeably during reporting windows. Which two changes should you make? Select two.

Question 3 (medium, multiple choice)

A production application writes to an Amazon Aurora PostgreSQL cluster. Users report that during business-hour reporting runs, write latency increases. The application team wants to keep the writer focused on OLTP writes while still providing low-latency reads for reporting queries. What architectural approach should the solutions architect recommend?

Question 4 (medium, multiple choice)

A DynamoDB table stores device status items. The partition key is deviceId, and the partition distribution is healthy (no single partition dominates). However, during peak periods the application experiences high read latency because many clients repeatedly request the latest status for the same devices. Which action best improves read latency without changing the DynamoDB partitioning model?

Question 5 (medium, multi-select)

A team is splitting a new workload into two fronts. The first front serves HTTPS microservices that need host- and path-based routing plus health checks. The second front must handle TCP and UDP traffic for a real-time service and preserve static IP addresses for firewall allowlisting. Which two AWS load balancer choices best match these requirements? Select two.

Question 6 (medium, multiple choice)

An API team runs an AWS Lambda function behind an Application Load Balancer (ALB). During predictable hourly traffic spikes, p95 response latency increases due to occasional cold starts. The team wants stable latency during those spikes without permanently overprovisioning resources for all functions. Which configuration is the most appropriate way to reduce cold starts for this Lambda function?

Question 7 (medium, multi-select)

A distributed simulation launches 40 EC2 instances that exchange small packets frequently and are sensitive to cross-instance latency. The workload stays in one Availability Zone and can use the same instance family across nodes. Which two choices improve network performance the most? Select two.

Question 8 (medium, multiple choice)

A Lambda function behind an API needs consistently low latency. Traffic normally drops to near zero, then spikes several times per hour. During these spikes, p95 latency often rises above 800 ms because of cold starts. The team wants to keep using Lambda (no containers) but minimize cold start impact during the predictable spikes. What is the best AWS configuration to meet this goal?

Question 9 (medium, multiple choice)

A media processing service runs ECS tasks in multiple Availability Zones. Each task must read and write the same shared filesystem with low latency because tasks stream intermediate artifacts to other tasks. The team currently mounts an EBS volume per task, and cross-AZ tasks frequently cannot see each other’s files. Which option best resolves the shared filesystem requirement while supporting high-performing access?

Question 10 (easy, multiple choice)

An application uses DynamoDB to store order status. Reads happen extremely frequently for the same few keys (for example, the most recent orders), and the team wants lower read latency without changing the table’s partition key design. Which AWS service best fits this requirement?

Question 11 (medium, multiple choice)

A company serves the same public content to many users through Amazon CloudFront. The origin is experiencing increased fetches because CloudFront cache hit rate is dropping. Most requests include an Authorization header and a custom header that changes per user. The response content is identical regardless of these headers. What change should the solutions architect make to restore a high cache hit rate?

Question 12 (medium, multiple choice)

Your team runs a tightly coupled distributed workload (for example, synchronous training nodes) across many EC2 instances placed within a single cluster environment. The instances need low-latency networking to reduce delays at synchronization barriers. Which EC2 placement strategy should you use to improve inter-node latency?

Question 13 (medium, multi-select)

An order lookup API repeatedly reads the same few items from DynamoDB. The application can tolerate slightly stale data for a few seconds, and the team wants the lowest-latency design with minimal application changes. Which two changes should they make? Select two.

Question 14 (easy, multiple choice)

A team needs to distribute TCP traffic (not HTTP) across multiple services. The services must see the original client source IP for auditing. Which AWS load balancer is the best fit?

Question 15 (easy, multiple choice)

Your team hosts versioned static assets (for example, /static/app-<buildHash>.js). Each build hash never changes, but you release new files on new URLs. To maximize cache hit rate and reduce origin load using CloudFront, what should you do when generating HTTP responses for these assets?
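
The scenario above can be illustrated with a minimal sketch of how an origin might choose `Cache-Control` values. It assumes that content-hashed filenames look like `/static/app-<hex>.js`; the regular expression, the `cache_headers` helper, and the one-year lifetime are hypothetical choices for illustration, not settings taken from the question.

```python
import re

# Hypothetical naming rule: a hex build hash of 8+ characters before the extension.
HASHED_ASSET = re.compile(r"^/static/.+-[0-9a-f]{8,}\.(js|css)$")

ONE_YEAR_SECONDS = 31_536_000  # 365 days


def cache_headers(path: str) -> dict:
    """Return response headers for a request path.

    Hashed assets never change at a given URL, so they can be cached for a
    long time. Everything else is revalidated on each request.
    """
    if HASHED_ASSET.match(path):
        value = f"public, max-age={ONE_YEAR_SECONDS}, immutable"
        return {"Cache-Control": value}
    return {"Cache-Control": "no-cache"}


print(cache_headers("/static/app-3f9c2a1b7d.js"))
print(cache_headers("/index.html"))
```

Because each build publishes new URLs, a long lifetime on the hashed files does not risk serving stale content, while unhashed entry points such as `/index.html` stay fresh.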

Question 16 (medium, multiple choice)

A research team runs a latency-sensitive distributed training job on Amazon EC2. They deploy 80 identical nodes that exchange small messages frequently and need low network jitter. The job must run entirely within one Availability Zone. Which placement group strategy should a solutions architect use to maximize intra-cluster network performance?

Question 17 (medium, multiple choice)

A game streaming service must use UDP for real-time gameplay traffic. For external firewall allowlisting, the service requires stable, static IP addresses. The TLS handshake must be handled end-to-end by the application servers (the load balancer must not terminate TLS). Which AWS load balancing option best fits these requirements?

Question 18 (easy, multiple choice)

A company runs an Amazon RDS for PostgreSQL database. The application performs frequent OLTP writes, but it also has a separate dashboard that runs heavy SELECT queries and is slowing down overall database performance. The writes must remain on the primary. What is the best approach to improve performance for the dashboard?

Question 19 (medium, multiple choice)

A startup runs an HTTP/2 API that also supports WebSocket connections. They need path-based routing to separate microservices (for example, /api/* to Service A and /metrics/* to Service B) and want TLS terminated at the load balancer. Which AWS option best meets these requirements while maintaining high request performance?

Question 20 (easy, multiple choice)

A backend API uses an AWS Lambda function behind API Gateway. The first requests after every weekly deployment experience cold starts, causing p95 latency spikes for a few minutes. Which configuration most directly prevents those cold starts for the published version?

Question 21 (medium, multiple choice)

A DynamoDB table uses this schema: partition key = customerId, sort key = timestamp. During a marketing campaign, one customer generates extremely high read traffic and the application sees ProvisionedThroughputExceededException errors even though the table’s total capacity is sufficient. What change most directly improves read distribution across partitions?
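
One commonly discussed way to spread a hot key across partitions is to append a shard suffix to the partition key. The sketch below only illustrates that idea under stated assumptions: the shard count, the `sharded_key` and `all_shard_keys` helpers, and the choice to derive the shard from the timestamp are all hypothetical, and the snippet does not call DynamoDB.

```python
import hashlib

SHARD_COUNT = 8  # hypothetical; tune to the observed traffic


def sharded_key(customer_id: str, timestamp: str, shards: int = SHARD_COUNT) -> str:
    """Derive a partition key such as 'cust-1#5' from the item's sort key.

    Hashing the timestamp keeps the mapping deterministic, so a writer and a
    point reader compute the same shard for the same item.
    """
    digest = hashlib.sha256(timestamp.encode("utf-8")).digest()
    shard = int.from_bytes(digest[:4], "big") % shards
    return f"{customer_id}#{shard}"


def all_shard_keys(customer_id: str, shards: int = SHARD_COUNT) -> list:
    """Keys a range reader must query (one query per shard) and then merge."""
    return [f"{customer_id}#{i}" for i in range(shards)]


print(sharded_key("cust-1", "2024-05-01T12:00:00Z"))
print(all_shard_keys("cust-1"))
```

The trade-off is that a query for one customer's full history now fans out to every shard key and merges the results in the application.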

Question 22 (hard, multiple choice)

Based on the exhibit, what change should the team make to achieve the lowest possible network latency for the distributed workload?

Exhibit

Current deployment summary:
- 48 EC2 instances run a tightly coupled simulation engine.
- Instances are spread across us-east-1a and us-east-1b.
- Each worker exchanges small TCP messages every 5-10 ms with all other workers.
- Measured east-west RTT: 4.9 ms average, 17.2 ms p95.
- The application owner states that the workload can run in a single Availability Zone if that improves performance.
- No external clients access the cluster directly.

Question 23 (medium, multi-select)

A marketing site serves versioned JavaScript and CSS files from Amazon S3 through CloudFront. Origin bandwidth costs are rising because CloudFront keeps revalidating objects and fetching too much content from the bucket. Which two changes most directly improve cache hit ratio and reduce origin load? Select two.

Question 24 (medium, multiple choice)

An Aurora PostgreSQL cluster is experiencing high read latency because 85% of traffic consists of read-only queries. The write workload must stay on the writer instance, and the team wants to offload reads without changing the application’s core query patterns. What is the best architectural option?

Question 25 (easy, multiple choice)

Based on the exhibit, which EBS volume type should the team use to meet the performance need at lower cost than overprovisioning capacity?

Exhibit

Database storage review:
- Current volume type: gp2
- Peak Read/Write IOPS observed: 9,700
- VolumeQueueLength increases during busy periods
- ReadLatency reaches 8-12 ms
- Requirement: provision about 10,000 IOPS without buying much extra capacity
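
To make the capacity trade-off in this exhibit concrete, the following sketch estimates how large a gp2 volume would need to be to reach the required IOPS through its size-based baseline alone. It assumes the published gp2 baseline of 3 IOPS per GiB (minimum 100, maximum 16,000 per volume) and the gp3 baseline of 3,000 IOPS with additional IOPS provisioned separately from size. The function names are hypothetical.

```python
import math

GP2_IOPS_PER_GIB = 3       # assumed gp2 baseline ratio
GP2_MIN_IOPS = 100         # assumed gp2 floor
GP2_MAX_IOPS = 16_000      # assumed gp2 per-volume ceiling
GP3_BASELINE_IOPS = 3_000  # assumed gp3 included baseline


def gp2_size_for_iops(target_iops: int) -> int:
    """Smallest gp2 size in GiB whose baseline meets target_iops."""
    if target_iops > GP2_MAX_IOPS:
        raise ValueError("target exceeds the gp2 per-volume maximum")
    if target_iops <= GP2_MIN_IOPS:
        return 1
    return math.ceil(target_iops / GP2_IOPS_PER_GIB)


def gp3_extra_iops(target_iops: int) -> int:
    """IOPS to provision on gp3 beyond the included baseline."""
    return max(0, target_iops - GP3_BASELINE_IOPS)


required = 10_000  # from the exhibit
print(gp2_size_for_iops(required))  # 3334 GiB of gp2 capacity
print(gp3_extra_iops(required))     # 7000 additional gp3 IOPS
```

Under these assumptions, gp2 reaches 10,000 IOPS only by growing the volume to roughly 3.3 TiB, whereas a volume type with independently provisioned IOPS avoids buying that capacity.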

Question 26 (easy, multiple choice)

A customer-facing application has a relational data model and needs frequent complex queries (joins and aggregations), but it also experiences a significant read-heavy workload. Which design choice best improves read performance while keeping relational features?

Question 27 (easy, multiple choice)

An ECS service runs on EC2 capacity. During peak traffic, tasks frequently wait for available container instances. The team wants faster scale-out for the underlying EC2 capacity when tasks increase. What is the best first architectural step?

Question 28 (easy, multiple choice)

A system uses multiple AWS Lambda functions behind different event sources. One Lambda occasionally spikes and causes other Lambdas to be throttled due to shared concurrency limits. Which setting best helps ensure the important Lambda keeps capacity during spikes?

Question 29 (easy, multiple choice)

Based on the exhibit, which Amazon EFS performance mode is the best fit for this workload?

Exhibit

EFS usage summary:
- 25 EC2 workers mounted to one file system
- Mostly small metadata reads and writes
- Each request needs very low file system latency
- No requirement for massive concurrent throughput across thousands of clients

Question 30 (easy, multiple choice)

Your application uses ElastiCache Redis as a cache for user profiles stored in DynamoDB. You must ensure that when a profile is updated, subsequent reads see the latest value quickly. Which cache strategy is generally the best fit for this requirement?

Question 31 (easy, multiple choice)

Based on the exhibit, the team wants to improve application performance without changing the code. Which EC2 instance family should they choose next?

Exhibit

CloudWatch summary for app servers:
- Average CPUUtilization: 24%
- Average MemoryUtilization: 91%
- Average NetworkIn/Out: low
- Current instance type: m6i.large
- User reports: application slows when more sessions are active

Question 32 (easy, multiple choice)

Based on the exhibit, what change best reduces Lambda cold-start impact for a predictable user-upload workflow?

Exhibit

CloudWatch metrics for Lambda function 'image-resize':
- Average Duration: 220 ms
- P95 Init Duration after idle: 1,400 ms
- ConcurrentExecutions: 15 average, 60 during campaign launches
- Throttles: 0
- User complaint: first upload after inactivity feels slow

Question 33 (easy, multiple choice)

A team runs a latency-sensitive service on EC2 and needs consistent, low-latency block storage for a database. The application requires predictable performance and should be fast for random reads/writes. Which EBS volume type is the best choice?

Question 34 (easy, multiple choice)

A new feature stores user events in DynamoDB. Each event must be fetched by user_id and sorted by event_time. The team expects many different users and wants to avoid a single hot partition. Which partition key design is best?

Question 35 (easy, multiple choice)

Based on the exhibit, which AWS feature should the team use to minimize network latency between EC2 instances that exchange messages very frequently?

Exhibit

Application topology:
- 12 EC2 instances in one Region
- Instances process small jobs and send frequent messages to each other
- Observed inter-node latency: 2.8 ms to 4.1 ms
- Requirement: lowest possible latency between application nodes

Question 36 (easy, multiple choice)

A web service runs on an Auto Scaling group (ASG). The team updates configuration (AMIs, environment variables) in a Launch Template and wants new instances created during scale-out to use the latest Launch Template version. What should the architect do?

Question 37 (easy, multiple choice)

A team runs a stateless web app on Amazon EC2 behind an Application Load Balancer. During traffic spikes, new EC2 instances take several minutes to finish bootstrapping before they can receive traffic. Which Auto Scaling configuration most directly reduces the time until additional capacity is available?

Question 38 (hard, matching)

A company runs a stateless application tier behind an Application Load Balancer. Match each observed scaling pattern on the left to the best Auto Scaling strategy or metric on the right.

Matches

- Scale the Auto Scaling group on ALB RequestCountPerTarget.
- Scale on SQS queue depth using a custom CloudWatch metric.
- Use scheduled scaling to add capacity before the recurring surge.
- Use target tracking on EC2 CPUUtilization.

Question 39 (easy, multiple choice)

A compute workload uses temporary scratch space for intermediate results (reproducible), and it can tolerate data loss if the instance is terminated. The workload benefits from very high local I/O throughput. Which storage option is the best fit for the scratch data?

Question 40 (hard, multiple choice)

Based on the exhibit, which change best reduces latency during peak traffic without overprovisioning the fleet?

Exhibit

ALB and ASG snapshot (15-minute peak):
- RequestCountPerTarget: 1,920
- TargetResponseTime p95: 2.9 seconds
- HTTPCode_Target_5XX_Count: 0
EC2 application metrics from CloudWatch agent:
- CPUUtilization: 33%
- MemoryUtilization: 46%
- NetworkIn/Out: steady
Application logs:
[WARN] worker queue depth reached 5,000
[INFO] rejecting requests after thread pool saturation
Current Auto Scaling policy:
- Target tracking on CPUUtilization = 55%
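
The exhibit can be read through a simplified proportional model of target tracking, in which desired capacity scales with the ratio of the observed metric to its target. This is only a rough sketch and not the exact algorithm AWS uses. The fleet size of 10 and the request target of 1,000 per instance are hypothetical values chosen for illustration; only the CPU and request figures come from the exhibit.

```python
import math


def proportional_capacity(current: int, metric: float, target: float) -> int:
    """Capacity that would bring the metric back to its target (simplified)."""
    return max(1, math.ceil(current * metric / target))


FLEET_SIZE = 10         # hypothetical current instance count
REQUEST_TARGET = 1_000  # hypothetical requests-per-target goal

# CPU from the exhibit is 33% against a 55% target: no pressure to scale out.
cpu_based = proportional_capacity(FLEET_SIZE, 33, 55)

# Request count from the exhibit is 1,920 per target: well above the goal.
request_based = proportional_capacity(FLEET_SIZE, 1_920, REQUEST_TARGET)

print(cpu_based)      # 6
print(request_based)  # 20
```

In this model the CPU-based policy would shrink the fleet while requests queue up, whereas a request-based metric tracks the load that the application logs actually show.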

Question 41 (easy, multiple choice)

A retail analytics app uses Amazon RDS for PostgreSQL. Read traffic is growing, and the database CPU spikes mainly due to SELECT-heavy workloads. Writes are less frequent, and the app can tolerate eventually consistent reads for the reports. What is the most appropriate AWS-native way to improve read performance with minimal application changes?

Question 42 (easy, multiple choice)

A media company uses CloudFront in front of an S3 bucket origin for video thumbnails. They want to prevent users from bypassing CloudFront and accessing the S3 bucket directly, while still allowing CloudFront to fetch objects. What is the best option?

Question 43 (hard, multiple choice)

Based on the exhibit, which change will most improve the CloudFront cache hit ratio for the static assets while still serving the same files to all users?

Exhibit

CloudFront behavior summary for path pattern /static/*:
- Allowed methods: GET, HEAD
- Cache policy: forwards all query strings
- Origin request policy: forwards all cookies and the Authorization header
- Average cache hit ratio: 11%
Sample request log lines:
GET /static/app.js?v=18&userId=123 Cookie: session=abcd
GET /static/app.js?v=18&userId=987 Cookie: session=xyzt
GET /static/logo.svg?v=18&locale=en Cookie: session=mnop
Origin responses:
- All objects are identical for every viewer
- Objects are versioned only by the v query parameter
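
The effect of query strings on the cache key can be sketched with the sample request lines from this exhibit. The `cache_key` function below is a hypothetical, simplified model (path plus selected query parameters); a real CloudFront cache key contains more than this, so treat the output as an illustration of the fragmentation only.

```python
from urllib.parse import parse_qsl, urlencode, urlsplit


def cache_key(url: str, allowed_params=None) -> str:
    """Build a simplified cache key from the path and query parameters.

    allowed_params=None keeps every parameter, mirroring "forwards all
    query strings"; a set such as {"v"} keeps only the listed names.
    """
    parts = urlsplit(url)
    params = parse_qsl(parts.query)
    if allowed_params is not None:
        params = [(k, v) for k, v in params if k in allowed_params]
    query = urlencode(sorted(params))
    return f"{parts.path}?{query}" if query else parts.path


SAMPLE_URLS = [
    "/static/app.js?v=18&userId=123",
    "/static/app.js?v=18&userId=987",
    "/static/logo.svg?v=18&locale=en",
]

all_params = {cache_key(u) for u in SAMPLE_URLS}
version_only = {cache_key(u, {"v"}) for u in SAMPLE_URLS}

print(len(all_params))    # 3 distinct keys for 2 distinct objects
print(len(version_only))  # 2 keys, one per object version
```

With every parameter in the key, each `userId` value creates a separate cache entry for the same file; keying on `v` alone collapses them to one entry per object.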

Question 44 (hard, multiple choice)

Based on the exhibit, what is the best change to improve read performance without increasing write latency on the primary database?

Exhibit

Amazon RDS for PostgreSQL metrics during the end-of-day report window:
- CPUUtilization: 24%
- ReadLatency: 118 ms
- WriteLatency: 7 ms
- DiskQueueDepth: 0.4
- FreeStorageSpace: stable
Application notes:
- Report queries are read-only and run for 20 to 30 minutes
- The operational API continues to perform writes during the report window
- Business accepts slightly stale report data if write performance stays unchanged

Question 45 (hard, multiple choice)

Based on the exhibit, which storage design best supports the application servers' shared working directory requirement?

Exhibit

Deployment notes for a media-processing Auto Scaling group:
- 6 EC2 instances across 2 Availability Zones
- Each node compiles project artifacts and writes them to /workspace/output
- Other nodes must immediately see the same files for the next pipeline stage
- Files must persist when an instance is replaced or scaled in/out
- Logs show failures such as:
  [ERROR] missing artifact: /workspace/output/frame_2048.png
  [WARN] local copy not found after instance termination

Question 46 (hard, multiple choice)

Based on the exhibit, which design change is the best way to reduce the observed read latency for this DynamoDB-backed service?

Exhibit

DynamoDB metrics and access pattern:
- Table mode: on-demand
- ConsumedReadCapacityUnits: steady, no throttling overall
- SuccessfulRequestLatency: p95 = 34 ms
- Hot partition key detected: tenant#42 consumes 92% of read traffic during peak
Application notes:
- Requests repeatedly fetch the same dashboard items for up to 60 seconds
- Reads are eventually consistent and the application can tolerate brief cache staleness
- Writes are infrequent and do not dominate the workload

Question 47 (hard, matching)

A media platform serves global users through Amazon CloudFront and an S3 origin. Match each requirement on the left to the CloudFront configuration or behavior on the right.

Matches

- Use CloudFront Origin Access Control and allow only the distribution in the bucket policy.
- Use versioned object filenames or hashed asset names with a long TTL.
- Exclude the tracking query string from the cache key with a cache policy.
- Use CloudFront signed URLs or signed cookies.

Question 48 (hard, multiple choice)

Based on the exhibit, which storage choice best matches the workload requirements?

Exhibit

fio benchmark on the selected EC2 family:
- Device: /dev/nvme1n1
- 4 KiB random read IOPS: 710,000
- Average latency: 0.18 ms
- Sequential throughput: 2.8 GiB/s
Workload notes:
- Workers download source video files from S3
- They generate temporary frame extracts and intermediate artifacts locally
- Final MP4 outputs are uploaded to S3 immediately after processing
- If an instance terminates, the job is retried from the original source file

Question 49 (easy, multiple choice)

A company runs a stateless web API on Amazon EC2 behind an Application Load Balancer. The team notices that during business hours, the ALB starts queueing requests and the average request latency rises. They want to scale out quickly and reliably based on demand, not CPU alone. Which Auto Scaling approach best matches this requirement?

Question 50 (easy, multiple choice)

A company serves mostly static images and JavaScript files from an origin in one AWS Region. They want to reduce origin load and improve global performance. Which change most directly increases cache-hit ratio for static assets while avoiding stale content?

Question 51hardmatching

Read the full NAT/PAT explanation →

A company runs a stateless application tier behind an Application Load Balancer. Match each observed scaling pattern on the left to the best Auto Scaling strategy or metric on the right.

Drag a concept onto its matching description — or click a concept then click the description.

Concepts

Matches

Scale the Auto Scaling group on ALB RequestCountPerTarget.

Scale on SQS queue depth using a custom CloudWatch metric.

Use scheduled scaling to add capacity before the recurring surge.

Use target tracking on EC2 CPUUtilization.

Question 52easymultiple choice

Read the full Design High-Performing Architectures explanation →

A company runs a stateless web API on Amazon EC2 behind an Application Load Balancer. The team notices that during business hours, the ALB starts queueing requests and the average request latency rises. They want to scale out quickly and reliably based on demand, not CPU alone. Which Auto Scaling approach best matches this requirement?

Question 53hardmultiple choice

Read the full Design High-Performing Architectures explanation →

Based on the exhibit, which change best reduces latency during peak traffic without overprovisioning the fleet?

Exhibit

ALB and ASG snapshot (15-minute peak):
- RequestCountPerTarget: 1,920
- TargetResponseTime p95: 2.9 seconds
- HTTPCode_Target_5XX_Count: 0
EC2 application metrics from CloudWatch agent:
- CPUUtilization: 33%
- MemoryUtilization: 46%
- NetworkIn/Out: steady
Application logs:
[WARN] worker queue depth reached 5,000
[INFO] rejecting requests after thread pool saturation
Current Auto Scaling policy:
- Target tracking on CPUUtilization = 55%

Question 54hardmultiple choice

Read the full Design High-Performing Architectures explanation →

Based on the exhibit, which storage choice best matches the workload requirements?

Exhibit

fio benchmark on the selected EC2 family:
- Device: /dev/nvme1n1
- 4 KiB random read IOPS: 710,000
- Average latency: 0.18 ms
- Sequential throughput: 2.8 GiB/s
Workload notes:
- Workers download source video files from S3
- They generate temporary frame extracts and intermediate artifacts locally
- Final MP4 outputs are uploaded to S3 immediately after processing
- If an instance terminates, the job is retried from the original source file

Question 55hardmultiple choice

Read the full Design High-Performing Architectures explanation →

Based on the exhibit, which design change is the best way to reduce the observed read latency for this DynamoDB-backed service?

Exhibit

DynamoDB metrics and access pattern:
- Table mode: on-demand
- ConsumedReadCapacityUnits: steady, no throttling overall
- SuccessfulRequestLatency: p95 = 34 ms
- Hot partition key detected: tenant#42 consumes 92% of read traffic during peak
Application notes:
- Requests repeatedly fetch the same dashboard items for up to 60 seconds
- Reads are eventually consistent and the application can tolerate brief cache staleness
- Writes are infrequent and do not dominate the workload

Question 56hardmultiple choice

Read the full Design High-Performing Architectures explanation →

Based on the exhibit, which change will most improve the CloudFront cache hit ratio for the static assets while still serving the same files to all users?

Exhibit

CloudFront behavior summary for path pattern /static/*:
- Allowed methods: GET, HEAD
- Cache policy: forwards all query strings
- Origin request policy: forwards all cookies and the Authorization header
- Average cache hit ratio: 11%
Sample request log lines:
GET /static/app.js?v=18&userId=123 Cookie: session=abcd
GET /static/app.js?v=18&userId=987 Cookie: session=xyzt
GET /static/logo.svg?v=18&locale=en Cookie: session=mnop
Origin responses:
- All objects are identical for every viewer
- Objects are versioned only by the v query parameter

Question 57hardmatching

Read the full Design High-Performing Architectures explanation →

A media platform serves global users through Amazon CloudFront and an S3 origin. Match each requirement on the left to the CloudFront configuration or behavior on the right.

Drag a concept onto its matching description — or click a concept then click the description.

Concepts

Matches

Use CloudFront Origin Access Control and allow only the distribution in the bucket policy.

Use versioned object filenames or hashed asset names with a long TTL.

Exclude the tracking query string from the cache key with a cache policy.

Use CloudFront signed URLs or signed cookies.

Question 58easymultiple choice

Read the full NAT/PAT explanation →

A retail analytics app uses Amazon RDS for PostgreSQL. Read traffic is growing, and the database CPU spikes mainly due to SELECT-heavy workloads. Writes are less frequent, and the app can tolerate eventually consistent reads for the reports. What is the most appropriate AWS-native way to improve read performance with minimal application changes?

Question 59hardmultiple choice

Read the full Design High-Performing Architectures explanation →

Based on the exhibit, what is the best change to improve read performance without increasing write latency on the primary database?

Exhibit

Amazon RDS for PostgreSQL metrics during the end-of-day report window:
- CPUUtilization: 24%
- ReadLatency: 118 ms
- WriteLatency: 7 ms
- DiskQueueDepth: 0.4
- FreeStorageSpace: stable
Application notes:
- Report queries are read-only and run for 20 to 30 minutes
- The operational API continues to perform writes during the report window
- Business accepts slightly stale report data if write performance stays unchanged

Question 60easymultiple choice

Read the full Design High-Performing Architectures explanation →

A company serves mostly static images and JavaScript files from an origin in one AWS Region. They want to reduce origin load and improve global performance. Which change most directly increases cache-hit ratio for static assets while avoiding stale content?

Question 61easymultiple choice

Read the full Design High-Performing Architectures explanation →

A team runs a stateless web app on Amazon EC2 behind an Application Load Balancer. During traffic spikes, new EC2 instances take several minutes to finish bootstrapping before they can receive traffic. Which Auto Scaling configuration most directly reduces the time until additional capacity is available?

Question 62hardmultiple choice

Read the full Design High-Performing Architectures explanation →

Based on the exhibit, which storage design best supports the application servers' shared working directory requirement?

Exhibit

Deployment notes for a media-processing Auto Scaling group:
- 6 EC2 instances across 2 Availability Zones
- Each node compiles project artifacts and writes them to /workspace/output
- Other nodes must immediately see the same files for the next pipeline stage
- Files must persist when an instance is replaced or scaled in/out
- Logs show failures such as:
  [ERROR] missing artifact: /workspace/output/frame_2048.png
  [WARN] local copy not found after instance termination

Question 63easymultiple choice

Read the full NAT/PAT explanation →

A compute workload uses temporary scratch space for intermediate results (reproducible), and it can tolerate data loss if the instance is terminated. The workload benefits from very high local I/O throughput. Which storage option is the best fit for the scratch data?

Question 64easymultiple choice

Read the full Design High-Performing Architectures explanation →

A media company uses CloudFront in front of an S3 bucket origin for video thumbnails. They want to prevent users from bypassing CloudFront and accessing the S3 bucket directly, while still allowing CloudFront to fetch objects. What is the best option?

Question 65easymultiple choice

Read the full Design High-Performing Architectures explanation →

Based on the exhibit, what change best reduces Lambda cold-start impact for a predictable user-upload workflow?

Exhibit

CloudWatch metrics for Lambda function 'image-resize':
- Average Duration: 220 ms
- P95 Init Duration after idle: 1,400 ms
- ConcurrentExecutions: 15 average, 60 during campaign launches
- Throttles: 0
- User complaint: first upload after inactivity feels slow

Question 66easymultiple choice

Read the full Design High-Performing Architectures explanation →

Based on the exhibit, the team wants to improve application performance without changing the code. Which EC2 instance family should they choose next?

Exhibit

CloudWatch summary for app servers:
- Average CPUUtilization: 24%
- Average MemoryUtilization: 91%
- Average NetworkIn/Out: low
- Current instance type: m6i.large
- User reports: application slows when more sessions are active

Question 67easymultiple choice

Read the full Design High-Performing Architectures explanation →

A system uses multiple AWS Lambda functions behind different event sources. One Lambda occasionally spikes and causes other Lambdas to be throttled due to shared concurrency limits. Which setting best helps ensure the important Lambda keeps capacity during spikes?

Question 68 (easy, multiple choice)

Based on the exhibit, which Amazon EFS performance mode is the best fit for this workload?

Exhibit

EFS usage summary:
- 25 EC2 workers mounted to one file system
- Mostly small metadata reads and writes
- Each request needs very low file system latency
- No requirement for massive concurrent throughput across thousands of clients

Question 69 (easy, multiple choice)

A web service runs on an Auto Scaling group (ASG). The team updates configuration (AMIs, environment variables) in a Launch Template and wants new instances created during scale-out to use the latest Launch Template version. What should the architect do?

Question 70 (easy, multiple choice)

Your application uses ElastiCache Redis as a cache for user profiles stored in DynamoDB. You must ensure that when a profile is updated, subsequent reads see the latest value quickly. Which cache strategy is generally the best fit for this requirement?

Question 71 (easy, multiple choice)

Based on the exhibit, which AWS feature should the team use to minimize network latency between EC2 instances that exchange messages very frequently?

Exhibit

Application topology:
- 12 EC2 instances in one Region
- Instances process small jobs and send frequent messages to each other
- Observed inter-node latency: 2.8 ms to 4.1 ms
- Requirement: lowest possible latency between application nodes

Question 72 (easy, multiple choice)

A team runs a latency-sensitive service on EC2 and needs consistent, low-latency block storage for a database. The application requires predictable performance and should be fast for random reads/writes. Which EBS volume type is the best choice?

Question 73 (easy, multiple choice)

A customer-facing application has a relational data model and needs frequent complex queries (joins and aggregations), but it also experiences a significant read-heavy workload. Which design choice best improves read performance while keeping relational features?

Question 74 (easy, multiple choice)

A new feature stores user events in DynamoDB. Each event must be fetched by user_id and sorted by event_time. The team expects many different users and wants to avoid a single hot partition. Which partition key design is best?

Question 75 (easy, multiple choice)

Based on the exhibit, which EBS volume type should the team use to meet the performance need at lower cost than overprovisioning capacity?

Exhibit

Database storage review:
- Current volume type: gp2
- Peak Read/Write IOPS observed: 9,700
- VolumeQueueLength increases during busy periods
- ReadLatency reaches 8-12 ms
- Requirement: provision about 10,000 IOPS without buying much extra capacity

Question 76 (easy, multiple choice)

An ECS service runs on EC2 capacity. During peak traffic, tasks frequently wait for available container instances. The team wants faster scale-out for the underlying EC2 capacity when tasks increase. What is the best first architectural step?

Question 77 (medium, multiple choice)

A web application uses an Amazon Aurora DB cluster for a read-heavy workload. The application team needs higher read throughput but cannot change the database schema. They want to avoid blocking writes and are willing to route read traffic separately. What is the most appropriate architecture change?

Question 78 (medium, multiple choice)

A team serves image files from S3 through CloudFront. During a performance review, they notice that CloudFront cache hit ratio is low and the S3 origin receives many repeated requests for the same images. Request URLs include a volatile query parameter called 'sessionId' that changes for each user, but the image content is identical regardless of 'sessionId'. What configuration change will most effectively increase cache hit ratio?

Question 79 (medium, multiple choice)

A trading analytics system deploys 10 EC2 instances that exchange very frequent, low-latency messages over the network. The instances must be placed as close together as possible to minimize network hop count and inter-node jitter. Which deployment choice best matches this requirement?

Question 80 (medium, multiple choice)

Your company currently uses an Application Load Balancer (ALB) in front of a service that receives a large number of TCP and UDP packets (including UDP-based telemetry). During load tests, you need to support both TCP and UDP traffic at high throughput while keeping stable IP endpoints for a downstream firewall allowlist. Which change best meets these requirements?

Question 81 (medium, multiple choice)

A DynamoDB-backed multi-tenant app experiences throttling. Most write traffic for tenant 'ACME' targets a single logical stream of events (items for ACME are written in near real time). The table currently uses partition key = tenantId and sort key = eventTimestamp. CloudWatch shows partition-level throttling concentrated in the ACME partition. What design change most directly improves write throughput for the hottest tenant while still enabling efficient queries for that tenant's recent events?

Question 82 (medium, multiple choice)

A serverless API built with AWS Lambda serves latency-sensitive requests. The team observes intermittent slow responses during traffic ramp-ups and expects some users to hit the API immediately after a period of inactivity. Which configuration best reduces cold-start latency during these ramp-ups?

Question 83 (medium, multiple choice)

A media processing pipeline uses EBS-backed storage for an application that performs sustained random I/O with low latency requirements. During peak processing windows, the team sees increased read latency and occasional timeouts at the application layer. They need predictable, high IOPS performance rather than best-effort throughput. Which EBS configuration choice is most appropriate?

Question 84 (medium, multiple choice)

Your mobile app writes events to a single DynamoDB table with partition key = customerId and sort key = eventTime. During a promotional campaign, one tenant ("ACME") generates far more traffic than others. CloudWatch shows sustained throttling (ProvisionedThroughputExceeded) and elevated p99 latency only for that tenant. The workload pattern cannot be changed to a completely different schema, but you can change how items are partitioned. Which design change is most likely to reduce the hot-partition throttling while keeping efficient reads for ACME?

Question 85 (medium, multiple choice)

A marketing team uses CloudFront with an S3 origin to serve a single-page web app. After a release, the CloudFront cache hit ratio dropped sharply. The app requests the same static JS and CSS assets, but each request includes a tracking query parameter whose value varies (for example, ?utm_source=campaign123 or ?utm_source=campaign456). You want CloudFront to cache those assets efficiently even when the tracking query parameter changes. What should you do?

Question 86 (medium, multiple choice)

You run a web application on an EC2 Auto Scaling group behind an Application Load Balancer (ALB). During scheduled traffic spikes, new instances launch but customers occasionally see 5xx errors for the first few minutes after scale-out. Operational logs show instances need about 4 minutes to warm up (load caches and initialize dependencies). The ALB marks targets healthy only after this warm-up. Which change most directly improves performance during spikes by reducing the time to serve traffic after scaling?

Question 87 (easy, multiple choice)

A data processing application runs on a single EC2 instance and needs persistent block storage with sustained low-latency random read/write performance (high IOPS). Which storage choice is most appropriate?

Question 88 (easy, multiple choice)

A trading analytics system deploys multiple EC2 instances that exchange very frequent, low-latency, east-west messages. The application team wants the instances to be placed to minimize network latency and variability. Which AWS feature should they use?

Question 89 (easy, multiple choice)

Your team serves static JavaScript and CSS files from an S3 origin through CloudFront. After a release, the CloudFront cache hit ratio dropped because clients keep re-downloading the same assets. What is the best next change to improve caching performance?

Question 90 (easy, multiple choice)

A DynamoDB-backed multi-tenant app experiences throttling during a promotion. Most writes and reads target tenant "ACME" and use the same partition key value, causing a hot partition. Which design change most directly improves performance?

Question 91 (easy, multiple choice)

A service performs many repeated read requests for the same DynamoDB items. The reads are latency-sensitive, but the application can tolerate slightly stale data. Which AWS service is the best fit to reduce read latency?

Question 92 (easy, multiple choice)

An application uses an Amazon Aurora cluster. The workload becomes read-heavy, but the team cannot change the database schema. They need higher read throughput while keeping writes on the primary. What should they do?

Question 93 (easy, multiple choice)

A latency-sensitive API is implemented with AWS Lambda. During traffic ramp-ups, users sometimes experience slow responses due to cold starts. The team wants to ensure fast initialization for a baseline level of concurrent requests. Which AWS feature should they use?

Question 94 (medium, multiple choice)

A distributed system needs extremely low network latency between a set of EC2 instances running the same workload. The team wants the instances to be placed as close together as AWS allows to reduce round-trip time. Which placement strategy should the architect use?

Question 95 (medium, multiple choice)

A web API runs on an Auto Scaling group (ASG) behind an Application Load Balancer (ALB). During traffic spikes, users experience request timeouts even though CPU stays below 40%. After investigation, you find the ASG often has too few healthy targets to handle the current request rate. Which change will best improve responsiveness during spikes?

Question 96 (medium, multiple choice)

A DynamoDB-backed event processing system experiences throttling during a promotion. All events are written and read using the same partition key value (tenantId = "ACME"). The workload is time-ordered per tenant, and the application can tolerate slight reordering across partitions. Which design change will most directly increase throughput and reduce hot-partition throttling?

Question 97 (medium, multiple choice)

A team serves static assets from an S3 origin through CloudFront. Cache hit ratio is low. Analytics show that requests include an Authorization header (even though the assets are public) and the cache key currently varies on that header, causing CloudFront to treat the same asset as different cache entries. What is the best change to improve cache hit ratio without breaking access controls?

Question 98 (medium, multiple choice)

A containerized service fleet running on EC2 instances needs to share user-uploaded files and access them with low latency. The workload is bursty: sometimes dozens of instances concurrently read the same directory for short periods, and then traffic drops. Which Amazon EFS configuration best matches these performance needs?

Question 99 (medium, multiple choice)

A media platform runs a CPU-heavy thumbnail generation workload on an EC2 Auto Scaling group using t3.large instances. During peak traffic, p95 processing time increases significantly even though average CPU remains around 40–50%. CloudWatch also shows the instances' CPU credit balance being depleted. Which change will most directly improve performance predictability for this workload?

Question 100 (medium, multiple choice)

Your company needs a high-throughput, low-latency TCP service using a custom binary protocol. Requirements: preserve the original client source IP for rate limiting, keep latency minimal, and use TCP health checks. The current setup uses an Application Load Balancer and performance is inconsistent. Which load balancer choice best meets these requirements?

Question 101 (medium, multiple choice)

A site serves static assets (JS/CSS) through CloudFront from an S3 origin. After a recent frontend change, CloudFront shows a cache hit ratio below 20%. In CloudFront access logs, requests to the same asset URL path differ by a query parameter named rnd (a random value appended by the app on every request). The origin content is identical regardless of rnd. What is the best CloudFront configuration change to restore effective caching?

Question 102 (medium, multiple choice)

An event ingestion service writes to a DynamoDB table where the partition key is tenantId and the sort key is eventTime. During a campaign, one tenant generates a disproportionate share of traffic, causing write throttling and increased latency for that tenant’s writes. You can change the data model and application queries, but you must still efficiently retrieve events for a tenant for the last 10 minutes. Which change best improves write throughput by reducing hot partitions?

Question 103 (hard, multi select)

A media company serves versioned JavaScript and CSS from an S3 origin through CloudFront. After a release, the cache hit ratio drops because the SPA sends an Authorization header and several tracking query strings on every request, even though the assets are public and identical for all users. Which changes would most improve cache efficiency without changing the content returned? Select three.

Question 104 (hard, multi select)

An event-ingestion application writes telemetry to DynamoDB with partition key tenantId and sort key eventTime. During a promotion, one tenant generates 10 times the normal traffic. Dashboards repeatedly query the most recent items for that tenant, and they can tolerate slightly stale data. Which changes would most effectively reduce throttling and improve responsiveness? Select three.

Question 105 (hard, multi select)

A rendering service runs on a single EC2 instance and writes a large working set of metadata to disk using sustained random reads and writes. The data must persist across stops and restarts, and the team sees queue depth spikes when the job reaches peak throughput. Which changes should the team make? Select three.

Question 106 (hard, multi select)

A static website stores assets in S3 and is delivered through CloudFront. Analytics show low cache hit ratio, many origin fetches for the same JavaScript bundles, and elevated S3 GET request costs. Most requests include unnecessary cookies, and the text assets are uncompressed. Which changes should the team make? Select three.

Question 107 (hard, multi select)

A latency-sensitive telemetry service uses a custom TCP protocol on EC2 instances in private subnets. The service must preserve the client source IP for rate limiting, avoid HTTP header inspection, and keep per-request overhead as low as possible. Which changes should the team make? Select three.

Question 108 (hard, multi select)

A customer portal uses Amazon Aurora MySQL. The application currently sends all SELECT queries to the writer instance endpoint. During traffic spikes, read latency increases, and the team wants the cluster to survive a writer failover without manual endpoint changes for the application. Which changes should the team make? Select three.

Question 109 (hard, multi select)

A serverless checkout API runs on AWS Lambda behind API Gateway. Traffic spikes are predictable every weekday at 09:00 UTC, and p95 latency jumps for the first few minutes after each deployment because execution environments are cold. The team wants to reduce this startup impact without changing the API contract. Which changes should they make? Select three.

Question 110 (hard, multi select)

A low-latency market-data engine runs 10 EC2 instances that exchange small messages thousands of times per second. The team wants the lowest possible network latency and jitter, and they can tolerate single-AZ placement for this tier because another layer handles disaster recovery. Which changes should they make? Select three.

Question 111 (easy, multiple choice)

Multiple EC2 instances need a shared filesystem so they can concurrently read and write the same files (for example, user uploads and rendered assets). The instances are in different Availability Zones and must mount the filesystem using NFS. Which AWS storage service best fits?

Question 112 (easy, multiple choice)

A latency-sensitive trading workload runs on 6 EC2 instances. You must distribute the instances so they do NOT share the same underlying hardware rack, reducing the risk of correlated rack-level faults. Which EC2 placement group strategy best meets this requirement?

Question 113 (easy, multiple choice)

A team wants to run containerized services with AWS-managed orchestration and autoscaling. They do NOT require Kubernetes compatibility. Which AWS service choice is most appropriate to meet these goals?

Question 114 (easy, multiple choice)

Your web application runs on EC2 instances behind an Application Load Balancer (ALB). During traffic spikes, p95 response time increases, but average CPU utilization remains below 40%. The current Auto Scaling policy scales on average CPU utilization. What should you change to improve performance during spikes?

Question 115 (easy, multiple choice)

A company serves public JavaScript and CSS files from S3 using CloudFront. After a frontend change, customers report a low CloudFront cache hit ratio. Requests now include an Authorization header, but these assets do not require authentication. The CloudFront distribution is configured such that Authorization is included in the cache key. Which change best maximizes cache reuse?

Question 116 (easy, multiple choice)

An application repeatedly reads the same DynamoDB items with very low latency requirements. The application can tolerate slightly stale data (for example, within a few seconds). You want to improve read latency without changing the existing DynamoDB table schema. Which service is the best choice?

Question 117 (easy, multiple choice)

A web application uses an Amazon Aurora DB cluster. The workload is becoming read-heavy, and the application team wants to increase read throughput without changing the database schema. They can adjust the application to route reads differently. What should they do?

Question 118 (medium, multi select)

A multi-tenant event system writes and reads data in DynamoDB. One tenant generates most of the traffic, causing throttling on a single partition key value, and the dashboards repeatedly read the most recent items for that tenant. Which two changes should the team make to improve performance? Select two.

Question 119 (medium, multi select)

A distributed analytics platform runs on 12 EC2 instances in one Availability Zone. The nodes exchange a very high volume of east-west messages and the team wants the lowest possible network latency between instances. Which two changes should the architect make first? Select two.

Question 120 (medium, multi select)

A marketing site serves versioned JavaScript and CSS from an Amazon S3 origin through Amazon CloudFront. After each release, the cache hit ratio drops sharply because clients keep sending request headers and query strings that are not needed for asset retrieval. Which two changes should improve cache efficiency the most? Select two.

Question 121 (medium, multi select)

A web application uses an Amazon Aurora DB cluster for a read-heavy workload. The team wants to increase read throughput without changing the database schema or rewriting application data access patterns. Which two changes should they make? Select two.

Question 122 (medium, multi select)

A CPU-bound batch rendering service runs on EC2. The application is Linux-based, compatible with ARM64, and the team wants the best throughput per dollar without changing the workload's architecture. Which two instance-family choices should the team consider first? Select two.

Question 123 (medium, multi select)

A single EC2 instance hosts a low-latency database cache that writes a large random working set to block storage. The application needs sustained high IOPS and low latency, and the storage must remain attached to the instance while it runs. Which two design choices best meet the requirement? Select two.

Question 124 (hard, multiple choice)

Based on the exhibit, a media company serves versioned JavaScript and CSS files from an Amazon S3 origin through CloudFront. After a frontend release, the cache hit ratio dropped sharply even though the file names are versioned. The application team says the browser requests include the same Authorization header on every asset request because the frontend and API share one domain. What should the solutions architect do to improve CloudFront cache hit ratio without changing the application authentication model for the API?

Exhibit

CloudFront access log sample:
2026-04-18T09:12:41Z LAX1 1234 Miss GET d111111abcdef8.cloudfront.net /app/v42/main.8f3d2.js 200 - Mozilla/5.0 Authorization=Bearer eyJhbGciOi...
2026-04-18T09:12:42Z LAX1 1235 Miss GET d111111abcdef8.cloudfront.net /app/v42/vendor.9c1a0.css 200 - Mozilla/5.0 Authorization=Bearer eyJhbGciOi...

Distribution behavior summary:
- Origin: S3 bucket
- Cache policy: legacy default
- Origin request policy: forwards all headers, cookies, and query strings
- Objects are immutable after release and have content-hash file names

Question 125 (hard, multiple choice)

Based on the exhibit, a retail analytics service repeatedly reads the same DynamoDB items during an active campaign. The business can tolerate data that is a few seconds stale, but the application must minimize latency and reduce pressure on DynamoDB. A load test shows that 80% of reads target only 200 item keys. What should the solutions architect implement?

Exhibit

Load-test observations:
- DynamoDB table type: on-demand
- Primary access pattern: GetItem for 200 hot keys
- p95 latency without cache: 17-24 ms
- p95 latency under burst: 31 ms and rising
- Sample application note: "A few seconds of staleness is acceptable for dashboards and recommendations"
- CloudWatch: ConsumedReadCapacityUnits spikes during refresh cycles

Question 126 (hard, multiple choice)

Based on the exhibit, an application runs on Amazon Aurora MySQL. The writer instance is frequently near 85% CPU while the reader instance is under 20% CPU. Application traces show that most of the database traffic is read-only SELECT queries, but the code currently sends all queries to the writer endpoint. What should the solutions architect recommend to improve performance with the smallest functional change?

Exhibit

Aurora cluster summary:
- 1 writer instance: db.r6g.large
- 2 reader instances: db.r6g.large
- Writer CPU avg: 82% / p95 91%
- Reader CPU avg: 18% / p95 26%
- Database connections: 480 total, all established to the cluster writer endpoint
- Query sample: 72% SELECT, 22% INSERT/UPDATE, 6% administrative queries

Question 127 (hard, multiple choice)

Based on the exhibit, a serverless checkout API is implemented in AWS Lambda and deployed in one Region. The function has a cold-start time of 700-900 ms on the first request after idle periods. Marketing launches a predictable traffic spike every weekday at 09:00 UTC, and the p95 latency target is under 150 ms during the first five minutes of the spike. What should the solutions architect do to meet the latency target while controlling cost?

Exhibit

Lambda logs:
REPORT RequestId: 9d6b... Duration: 184.27 ms Billed Duration: 185 ms Memory Size: 1024 MB Max Memory Used: 612 MB Init Duration: 812.43 ms

Traffic pattern:
- Low traffic outside weekdays 09:00-09:15 UTC
- Predictable spike every weekday
- Function language: Python 3.12
- No need to keep spare capacity all day

Question 128 (hard, multiple choice)

Based on the exhibit, a trading platform exposes a custom binary TCP protocol to partner systems. The service must preserve the original client source IP for rate limiting, support TLS pass-through to the application, and minimize network latency. The team also wants a simple architecture that can scale across multiple Availability Zones. What load balancing option should the solutions architect choose?

Exhibit

Protocol and traffic notes:
- Transport: TCP over port 9000
- Payload: custom binary messages, not HTTP
- Requirement: preserve source IP address at the target
- Requirement: minimize latency and jitter
- Targets: EC2 instances in private subnets across 3 AZs
- Current proxy layer adds ~12 ms overhead and breaks client IP logging

Question 129 (hard, multiple choice)

Based on the exhibit, a media rendering job runs on a single EC2 instance and writes a large working set of metadata to block storage. The workload performs sustained random reads and writes and must keep latency consistently low for the entire run. The instance may be stopped and started between jobs, and the data must persist. Which storage choice best meets the requirements?

Exhibit

fio benchmark from the current volume:
- 4 KiB random read IOPS target: 22,000
- 4 KiB random write IOPS target: 18,000
- 99th percentile latency target: < 2 ms
- Current volume: gp3, 12,000 provisioned IOPS
- Observed latency during peak: 3.8-5.4 ms
- Data must remain attached to one EC2 instance and persist after stop/start

Question 130 (hard, multiple choice)

Based on the exhibit, a low-latency analytics platform runs 10 EC2 instances in the same Availability Zone. The nodes exchange a very high volume of east-west messages and must experience the lowest possible network latency and jitter. A separate operations team also wants to reduce the risk that all nodes land on the same physical hardware rack. Which placement strategy should the solutions architect use?

Exhibit

Network test results:
- Average node-to-node RTT: 180-240 microseconds
- Jitter spikes during busy periods: up to 4 ms
- Workload type: cluster-style analytics with frequent small messages
- Requirement: lowest possible latency among peers
- Deployment note: all 10 instances are currently in separate subnets within one AZ

Question 131 (hard, multiple choice)

Based on the exhibit, a batch-processing service runs on Amazon EC2. The workload is Linux-based, can run on ARM64, and is CPU-bound during its nightly processing window. The team wants the best throughput per dollar without changing the application logic. Which EC2 instance family should the solutions architect recommend?

Exhibit

Benchmark summary from current fleet:
- Current instances: c6i.2xlarge
- Average CPU during processing: 88%-96%
- Disk and network utilization remain below 30%
- Application runtime on test ARM build: 11% faster than x86 build
- Engineering note: binaries are already compatible with ARM64
- Business goal: lower cost while keeping or improving throughput

Question 132 (hard, multiple choice)

Based on the exhibit, a web application runs on an Amazon EC2 Auto Scaling group behind an Application Load Balancer. During traffic surges, the average CPU utilization stays below 35%, but request latency increases sharply and the ALB access logs show far more requests per target than expected. Which change is the best way to improve scaling behavior?

Exhibit

CloudWatch metrics for the Auto Scaling group (5-minute period):
- CPUUtilization: 28% average
- NetworkIn: 190 MB/min average, no saturation
- GroupDesiredCapacity: 4
- ALBRequestCountPerTarget: 4,800 during peaks
- TargetResponseTime p95: 2.7 seconds during peaks

ALB access log sample:
2026-04-28T09:02:11Z app/prod-alb 203.0.113.10:443 10.0.1.21:8080 0.000 2.698 0.000 200 200 1843 1920 "GET https://app.example.com/search?q=aws HTTP/1.1"

Question 133 (hard, multiple choice)

Based on the exhibit, a DynamoDB-backed event processing system is throttling during a promotion. The table uses tenantId as the partition key and eventTime as the sort key. One tenant accounts for most of the write traffic, and the application must preserve fast lookups for that tenant without relying on a single hot partition. What change is the best fix?

Exhibit

Table schema:
- TableName: EventStore
- PartitionKey: tenantId (String)
- SortKey: eventTime (Number)

CloudWatch metrics during promotion:
- WriteThrottleEvents: increasing steadily
- ConsumedWriteCapacityUnits: near provisioned limit
- SuccessfulRequestLatency p95: 14 ms

Sample traffic distribution:
- tenantId=ACME: 82% of writes, 79% of reads
- all other tenants combined: 18% of writes, 21% of reads

Application note:
- Queries must continue to support tenant-scoped lookups by time range.
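
One common way to spread a dominant tenant across partitions is to append a calculated shard suffix to the partition key and fan reads out over all suffixes. The sketch below is illustrative only: the shard count, the `event_id` input, and the `tenantId#shard` key layout are assumptions, not part of the exhibit.

```python
import hashlib

SHARD_COUNT = 8  # hypothetical; size it from the measured write rate


def sharded_partition_key(tenant_id: str, event_id: str, shard_count: int = SHARD_COUNT) -> str:
    """Derive a stable shard suffix from the event ID so writes spread evenly."""
    digest = hashlib.sha256(event_id.encode("utf-8")).digest()
    shard = int.from_bytes(digest[:4], "big") % shard_count
    return f"{tenant_id}#{shard}"


def partition_keys_for_tenant(tenant_id: str, shard_count: int = SHARD_COUNT) -> list[str]:
    """List every shard key to query when reading one tenant's time range."""
    return [f"{tenant_id}#{shard}" for shard in range(shard_count)]


key = sharded_partition_key("ACME", "evt-0001")
print(key in partition_keys_for_tenant("ACME"))  # prints True
```

A tenant-scoped time-range lookup then issues one Query per shard key, each with the same eventTime range, and merges the results in the application.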

Question 134hardmultiple choice

Based on the exhibit, an Amazon Aurora MySQL application is read-heavy, but the database writer is nearing CPU limits while the reader instance is mostly idle. The application currently sends all queries to the writer endpoint. Which change should you make first to increase read throughput?

Exhibit

Aurora cluster configuration:
- 1 writer instance
- 1 reader instance
- Application JDBC string: jdbc:mysql://cluster-writer.endpoint.example.com:3306/orders

CloudWatch metrics (peak hour):
- DBWriterCPUUtilization: 84%
- DBReaderCPUUtilization: 17%
- DatabaseConnections: steady
- ReadLatency p95: 38 ms
- WriteLatency p95: 9 ms

Application trace sample:
SELECT order_id, status, total FROM orders WHERE customer_id=? ORDER BY created_at DESC LIMIT 20
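
Read/write splitting at the application layer can be sketched as a small routing function. The writer host name comes from the exhibit; the reader host name below is a hypothetical placeholder, and the statement classification is deliberately simplistic.

```python
WRITER_ENDPOINT = "cluster-writer.endpoint.example.com"  # from the exhibit
READER_ENDPOINT = "cluster-reader.endpoint.example.com"  # hypothetical name

READ_ONLY_PREFIXES = ("SELECT", "SHOW", "EXPLAIN")


def endpoint_for(sql: str, in_transaction: bool = False) -> str:
    """Send plain reads to the reader endpoint and everything else to the writer."""
    statement = sql.strip().upper()
    is_read = statement.startswith(READ_ONLY_PREFIXES) and "FOR UPDATE" not in statement
    if is_read and not in_transaction:
        return READER_ENDPOINT
    return WRITER_ENDPOINT


trace = "SELECT order_id, status, total FROM orders WHERE customer_id=? ORDER BY created_at DESC LIMIT 20"
print(endpoint_for(trace))
```

Reads that must observe a write made moments earlier may still need the writer, because replicas can lag slightly behind it.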

Question 135hardmultiple choice

Based on the exhibit, a static asset distribution site uses Amazon CloudFront with an S3 origin. The assets are versioned by filename, but the cache hit ratio remains low after each release. Which CloudFront change is the best way to improve cache reuse without changing the origin objects?

Exhibit

CloudFront distribution settings excerpt:
- Cache policy: custom
- Headers included in cache key: Authorization, CloudFront-Viewer-Country
- Query strings included in cache key: all
- Cookies included in cache key: none

Origin request sample:
GET /app.8f3a2c1.js?v=20260428 HTTP/1.1
Host: d123.cloudfront.net
Authorization: Bearer eyJhbGciOi...
User-Agent: Mozilla/5.0

CloudFront analytics:
- CacheHitRate: 18%
- OriginFetches: spike immediately after each deploy
- Origin bytes out: high for unchanged JS and CSS files
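
To see why the settings in the exhibit fragment the cache, consider a simplified model of a cache key. This is not CloudFront's actual algorithm; the function, its parameters, and the sample token values are assumptions made for illustration.

```python
from urllib.parse import urlsplit


def cache_key(url, headers, key_headers=(), key_query="none"):
    """Build a simplified cache key from the path plus whatever the policy includes."""
    parts = urlsplit(url)
    key = [parts.path]
    if key_query == "all":
        key.append(parts.query)
    lowered = {name.lower(): value for name, value in headers.items()}
    for name in key_headers:
        key.append((name.lower(), lowered.get(name.lower())))
    return tuple(key)


URL = "https://d123.cloudfront.net/app.8f3a2c1.js?v=20260428"
viewer_a = {"Authorization": "Bearer token-a", "CloudFront-Viewer-Country": "US"}
viewer_b = {"Authorization": "Bearer token-b", "CloudFront-Viewer-Country": "DE"}
exhibit_policy = {"key_headers": ("Authorization", "CloudFront-Viewer-Country"), "key_query": "all"}

# Same file, two viewers: distinct keys under the exhibit's policy, one key with path only.
print(cache_key(URL, viewer_a, **exhibit_policy) == cache_key(URL, viewer_b, **exhibit_policy))
print(cache_key(URL, viewer_a) == cache_key(URL, viewer_b))
```

Under this model every distinct token, country, or query string produces a separate cached copy of the same versioned file.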

Question 136hardmultiple choice

Based on the exhibit, an application repeatedly reads the same DynamoDB items with extremely low latency requirements. The business can tolerate data that is a few seconds stale. Which architecture change best improves read performance?

Exhibit

DynamoDB access pattern report:
- TableName: SessionState
- Read pattern: GetItem on the same 500 keys during active sessions
- Read frequency: 1.2 million reads/minute during peak periods
- Cacheability: yes, stale data up to 5 seconds is acceptable

CloudWatch metrics:
- ConsumedReadCapacityUnits: 92% of provisioned limit
- SuccessfulRequestLatency p95: 7.5 ms
- ThrottledRequests: intermittent during peaks

Application note:
- Writes are comparatively rare and do not need multi-Region replication.
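
The access pattern in the exhibit (a small hot key set, with up to 5 seconds of staleness allowed) is what a read-through cache with a time-to-live exploits. The in-process sketch below only illustrates the idea; a managed in-memory cache in front of the table would apply it as a shared service. The `loader` callable is a hypothetical stand-in for a DynamoDB GetItem call.

```python
import time


class TtlReadThroughCache:
    """Serve repeated reads from memory and reload an entry once its TTL expires."""

    def __init__(self, loader, ttl_seconds=5.0, clock=time.monotonic):
        self._loader = loader      # hypothetical stand-in for a GetItem call
        self._ttl = ttl_seconds    # 5 seconds matches the staleness bound in the exhibit
        self._clock = clock
        self._entries = {}         # key -> (expires_at, value)

    def get(self, key):
        now = self._clock()
        entry = self._entries.get(key)
        if entry is not None and entry[0] > now:
            return entry[1]        # cache hit: no read against the table
        value = self._loader(key)  # cache miss or expired entry: read through
        self._entries[key] = (now + self._ttl, value)
        return value


table_reads = []
cache = TtlReadThroughCache(lambda key: table_reads.append(key) or {"sessionId": key})
for _ in range(1000):
    cache.get("session-42")
print(len(table_reads))
```

With 500 hot keys and a 5-second TTL, each cache instance reads the table at most about 100 times per second in steady state, regardless of how many client reads it serves.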

Question 137hardmultiple choice

Based on the exhibit, a serverless API on AWS Lambda experiences a predictable cold-start penalty every weekday at 09:00 UTC when a marketing campaign begins. The team wants the first requests to stay fast while minimizing extra cost during quiet periods. What is the best approach?

Exhibit

Lambda monitoring and deployment notes:
- Function: checkout-api-prod
- Current alias: live
- Invocations per day: low except weekdays 09:00-09:15 UTC
- REPORT log sample at 09:00 UTC:
  Init Duration: 842.31 ms
  Duration: 128.42 ms
  Billed Duration: 1000 ms
- REPORT log sample at 09:05 UTC:
  Init Duration: 0.00 ms
  Duration: 121.77 ms

Traffic pattern:
- Spikes are predictable and last about 15 minutes
- No need to keep high concurrency all day
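
A predictable 15-minute window lends itself to scheduled scaling of provisioned concurrency on the alias. The sketch below only builds request parameters in the shape accepted by the Application Auto Scaling `PutScheduledAction` API; the action names, the capacity of 50, and the exact warm-up and scale-down times are assumptions, and the function and alias names come from the exhibit.

```python
def scheduled_action(name, schedule, capacity,
                     function_name="checkout-api-prod", alias="live"):
    """Build PutScheduledAction parameters for Lambda provisioned concurrency."""
    return {
        "ServiceNamespace": "lambda",
        "ScheduledActionName": name,
        "ResourceId": f"function:{function_name}:{alias}",
        "ScalableDimension": "lambda:function:ProvisionedConcurrency",
        "Schedule": schedule,  # cron fields: minute hour day-of-month month day-of-week year (UTC)
        "ScalableTargetAction": {"MinCapacity": capacity, "MaxCapacity": capacity},
    }


# Hypothetical values: warm 50 environments ten minutes before the 09:00 UTC surge,
# then release them shortly after the 15-minute window ends.
WARM_UP = scheduled_action("weekday-warm-up", "cron(50 8 ? * MON-FRI *)", 50)
SCALE_DOWN = scheduled_action("weekday-scale-down", "cron(20 9 ? * MON-FRI *)", 0)

print(WARM_UP["ResourceId"], WARM_UP["Schedule"])
```

Each dictionary could be passed as keyword arguments to a boto3 `application-autoscaling` client's `put_scheduled_action` call once the alias has been registered as a scalable target.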

Question 138hardmultiple choice

Based on the exhibit, a distributed analytics workload runs on 12 EC2 instances in one Availability Zone. The nodes exchange thousands of small messages per second and require the lowest possible intra-cluster latency and jitter. Which EC2 placement strategy is the best fit?

Exhibit

Topology notes:
- 12 x Amazon EC2 c6i.large instances
- All instances run in us-east-1a
- Current network path between nodes averages 0.9 ms and occasionally spikes above 2 ms
- Workload logs: "gossip sync lag detected" and "broadcast step exceeded SLA"
- Requirement: minimize latency and jitter between nodes, not maximize fault isolation

Question 139hardmultiple choice

Based on the exhibit, a single EC2 instance hosts a latency-sensitive cache that performs sustained random reads and writes to persistent block storage. The current EBS volume is a general-purpose SSD, but BurstBalance is repeatedly depleted and p95 I/O latency has risen above 20 ms. The workload needs more than 16,000 sustained IOPS. Which change is the best fix?

Exhibit

Amazon CloudWatch metrics for the instance volume:
- VolumeType: gp2
- VolumeSize: 1 TiB
- ReadOps: 9,000-11,000 sustained
- WriteOps: 8,000-10,000 sustained
- BurstBalance: 0% for long periods
- VolumeQueueLength: elevated during peak use
- VolumeReadLatency p95: 23 ms
- VolumeWriteLatency p95: 19 ms

Application note:
- The working set is random and latency-sensitive
- The storage requirement is persistent block storage attached to one instance
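
The gap between what the volume can sustain and what the workload asks for can be checked with simple arithmetic. The sketch below uses the published gp2 baseline rule (3 IOPS per GiB, with a floor of 100 and a ceiling of 16,000) and the operation counts from the exhibit; it assumes the ReadOps and WriteOps figures are per-second rates that add up to the total demand.

```python
def gp2_baseline_iops(size_gib: int) -> int:
    """gp2 baseline: 3 IOPS per GiB, at least 100 and at most 16,000."""
    return max(100, min(16_000, 3 * size_gib))


baseline = gp2_baseline_iops(1024)   # 1 TiB volume from the exhibit
demand_low = 9_000 + 8_000           # low end of sustained reads plus writes
demand_high = 11_000 + 10_000        # high end of sustained reads plus writes

print(baseline, demand_low, demand_high)  # prints 3072 17000 21000
```

Even the largest gp2 volume tops out at 16,000 IOPS, below the low end of the demand, so resizing the existing volume type cannot close the gap.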

Question 140hardmulti select

A media company serves versioned JavaScript and CSS files from Amazon S3 through CloudFront. After each release, the cache hit ratio drops sharply because the same distribution also fronts a personalized API path, and the current cache policy forwards cookies, all query strings, and several headers to every origin request. The static assets already use content-hashed filenames. Which two changes will most directly improve cache hit ratio for the static assets without changing the application behavior? Select two.

Question 141hardmulti select

A retail analytics table stores events in Amazon DynamoDB with partition key tenantId and sort key eventTime. During a promotion, one tenant generates most writes and repeatedly polls the same latest-status items, causing throttling on a single partition key and high latency on reads. The business can tolerate read results that are a few seconds stale. Which two changes will most effectively reduce throttling and latency? Select two.

Question 142hardmulti select

A distributed analytics engine runs 12 EC2 instances in one Availability Zone. The nodes exchange thousands of tiny messages per second and must keep jitter as low as possible. The current design launches the instances across multiple placement groups and uses general-purpose burstable instances. Which two changes will most directly lower east-west network latency and variability? Select two.

Question 143hardmulti select

A serverless checkout API uses AWS Lambda behind API Gateway. Every weekday at 09:00 UTC, marketing triggers a predictable surge. The first few minutes after each surge show cold-start latency, but traffic volume is forecastable and the business wants stable p95 latency. Which two changes should the team implement? Select two.

Question 144hardmulti select

A partner integration sends a custom binary TCP protocol to a service running on EC2 instances in private subnets. The partners require static endpoint IPs for allowlisting, and the application must see the original client source IP for rate limiting. Which two changes best fit the protocol and network requirements? Select two.

Question 145hardmulti select

An application uses Amazon Aurora MySQL. CloudWatch shows the writer instance near 85% CPU while the only reader instance averages 15% CPU. Trace logs show that all SELECT statements still target the writer endpoint. The workload is read-heavy, and the application already tolerates eventual consistency for reads. Which two changes will best increase total read throughput without a schema redesign? Select two.

Question 146hardmulti select

Multiple EC2 instances in different Availability Zones need concurrent read/write access to the same shared files. The files are actively modified by several application servers, and low-latency metadata operations matter more than extremely high aggregate throughput. Which two changes should the team make? Select two.

Question 147hardmulti select

A nightly video rendering pipeline runs on Linux EC2 instances and is compatible with ARM64. The jobs are CPU-bound, checkpoint frequently, and can resume if interrupted. The business wants the best throughput per dollar for the batch window. Which two changes should the team make? Select two.

Question 148hardmulti select

A media company serves versioned JavaScript and CSS files from an Amazon S3 origin through CloudFront. After each release, origin requests spike even though the files are public. Browser requests include a tracking cookie, an Authorization header, and a cache-busting query string that the site no longer needs. Which three changes will most improve the CloudFront cache hit ratio without exposing private content? Select three.

Question 149mediummultiple choice

A global video platform serves mostly static images and JavaScript files from an S3 origin. Users in distant countries report slow load times. What should improve performance most?

Question 150hardmultiple choice

A DynamoDB table for a retail API has a partition key based only on the current date. Write throttling occurs during business hours. What is the best design change?

Question 151mediummultiple choice

A read-heavy document portal repeatedly queries the same product catalogue data from DynamoDB with millisecond latency requirements. Which service can reduce read latency and table load?

Question 152mediummultiple choice

An analytics dashboard uses RDS MySQL and receives many read-only reporting queries that slow down the primary database. What should the architect add?

Question 153hardmulti select

A latency-sensitive mobile game backend uploads large files to S3 from users around the world. Which two features can improve upload performance?

Question 154hardmultiple choice

A Lambda-based travel booking site has unpredictable traffic spikes and users see latency caused by cold starts. The function must respond consistently during expected campaign windows. What should be configured?

Question 155mediummultiple choice

A media archive requires consistent high IOPS for a transactional database on EC2. Which EBS volume type is most suitable?

Question 156mediummultiple choice

A telemetry pipeline uses an Application Load Balancer in one Region. Global users need lower network latency to the application without caching dynamic responses. What should be considered?

Question 157mediummultiple choice

A video platform uses Amazon Aurora. The workload has many short-lived database connections from Lambda functions, causing connection storms. What should be added?

Question 158easymultiple choice

A retail API uses EC2 instances behind an ALB. CPU is consistently high during peak traffic, and request latency rises. What should be configured?

Question 159hardmultiple choice

A document portal needs low-latency full-text search across product descriptions and filtered attributes. Which managed service is most suitable?

Question 160mediummultiple choice

A high-volume analytics dashboard writes streaming click events that must be processed by multiple independent consumers. Which service is most appropriate?

Question 161mediummultiple choice

A global mobile game backend serves mostly static images and JavaScript files from an S3 origin. Users in distant countries report slow load times. What should improve performance most?

Question 162hardmultiple choice

A DynamoDB table for a travel booking site has a partition key based only on the current date. Write throttling occurs during business hours. What is the best design change?

Question 163mediummultiple choice

A read-heavy media archive repeatedly queries the same product catalogue data from DynamoDB with millisecond latency requirements. Which service can reduce read latency and table load?

Question 164mediummultiple choice

A telemetry pipeline uses RDS MySQL and receives many read-only reporting queries that slow down the primary database. What should the architect add?

Question 165hardmulti select

A latency-sensitive video platform uploads large files to S3 from users around the world. Which two features can improve upload performance?

Question 166hardmultiple choice

A Lambda-based retail API has unpredictable traffic spikes and users see latency caused by cold starts. The function must respond consistently during expected campaign windows. What should be configured?

Question 167mediummultiple choice

A document portal requires consistent high IOPS for a transactional database on EC2. Which EBS volume type is most suitable?

Question 168mediummultiple choice

An analytics dashboard uses an Application Load Balancer in one Region. Global users need lower network latency to the application without caching dynamic responses. What should be considered?

Question 169mediummultiple choice

A mobile game backend uses Amazon Aurora. The workload has many short-lived database connections from Lambda functions, causing connection storms. What should be added?

Question 170easymultiple choice

A travel booking site uses EC2 instances behind an ALB. CPU is consistently high during peak traffic, and request latency rises. What should be configured?

Question 171hardmultiple choice

A media archive needs low-latency full-text search across product descriptions and filtered attributes. Which managed service is most suitable?

Question 172mediummultiple choice

A high-volume telemetry pipeline writes streaming click events that must be processed by multiple independent consumers. Which service is most appropriate?

Question 173mediummultiple choice

A global video platform serves mostly static images and JavaScript files from an S3 origin. Users in distant countries report slow load times. What should improve performance most? The design must avoid adding custom operational scripts.

Question 174hardmultiple choice

A DynamoDB table for a retail API has a partition key based only on the current date. Write throttling occurs during business hours. What is the best design change? The design must avoid adding custom operational scripts.

Question 175mediummultiple choice

A read-heavy document portal repeatedly queries the same product catalogue data from DynamoDB with millisecond latency requirements. Which service can reduce read latency and table load? The design must avoid adding custom operational scripts.

Question 176mediummultiple choice

An analytics dashboard uses RDS MySQL and receives many read-only reporting queries that slow down the primary database. What should the architect add? The design must avoid adding custom operational scripts.

Question 177hardmulti select

A latency-sensitive mobile game backend uploads large files to S3 from users around the world. Which two features can improve upload performance? The design must avoid adding custom operational scripts.

Question 178hardmultiple choice

A Lambda-based travel booking site has unpredictable traffic spikes and users see latency caused by cold starts. The function must respond consistently during expected campaign windows. What should be configured? The design must avoid adding custom operational scripts.

Question 179mediummultiple choice

A media archive requires consistent high IOPS for a transactional database on EC2. Which EBS volume type is most suitable? The design must avoid adding custom operational scripts.

Question 180mediummultiple choice

A telemetry pipeline uses an Application Load Balancer in one Region. Global users need lower network latency to the application without caching dynamic responses. What should be considered? The design must avoid adding custom operational scripts.

Question 181mediummultiple choice

A video platform uses Amazon Aurora. The workload has many short-lived database connections from Lambda functions, causing connection storms. What should be added? The design must avoid adding custom operational scripts.

Question 182easymultiple choice

A retail API uses EC2 instances behind an ALB. CPU is consistently high during peak traffic, and request latency rises. What should be configured? The design must avoid adding custom operational scripts.

Question 183hardmultiple choice

A document portal needs low-latency full-text search across product descriptions and filtered attributes. Which managed service is most suitable? The design must avoid adding custom operational scripts.

Question 184mediummultiple choice

A high-volume analytics dashboard writes streaming click events that must be processed by multiple independent consumers. Which service is most appropriate? The design must avoid adding custom operational scripts.

Question 185mediummultiple choice

A global mobile game backend serves mostly static images and JavaScript files from an S3 origin. Users in distant countries report slow load times. What should improve performance most? The design must avoid adding custom operational scripts.

Question 186hardmultiple choice

A DynamoDB table for a travel booking site has a partition key based only on the current date. Write throttling occurs during business hours. What is the best design change? The design must avoid adding custom operational scripts.

Question 187mediummultiple choice

A read-heavy media archive repeatedly queries the same product catalogue data from DynamoDB with millisecond latency requirements. Which service can reduce read latency and table load? The design must avoid adding custom operational scripts.

Question 188mediummultiple choice

A telemetry pipeline uses RDS MySQL and receives many read-only reporting queries that slow down the primary database. What should the architect add? The design must avoid adding custom operational scripts.

Question 189hardmulti select

A latency-sensitive video platform uploads large files to S3 from users around the world. Which two features can improve upload performance? The design must avoid adding custom operational scripts.

Question 190hardmultiple choice

A Lambda-based retail API has unpredictable traffic spikes and users see latency caused by cold starts. The function must respond consistently during expected campaign windows. What should be configured? The design must avoid adding custom operational scripts.

Question 191mediummultiple choice

A document portal requires consistent high IOPS for a transactional database on EC2. Which EBS volume type is most suitable? The design must avoid adding custom operational scripts.

Question 192mediummultiple choice

An analytics dashboard uses an Application Load Balancer in one Region. Global users need lower network latency to the application without caching dynamic responses. What should be considered? The design must avoid adding custom operational scripts.

Question 193mediummultiple choice

A mobile game backend uses Amazon Aurora. The workload has many short-lived database connections from Lambda functions, causing connection storms. What should be added? The design must avoid adding custom operational scripts.

Question 194easymultiple choice

A travel booking site uses EC2 instances behind an ALB. CPU is consistently high during peak traffic, and request latency rises. What should be configured? The design must avoid adding custom operational scripts.

Question 195hardmultiple choice

A media archive needs low-latency full-text search across product descriptions and filtered attributes. Which managed service is most suitable? The design must avoid adding custom operational scripts.

Question 196mediummultiple choice

A high-volume telemetry pipeline writes streaming click events that must be processed by multiple independent consumers. Which service is most appropriate? The design must avoid adding custom operational scripts.

Question 197mediummultiple choice

A global video platform serves mostly static images and JavaScript files from an S3 origin. Users in distant countries report slow load times. What should improve performance most? The architecture review board prefers a managed AWS-native control.

Question 198hardmultiple choice

A DynamoDB table for a retail API has a partition key based only on the current date. Write throttling occurs during business hours. What is the best design change? The architecture review board prefers a managed AWS-native control.

Question 199mediummultiple choice

A read-heavy document portal repeatedly queries the same product catalogue data from DynamoDB with millisecond latency requirements. Which service can reduce read latency and table load? The architecture review board prefers a managed AWS-native control.

Question 200mediummultiple choice

An analytics dashboard uses RDS MySQL and receives many read-only reporting queries that slow down the primary database. What should the architect add? The architecture review board prefers a managed AWS-native control.

Question 201hardmulti select

A latency-sensitive mobile game backend uploads large files to S3 from users around the world. Which two features can improve upload performance? The architecture review board prefers a managed AWS-native control.

Question 202hardmultiple choice

A Lambda-based travel booking site has unpredictable traffic spikes and users see latency caused by cold starts. The function must respond consistently during expected campaign windows. What should be configured? The architecture review board prefers a managed AWS-native control.

Question 203mediummultiple choice

A media archive requires consistent high IOPS for a transactional database on EC2. Which EBS volume type is most suitable? The architecture review board prefers a managed AWS-native control.

Question 204mediummultiple choice

A telemetry pipeline uses an Application Load Balancer in one Region. Global users need lower network latency to the application without caching dynamic responses. What should be considered? The architecture review board prefers a managed AWS-native control.

Question 205mediummultiple choice

A video platform uses Amazon Aurora. The workload has many short-lived database connections from Lambda functions, causing connection storms. What should be added? The architecture review board prefers a managed AWS-native control.

Question 206easymultiple choice

A retail API uses EC2 instances behind an ALB. CPU is consistently high during peak traffic, and request latency rises. What should be configured? The architecture review board prefers a managed AWS-native control.

Question 207hardmultiple choice

A document portal needs low-latency full-text search across product descriptions and filtered attributes. Which managed service is most suitable? The architecture review board prefers a managed AWS-native control.

Question 208mediummultiple choice

A high-volume analytics dashboard writes streaming click events that must be processed by multiple independent consumers. Which service is most appropriate? The architecture review board prefers a managed AWS-native control.

Question 209mediummultiple choice

A global mobile game backend serves mostly static images and JavaScript files from an S3 origin. Users in distant countries report slow load times. What should improve performance most? The architecture review board prefers a managed AWS-native control.

Question 210hardmultiple choice

A DynamoDB table for a travel booking site has a partition key based only on the current date. Write throttling occurs during business hours. What is the best design change? The architecture review board prefers a managed AWS-native control.

Question 211mediummultiple choice

A read-heavy media archive repeatedly queries the same product catalogue data from DynamoDB with millisecond latency requirements. Which service can reduce read latency and table load? The architecture review board prefers a managed AWS-native control.

Question 212mediummultiple choice

A telemetry pipeline uses RDS MySQL and receives many read-only reporting queries that slow down the primary database. What should the architect add? The architecture review board prefers a managed AWS-native control.

Question 213hardmulti select

A latency-sensitive video platform uploads large files to S3 from users around the world. Which two features can improve upload performance? The architecture review board prefers a managed AWS-native control.

Question 214hardmultiple choice

A Lambda-based retail API has unpredictable traffic spikes and users see latency caused by cold starts. The function must respond consistently during expected campaign windows. What should be configured? The architecture review board prefers a managed AWS-native control.

Question 215mediummultiple choice

A document portal requires consistent high IOPS for a transactional database on EC2. Which EBS volume type is most suitable? The architecture review board prefers a managed AWS-native control.

Question 216mediummultiple choice

An analytics dashboard uses an Application Load Balancer in one Region. Global users need lower network latency to the application without caching dynamic responses. What should be considered? The architecture review board prefers a managed AWS-native control.

Question 217mediummultiple choice

A mobile game backend uses Amazon Aurora. The workload has many short-lived database connections from Lambda functions, causing connection storms. What should be added? The architecture review board prefers a managed AWS-native control.

Question 218easymultiple choice

A travel booking site uses EC2 instances behind an ALB. CPU is consistently high during peak traffic, and request latency rises. What should be configured? The architecture review board prefers a managed AWS-native control.

Question 219hardmultiple choice

A media archive needs low-latency full-text search across product descriptions and filtered attributes. Which managed service is most suitable? The architecture review board prefers a managed AWS-native control.

Question 220mediummultiple choice

A high-volume telemetry pipeline writes streaming click events that must be processed by multiple independent consumers. Which service is most appropriate? The architecture review board prefers a managed AWS-native control.

Question 221mediummultiple choice

A global video platform serves mostly static images and JavaScript files from an S3 origin. Users in distant countries report slow load times. What should improve performance most? The team wants the control to be enforceable during normal operations.

Question 222hardmultiple choice

A DynamoDB table for a retail API has a partition key based only on the current date. Write throttling occurs during business hours. What is the best design change? The team wants the control to be enforceable during normal operations.

Question 223mediummultiple choice

A read-heavy document portal repeatedly queries the same product catalogue data from DynamoDB with millisecond latency requirements. Which service can reduce read latency and table load? The team wants the control to be enforceable during normal operations.

Question 224: medium, multiple choice

An analytics dashboard uses RDS MySQL and receives many read-only reporting queries that slow down the primary database. What should the architect add? The team wants the control to be enforceable during normal operations.

Question 225: hard, multi select

A latency-sensitive mobile game backend uploads large files to S3 from users around the world. Which two features can improve upload performance? The team wants the control to be enforceable during normal operations.

Question 226: hard, multiple choice

A Lambda-based travel booking site has unpredictable traffic spikes and users see latency caused by cold starts. The function must respond consistently during expected campaign windows. What should be configured? The team wants the control to be enforceable during normal operations.

Question 227: medium, multiple choice

A media archive requires consistent high IOPS for a transactional database on EC2. Which EBS volume type is most suitable? The team wants the control to be enforceable during normal operations.

Question 228: medium, multiple choice

A company needs to implement session management for a web application. Sessions must persist across multiple EC2 instances, survive EC2 failures, and be accessible with sub-millisecond latency. Sessions must also be sortable by last-access time to expire the oldest sessions first. Which caching solution should a solutions architect recommend?

Question 229: medium, multiple choice

A company needs to replicate a DynamoDB table to three AWS regions so that users in each region can read and write to a local copy with the lowest possible latency. Changes must propagate to all regions within seconds. Which solution should a solutions architect implement?

Question 230: medium, multiple choice

A company is deploying a high-performance computing (HPC) cluster with 16 EC2 instances. The workload requires the lowest possible network latency and highest throughput between all nodes for tightly coupled parallel MPI computations. Which EC2 placement group type should a solutions architect recommend?

Question 231: medium, multi select

A media company is designing a high-performance architecture to serve video content to users worldwide. The solution must minimize latency for end users and reduce the load on the origin servers. The video files are stored in an Amazon S3 bucket. Which three options should be combined to meet these requirements? (Choose three.)

Question 232: medium, multi select

A financial services application requires high-performance read access to a time-series dataset that is frequently updated with new records. The workload is write-heavy during market hours and read-heavy for reporting. The solution must support strong consistency and low-latency queries on a single key. Which three AWS services or features should be used together to meet these requirements? (Choose three.)

Question 233: medium, multi select

A company is designing a high-performance architecture for a real-time analytics platform that ingests millions of events per second. The events must be processed with minimal latency and then stored for long-term analysis. Which three services should be combined to build this architecture? (Choose three.)

Question 234: medium, multi select

A company is designing a high-performance web application that serves static and dynamic content to a global user base. The application runs on Amazon EC2 instances behind an Application Load Balancer (ALB). The static assets are stored in an S3 bucket. Which three architecture decisions will improve performance and reduce latency for users? (Choose three.)

Question 235: medium, multi select

A DevOps team is designing a high-performance CI/CD pipeline to build and test code changes. The pipeline needs to scale to handle hundreds of concurrent builds, with fast build times and minimal idle compute cost. The builds are containerized and require consistent, reproducible environments. Which three options should be used to meet these requirements? (Choose three.)

Question 236: medium, multi select

A company is designing a high-performance database architecture for an e-commerce platform that experiences rapid spikes in read traffic during flash sales. The database must handle millions of reads per second with sub-millisecond latency. The data is key-value in nature, with a small number of attributes per item. Which three options should be included in the architecture? (Choose three.)

Question 237: medium, drag order

Arrange the steps to migrate an on-premises database to Amazon RDS using AWS DMS.

Question 238: medium, drag order

Arrange the steps to troubleshoot an EC2 instance that is unreachable via SSH.
