Knowledge + Practice

CCNA Monitoring and Troubleshooting Questions

75 of 300 questions · Page 3/4 · Monitoring and Troubleshooting · Answers revealed

Practice these questions Domain overview All questions

151

MCQhard

Refer to the exhibit. An IAM policy attached to an IAM role used by an AWS Lambda function. The Lambda function is invoked by Amazon API Gateway and reads from a DynamoDB table. The function is experiencing permission errors when calling DynamoDB GetItem. What is the most likely cause?

A.The IAM role trust policy does not allow Lambda to assume the role.

B.The policy does not grant access to DynamoDB indexes if the function uses a query on a Global Secondary Index.

C.The dynamodb:GetItem action is not listed in the policy.

D.The Lambda function ARN in the resource field is incorrect.

AnswerB

Reading from a GSI requires permission on the index ARN.

Why this answer

Option B is correct. The policy grants GetItem and Query on the table ARN, but it does not include the 'dynamodb:GetItem' action? Wait, it does include it. Actually, the correct answer is that the policy is missing the 'dynamodb:GetItem' action? No, it's there.

The issue is that the function is using a conditional write? Hmm. Let's analyze: The policy allows GetItem and Query on the table. That should work.

But perhaps the function is trying to access a specific index? The policy does not include access to indexes. Option B says missing index ARN. That is plausible.

Option A is wrong because the Lambda resource is fine. Option C is wrong because GetItem is included. Option D is wrong because the role trust policy is not shown.

Practice this question →

152

Matchingmedium

Match each backup/restore concept to its AWS database feature.

Drag a concept onto its matching description — or click a concept then click the description.

Concepts

Matches

Daily snapshot and transaction log backups enabled by default

User-initiated snapshot stored until explicitly deleted

Restore to any second within the backup retention period

Copy snapshots to another AWS region for disaster recovery

Rewind an Aurora DB cluster to a specific time without restoring

Why these pairings

Backup and recovery options for different RPO/RTO requirements.

Practice this question →

153

MCQeasy

A company is running a production Amazon DynamoDB table with on-demand capacity. The application is experiencing increased latency and throttled requests during peak hours. Which monitoring tool should the database specialist use to identify the specific partition keys causing the throttling?

A.Amazon CloudWatch Contributor Insights for DynamoDB

B.Amazon Inspector

C.AWS Config

D.AWS CloudTrail logs

AnswerA

Contributor Insights analyzes access patterns and identifies throttled partition keys.

Why this answer

Amazon CloudWatch Contributor Insights for DynamoDB is the correct tool because it analyzes DynamoDB request logs to identify the most frequently accessed partition keys, including those causing throttling. It provides top-N keys by request count or throttled events, enabling the database specialist to pinpoint hot partitions responsible for increased latency and throttled requests during peak hours.

Exam trap

The trap here is that candidates often confuse CloudTrail (which logs all API calls) with Contributor Insights, assuming CloudTrail can provide per-key throttling data, but CloudTrail lacks the aggregation and top-N analysis needed to identify specific hot partition keys.

How to eliminate wrong answers

Option B (Amazon Inspector) is wrong because it is a vulnerability management service that assesses network and application security, not a tool for analyzing DynamoDB partition key access patterns or throttling. Option C (AWS Config) is wrong because it evaluates resource configurations and compliance rules, not real-time operational metrics like request throttling per partition key. Option D (AWS CloudTrail logs) is wrong because it records API calls for auditing and governance, but does not provide per-partition-key throttling details or aggregated access patterns needed to identify hot partitions.

Practice this question →

154

MCQeasy

Refer to the exhibit. A developer is trying to query the ProductCatalog table using the 'Id' attribute. The query returns no results even though the developer knows data was inserted. What is the MOST likely cause?

A.The table contains no items

B.The provisioned throughput is exceeded

C.The attribute definition is missing the 'Id' attribute

D.The table is not in ACTIVE status

AnswerA

ItemCount is 0, so the table is empty.

Why this answer

Option D is correct because the table has no items (ItemCount: 0). Option A is wrong because the table is active. Option B is wrong because provisioned throughput is not exceeded (ItemCount is 0).

Option C is wrong because the attribute definition exists.

Practice this question →

155

Multi-Selecthard

Which THREE steps should be taken to troubleshoot an Amazon DynamoDB table that is experiencing high read latency?

Select 3 answers

A.Review the table's partition distribution using DynamoDB metrics.

B.Increase the write capacity of the table.

C.Monitor the ConsumedReadCapacityUnits metric.

D.Disable auto scaling to prevent unexpected capacity changes.

E.Check the ThrottledReadEvents metric to see if reads are being throttled.

AnswersA, C, E

Uneven partition distribution can cause hot partitions and high latency.

Why this answer

Option A is correct because high read latency in DynamoDB can result from uneven partition distribution, where a 'hot' partition receives more read requests than others, causing throttling or increased latency. By reviewing the table's partition distribution using CloudWatch metrics like `ConsumedReadCapacityUnits` per partition, you can identify skewed access patterns and address them with strategies like partition key redesign or adaptive capacity.

Exam trap

The trap here is that candidates may confuse write capacity adjustments with read performance fixes, or assume disabling auto scaling is a troubleshooting step, when in fact auto scaling is a best practice for maintaining consistent throughput.

Practice this question →

156

MCQmedium

A company uses Amazon DynamoDB to store sensor data from IoT devices. The table has a partition key of DeviceID (String) and a sort key of Timestamp (Number). The table is configured with provisioned capacity of 1000 read capacity units (RCUs) and 500 write capacity units (WCUs). Auto Scaling is enabled with target utilization of 70% and is working correctly. Recently, a new batch job was introduced that performs Scan operations on the entire table every hour. The Scan consumes many read capacity units and causes throttling of the sensor write requests. The team wants to minimize the impact on the write-heavy sensor ingestion. Which approach is BEST?

A.Increase the provisioned write capacity to 2000 WCUs to handle bursts.

B.Schedule the Scan to run during off-peak hours when sensor writes are lower.

C.Enable DAX (DynamoDB Accelerator) to cache Scan results and reduce read consumption on the table.

D.Switch the table to on-demand capacity mode to eliminate throttling.

AnswerC

DAX caches Scan results, reducing the number of read capacity units consumed from the table, thus freeing capacity for writes.

Why this answer

Option C is correct because enabling DynamoDB Accelerator (DAX) provides an in-memory cache for reads, which reduces the read load on the table itself. Since the Scan still consumes read capacity, caching can absorb some of the read traffic and free up RCUs for other operations, but more importantly, the team should also consider other measures. However, among the options, DAX is the best to reduce read load on the table.

Option A is incorrect because increasing write capacity does not address the read throttling; it just increases cost. Option B is incorrect because reducing Scan frequency may not be acceptable for the batch job. Option D is incorrect because changing to on-demand may eliminate throttling but will be more expensive for this predictable workload and does not leverage existing provisioned capacity.

Practice this question →

157

MCQmedium

A company is running a MongoDB-compatible Amazon DocumentDB cluster. The application team reports that write operations are failing intermittently with a `WriteConcernError` indicating that the write concern could not be satisfied. The cluster has one primary and two replicas. What is the MOST likely cause of this issue?

A.One of the replicas is down or experiencing high replication lag

B.The cluster does not have enough replica instances to satisfy the write concern

C.The application is using an incorrect read preference

D.The primary instance is overloaded with read requests

AnswerA

If a replica is down, the write concern 'majority' cannot be satisfied because only the primary and one replica are available, but majority may require two replicas depending on configuration.

Why this answer

Option D is correct because if the write concern is set to 'majority' and a replica is down or lagging, the write cannot be acknowledged. Option A is wrong because the primary is not overloaded; the issue is write concern. Option B is wrong because read preference does not affect writes.

Option C is wrong because the number of replicas is sufficient for majority if all are healthy.

Practice this question →

158

Multi-Selecthard

A company is migrating an on-premises SQL Server database to Amazon RDS for SQL Server. During the migration, the DBA notices that the network latency between the source and target is high. Which THREE actions can the DBA take to reduce the impact of network latency on migration performance?

Select 3 answers

A.Increase the size of the RDS instance

B.Use multiple parallel tasks in AWS DMS

C.Use AWS Direct Connect for a dedicated network connection

D.Enable data compression for the migration task

E.Use AWS Site-to-Site VPN

AnswersB, C, D

Parallel tasks improve throughput, mitigating latency effects.

Why this answer

Options A, C, and D are correct. Using AWS Direct Connect provides a dedicated network connection, reducing latency. Compressing data reduces the amount of data transferred, mitigating latency impact.

Using multiple threads parallelizes the data transfer, improving throughput. B is wrong because increasing the instance size does not reduce network latency. E is wrong because using a VPN may introduce additional overhead.

Practice this question →

159

MCQhard

A company is using Amazon DynamoDB with global tables to support a multi-region application. They notice that writes to the table in us-east-1 are not replicating to the table in eu-west-1. The replication lag metric shows increasing values. What is the most likely cause?

A.The table in eu-west-1 has a lower write capacity and is throttling incoming writes.

B.DynamoDB Streams is disabled on the table in us-east-1.

C.The DynamoDB table in us-east-1 is not accessible from eu-west-1 due to VPC restrictions.

D.The table in eu-west-1 has been removed as a replica region from the global table configuration.

AnswerD

If the replica region is removed, replication stops.

Why this answer

Option C is correct because DynamoDB global tables replicate writes asynchronously; if the replica table in eu-west-1 is not properly configured as a global table replica, replication will fail. Option A is incorrect because global tables do not require VPC endpoints. Option B is incorrect because auto scaling does not affect replication.

Option D is incorrect because DynamoDB Streams are used for global tables replication, but disabling them would stop replication, not cause lag.

Practice this question →

160

MCQhard

A company runs a critical application on Amazon RDS for PostgreSQL with Multi-AZ deployment. The database is a db.r5.xlarge instance with 500 GB of Provisioned IOPS (io1) storage. The application writes a large volume of time-series data every 5 minutes. Recently, the team noticed that the replica lag between the primary and standby instance has increased to over 10 seconds during peak write periods. The application requires that the standby be within 5 seconds of the primary for disaster recovery purposes. The team has already verified that network latency between AZs is minimal, and the standby instance is the same size as the primary. The CloudWatch metric 'TransactionLogsDiskUsage' shows that the primary is generating logs at a high rate. Which combination of actions should the database administrator take to reduce the replica lag? (Choose the best option.)

A.Disable synchronous commit on the primary to reduce log generation

B.Change the standby instance to a larger instance type (e.g., db.r5.2xlarge) to improve apply throughput

C.Reduce the batch write frequency to every 10 minutes to lower the log generation rate

D.Increase the Provisioned IOPS on both the primary and standby instances to improve log write throughput

AnswerD

Higher IOPS reduces the time to flush transaction logs, reducing replica lag.

Why this answer

Option B is correct because increasing the storage IOPS on both instances reduces the time to write transaction logs to disk, which is the bottleneck. The primary generates logs, and the standby applies them; both need fast I/O. Option A is insufficient because disabling synchronous commit might cause data loss.

Option C is incorrect because the standby is already the same size; increasing its size alone may not help if the bottleneck is I/O. Option D is incorrect because reducing the batch size reduces write volume, affecting application performance.

Practice this question →

161

MCQmedium

A company is running an Amazon Aurora MySQL-Compatible DB cluster. The application is experiencing increased latency during peak hours. The DB cluster's CPU utilization is at 40%, but the ReadReplicaLag metric is high. Which of the following is the most likely cause?

A.The writer instance is experiencing I/O bottlenecks, causing slow replication

B.The read replica is receiving too many write requests

C.The DB cluster parameter group is misconfigured

D.The binary log retention period is too long

AnswerA

I/O bottlenecks on the writer can delay writing to the binary log, increasing replica lag.

Why this answer

Option A is correct because high replica lag indicates the replica is falling behind, often due to insufficient I/O capacity on the writer or replica. Option B is wrong because read replicas handle SELECT queries. Option C is wrong because binary log retention does not affect replica lag directly.

Option D is wrong because the DB cluster parameter group is shared.

Practice this question →

162

MCQeasy

A company uses Amazon DynamoDB with auto scaling enabled. During a sales event, the write capacity consumption increases, but the table does not scale up as expected, resulting in throttled requests. The table has read/write capacity mode set to 'Provisioned' with auto scaling configured. What should the team check first to troubleshoot the issue?

A.Check the target utilization percentage in the auto scaling policy

B.Verify that the auto scaling role has the necessary IAM permissions

C.Check whether the table class is DynamoDB Standard-IA

D.Check if a global secondary index (GSI) has its own write capacity that is throttling

AnswerA

If target utilization is high (e.g., 90%), scaling may not trigger until sustained high usage.

Why this answer

Option B is correct because auto scaling uses a target utilization percentage; if it's set too high, it may not trigger scaling soon enough. Option A is incorrect because auto scaling controls the provisioned capacity, not the table class. Option C is incorrect because auto scaling does not require manual approval.

Option D is incorrect because auto scaling can increase capacity up to the maximum limit, regardless of the GSI.

Practice this question →

163

MCQmedium

A company is using Amazon ElastiCache for Redis as a caching layer. The application is experiencing higher latency than expected. The team suspects that cache evictions are occurring due to memory pressure. Which ElastiCache metric should be monitored to confirm this?

A.CurrConnections

B.Evictions

C.ReplicationLag

D.CacheHits

AnswerB

Evictions metric directly counts the number of keys evicted due to memory pressure.

Why this answer

Option D is correct because `Evictions` metric shows the number of evicted keys due to memory limit. Option A is wrong because `CacheHits` shows successful reads. Option B is wrong because `CurrConnections` shows current connections.

Option C is wrong because `ReplicationLag` shows replication delay, not evictions.

Practice this question →

164

Multi-Selectmedium

A database specialist is monitoring an Amazon RDS for MySQL instance and notices that the 'CPUUtilization' metric is consistently above 90%. Which TWO actions should the specialist take to investigate the issue? (Select TWO.)

Select 2 answers

A.Disable the MySQL query cache to free up CPU.

B.Use Performance Insights to identify the queries that are consuming the most CPU.

C.Increase the instance class to provide more CPU resources.

D.Enable Multi-AZ to distribute the load to a standby instance.

E.Enable Enhanced Monitoring to get OS-level metrics for the RDS instance.

AnswersB, E

Performance Insights shows top queries by load.

Why this answer

Option A is correct because Performance Insights can identify high-load queries. Option B is correct because Enhanced Monitoring provides OS-level metrics like CPU, memory, and disk I/O. Option C is wrong because increasing instance class is a solution, not an investigation step.

Option D is wrong because enabling Multi-AZ does not help investigate CPU usage. Option E is wrong because disabling query cache is not a diagnostic action.

Practice this question →

165

MCQeasy

A company has an Amazon DynamoDB table with auto scaling enabled. During a traffic spike, the application experiences high write latencies. Which action should the company take to troubleshoot the latency issue?

A.Monitor the ThrottledWriteEvents metric in CloudWatch.

B.Switch the table to on-demand capacity mode.

C.Disable auto scaling and manually increase write capacity.

D.Increase the read capacity of the table.

AnswerA

ThrottledWriteEvents indicates if writes are being throttled, causing high latency.

Why this answer

Option B is correct because CloudWatch metrics for ThrottledWriteEvents help identify if auto scaling is keeping up. Option A is wrong because disabling auto scaling may cause throttling. Option C is wrong because changing to on-demand may not be cost-effective and is not a troubleshooting step.

Option D is wrong because increasing read capacity does not affect write latency.

Practice this question →

166

MCQhard

A company has an Amazon Aurora MySQL DB cluster with one writer and two readers. The application is reporting increased read latency. The DB cluster is using Aurora Auto Scaling for readers. Which metric should be used to trigger scaling out readers?

A.ReadLatency

B.ReaderNodeCPU

C.CPUUtilization

D.AverageActiveConnections

AnswerD

Aurora Auto Scaling uses the number of active connections to the reader to add or remove readers.

Why this answer

Aurora Auto Scaling for readers uses the 'AverageActiveConnections' metric to determine when to add readers. Option B is correct. Option A is incorrect because the auto scaling is based on connections, not CPU.

Option C is incorrect because ReaderNodeCPU is not a predefined metric for auto scaling. Option D is incorrect because ReadLatency is an effect, not a scaling trigger.

Practice this question →

167

Multi-Selectmedium

Which TWO metrics should be monitored to detect an Amazon RDS for MySQL instance that is experiencing memory pressure? (Choose 2.)

Select 2 answers

A.SwapUsage

B.DatabaseConnections

C.ReadIOPS

D.FreeableMemory

E.CPUUtilization

AnswersA, D

High swap usage indicates memory pressure.

Why this answer

B and D are correct. SwapUsage indicates memory pressure when it exceeds 0. FreeableMemory shows available memory.

A is wrong because CPUUtilization is CPU, not memory. C is wrong because DatabaseConnections affects memory but is indirect. E is wrong because ReadIOPS is disk I/O.

Practice this question →

168

Multi-Selecthard

Which THREE steps should be taken to troubleshoot high replica lag in an Amazon Aurora MySQL DB cluster? (Choose THREE.)

Select 3 answers

A.Review network latency between the writer and replica

B.Increase the instance size of the replica

C.Check the replica's CPU and memory utilization

D.Disable binary logging on the writer

E.Examine the replication thread status using SHOW SLAVE STATUS

AnswersA, C, E

Network latency can increase replication lag.

Why this answer

Options B, D, and E are correct. Checking the replica's load (B), examining replication threads (D), and reviewing network latency (E) are common troubleshooting steps. Option A (increasing instance size) is a solution, not a troubleshooting step.

Option C (disabling binary logging) would break replication.

Practice this question →

169

Multi-Selecteasy

An Amazon RDS for MySQL instance is running out of storage. Which TWO actions can be taken to resolve this issue without downtime?

Select 2 answers

A.Enable storage auto scaling to automatically add storage.

B.Delete unnecessary data from the instance.

C.Modify the DB instance to increase allocated storage.

D.Take a snapshot and restore to a larger instance.

E.Convert the instance to Amazon Aurora.

AnswersA, C

Auto scaling adds storage when needed without manual intervention.

Why this answer

RDS supports modifying storage to increase allocated storage without downtime. You can also enable storage auto scaling to automatically increase storage. Option C (deleting data) may free space but is not recommended as a proactive measure.

Option D (snapshot and restore) involves downtime. Option E (conversion to Aurora) is a migration, not a direct fix.

Practice this question →

170

Multi-Selecteasy

A database administrator needs to monitor the free storage space on an Amazon RDS for SQL Server instance. Which TWO CloudWatch metrics should be used? (Choose two.)

Select 2 answers

A.DatabaseConnections

B.ReadLatency

C.BinLogDiskUsage

D.FreeStorageSpaceInPercent

E.FreeStorageSpace

AnswersD, E

Percentage of free storage space.

Why this answer

Option A (FreeStorageSpace) and Option D (FreeStorageSpaceInPercent) are correct. Option B is wrong because BinLogDiskUsage is for MySQL. Option C is wrong because DatabaseConnections is connections.

Option E is wrong because ReadLatency is for read operations.

Practice this question →

171

MCQhard

A company is running an Amazon DynamoDB table with on-demand capacity. The application is experiencing high latency for GetItem operations. CloudWatch shows that 'ThrottledRequests' is zero, but 'ConsumedReadCapacityUnits' is consistently below the provisioned limit. The table has a global secondary index (GSI) with a different partition key. What is the most likely cause of the high latency?

A.High write activity on the base table causing read latency

B.Insufficient provisioned read capacity on the base table

C.Throttling on the global secondary index due to uneven access patterns

D.Throttling on the base table due to hot partitions

AnswerC

GSI throttling can cause backpressure on the base table.

Why this answer

Option D is correct because GSIs have their own capacity and if the GSI is throttled, writes to the base table can be throttled, affecting read latency. Option A is wrong because throttled requests are zero. Option B is wrong because on-demand capacity does not have a provisioned limit.

Option C is wrong because hot partitions would cause throttling.

Practice this question →

172

MCQhard

A database specialist is troubleshooting a degraded Amazon Aurora MySQL database cluster. The primary instance is showing elevated read latency and the replica lag is increasing. The DB cluster has one writer and one reader instance. The writer instance is using 90% of its allocated memory, and the reader instance is using 70%. Which action is most likely to reduce read latency and replica lag?

A.Increase the allocated storage for the cluster to improve I/O throughput.

B.Increase the writer instance size to provide more memory for write operations.

C.Increase the reader instance size to provide more buffer pool memory.

D.Add a second reader instance to distribute the read load.

AnswerC

More memory on the reader allows a larger buffer pool, reducing read latency and helping catch up replication.

Why this answer

Option D is correct because increasing the instance size of the reader provides more memory for the buffer pool, improving read performance and reducing replica lag. Option A is wrong because increasing the writer instance size does not directly help read latency on the reader. Option B is wrong because adding another reader does not help if the existing reader is under-resourced.

Option C is wrong because increasing storage does not directly improve memory or query performance.

Practice this question →

173

MCQhard

A company is using Amazon DynamoDB with auto scaling. The application is experiencing increased read latency. The monitoring shows that the ReadThrottleEvents metric is high. Which action should the company take to reduce read throttling?

A.Implement DynamoDB Accelerator (DAX) for caching.

B.Switch the table to on-demand capacity mode.

C.Enable DynamoDB Streams on the table.

D.Increase the write capacity units.

AnswerA

DAX caches reads, reducing pressure on the table.

Why this answer

Option C is correct because DynamoDB Accelerator (DAX) reduces read latency and throttling by caching. Option A is wrong because increasing write capacity doesn't help reads. Option B is wrong because enabling Streams may add overhead.

Option D is wrong because switching to on-demand may increase cost but not necessarily fix throttling if workload is bursty.

Practice this question →

174

MCQeasy

A developer reports that an Amazon RDS for PostgreSQL database instance's storage is full and the instance is in 'storage-full' state. The team wants to resolve this without downtime. Which action should be taken?

A.Create a read replica and promote it

B.Modify the DB instance to increase allocated storage

C.Delete old automated snapshots

D.Reboot the DB instance

AnswerB

RDS supports dynamic storage scaling without downtime for most engines.

Why this answer

Option A is correct because modifying the allocated storage for an RDS instance is a dynamic operation that can increase storage without downtime. Option B is wrong because deleting old snapshots does not free up storage on the instance. Option C is wrong because creating a read replica requires the source to have available storage.

Option D is wrong because rebooting does not free storage.

Practice this question →

175

Multi-Selectmedium

A database administrator notices that an Amazon RDS for MySQL DB instance is experiencing high CPU utilization and increased latency during peak hours. The administrator wants to identify the queries causing the issue. Which TWO actions should be taken to diagnose the problem? (Select TWO.)

Select 2 answers

A.Enable Performance Insights on the DB instance.

B.Enable Amazon DevOps Guru for RDS.

C.Enable the slow query log and set a threshold for query duration.

D.Use RDS Proxy to manage connections.

E.Enable RDS Enhanced Monitoring.

AnswersA, C

Performance Insights provides detailed performance metrics and top SQL queries.

Why this answer

Option A: Enabling Performance Insights provides a performance schema and wait events to identify problematic queries. Option C: Enabling the slow query log captures queries that take longer than a specified time, helping to pinpoint slow queries. Option B: RDS Enhanced Monitoring is for OS-level metrics, not query-level.

Option D: Amazon DevOps Guru is for broader anomaly detection, not specific query identification. Option E: RDS Proxy is for connection pooling, not query analysis.

Practice this question →

176

MCQhard

A company uses Amazon DynamoDB with auto scaling enabled. The application experiences increased latency during peak hours. The DynamoDB table has a read capacity of 10,000 RCU and write capacity of 5,000 WCU. The auto scaling target utilization is 70%. During peak hours, the consumed read capacity reaches 8,000 RCU, but auto scaling does not increase capacity. What is the most likely reason?

A.The consumed capacity is still below the provisioned capacity, so no throttling occurs, and auto scaling does not trigger.

B.The auto scaling configuration has a maximum capacity that prevents scaling beyond a certain limit.

C.Auto scaling for DynamoDB does not support increasing read capacity; it only decreases capacity.

D.Auto scaling only scales out when the consumed capacity exceeds the provisioned capacity.

AnswerB

If the maximum capacity is set to 10,000 RCU, auto scaling cannot increase further.

Why this answer

Auto scaling uses the consumed capacity metric over a period. If the consumed capacity is consistently at 8,000 RCU, the target utilization is 70% of provisioned (10,000 RCU), so the current utilization is 80%, which is above the target. Auto scaling should increase capacity.

However, auto scaling has a cooldown period and also scales based on sustained consumption. Option A is not correct because auto scaling can scale out. Option B is not correct because the table is not throttled yet.

Option C is correct because auto scaling might be configured with a maximum capacity that limits scaling. Option D is not correct because auto scaling can increase RCU if needed.

Practice this question →

177

MCQmedium

A company is using Amazon Redshift for data warehousing. The users report that queries are slow, and the CloudWatch metric 'CPUUtilization' shows high usage. The cluster has 4 dc2.large nodes. The administrator runs the following query: SELECT * FROM stl_query WHERE starttime > '2023-12-01' ORDER BY duration DESC LIMIT 10; This shows that the longest query runs for over 5 minutes. Which of the following is the MOST effective way to improve query performance?

A.Increase the size of the cluster volume by adding more storage

B.Add more nodes to the cluster (scale out)

C.Run the VACUUM command to reclaim space and improve query performance

D.Modify the workload management (WLM) queue to increase concurrency

AnswerB

Adding nodes increases compute capacity and parallelism.

Why this answer

Option C is correct because distributing data across more nodes allows parallel processing, which reduces query execution time. Option A is incorrect because the issue is CPU, not storage. Option B is incorrect because WLM queues manage concurrency, not individual query performance.

Option D is incorrect because vacuum reclaims space but does not directly speed up queries.

Practice this question →

178

MCQeasy

Refer to the exhibit. A database specialist is investigating an application outage that occurred at 14:25 UTC. The RDS event log shows the events above. What is the most likely cause of the outage?

A.A maintenance window update was applied, causing the instance to become unavailable.

B.The DB instance ran out of storage and became read-only.

C.A Multi-AZ failover occurred at 14:25 UTC, causing a brief outage.

D.The DB instance restarted at 14:30 UTC due to a manual reboot.

AnswerC

Failover involves promoting the standby, which causes a momentary outage.

Why this answer

Option A is correct because a Multi-AZ failover can cause a brief outage during the transition. The restart at 14:30 is likely the result of the failover. Option B is wrong because the restart occurred after the failover.

Option C is wrong because the events do not indicate a maintenance window. Option D is wrong because there is no evidence of insufficient storage.

Practice this question →

179

Multi-Selecthard

A company's Amazon Redshift cluster is experiencing slow query performance. Which THREE diagnostic steps should be taken to identify the bottleneck?

Select 3 answers

A.Query STL_LOAD_ERRORS to check for data load errors

B.Query SVL_QUERY_REPORT to see step-level performance

C.Query STL_QUERY to analyze query execution times

D.Query STL_WLM_QUERY to check for queue waits

E.Query STV_BLOCKLIST to check disk allocation

AnswersB, C, D

Shows time spent per step.

Why this answer

Option A is correct because STL_QUERY provides query execution details. Option C is correct because SVL_QUERY_REPORT shows step-level execution. Option E is correct because STL_WLM_QUERY shows WLM queue waits.

Option B is wrong because STV_BLOCKLIST shows disk allocation, not performance. Option D is wrong because STL_LOAD_ERRORS is for load errors.

Practice this question →

180

MCQhard

A team is troubleshooting an Amazon Aurora MySQL cluster where the writer instance fails over unexpectedly every few days. The failover is not triggered by manual intervention or maintenance events. CloudWatch metrics show that the failover occurs when the replica lag on the reader instances spikes to over 10 seconds. What is the MOST likely cause of the failover?

A.The reader instances are over-provisioned, causing them to become idle and drop connections

B.The Aurora cluster is experiencing a storage subsystem bottleneck in a single Availability Zone

C.The writer instance is running a long-running transaction that blocks replication

D.The reader instances are under-provisioned and cannot keep up with the write rate from the writer

AnswerD

Under-provisioned readers cause replica lag, which may trigger failover if the reader fails health checks.

Why this answer

Option D is correct because if the reader instances are overloaded, they may not be able to apply changes quickly enough, causing replica lag. The failover could be due to the failover priority settings or a health check failure. Option A is incorrect because storage issues would affect both writer and readers.

Option B is incorrect because an oversized instance would reduce lag, not increase it. Option C is incorrect because long-running queries on the writer would not directly cause replica lag on readers.

Practice this question →

181

MCQmedium

A company is using Amazon ElastiCache for Redis as a caching layer for a web application. Users report that some cached data is missing, causing slower responses. Which ElastiCache feature should be checked first to understand key evictions?

A.ReplicationLag metric

B.CPUUtilization metric

C.Evictions metric in CloudWatch

D.CacheHits metric in CloudWatch

AnswerC

High evictions indicate memory pressure causing key removal.

Why this answer

Option A is correct because Evictions metric shows how many keys are evicted due to memory pressure. Option B is wrong because CacheHits shows hits, not evictions. Option C is wrong because ReplicationLag is for replication.

Option D is wrong because CPUUtilization does not directly show evictions.

Practice this question →

182

MCQmedium

A company has an Amazon RDS for SQL Server database that stores customer orders. The database is running on a db.m5.large instance. Recently, the DBA noticed that the 'Write IOPS' metric is consistently high during business hours, and the 'Write Latency' metric is also elevated. The application is experiencing slow order placement. The DBA checks the disk queue depth and finds it to be high. The database uses General Purpose SSD (gp2) storage with 500 GB. What should the DBA do to improve write performance?

A.Enable Multi-AZ to offload writes to standby.

B.Change the storage type to gp3 and increase the IOPS.

C.Increase the storage size to 1000 GB to get more baseline IOPS.

D.Migrate to Provisioned IOPS (io2) storage with appropriate IOPS.

AnswerD

Provisioned IOPS provides consistent low latency for write-heavy workloads.

Why this answer

Option C is correct because Provisioned IOPS (io2) provides consistent low latency. Option A is wrong because gp3 may not provide enough IOPS for the workload. Option B is wrong because increasing storage size increases baseline IOPS for gp2, but 500 GB already gives 1500 IOPS; more IOPS may be needed.

Option D is wrong because Multi-AZ does not improve write performance; it may increase latency.

Practice this question →

183

MCQhard

An e-commerce application uses Amazon Aurora MySQL. During a flash sale, the database experiences a surge in write operations, and the application reports 'Lock wait timeout exceeded' errors. The DBA suspects a long-running transaction is blocking others. Which AWS tool or feature should the DBA use to identify the blocking transaction?

A.Amazon Aurora Performance Insights with the 'Wait for lock' dimension

B.Amazon Inspector

C.CloudWatch Logs Insights

D.Amazon RDS Enhanced Monitoring

AnswerA

Performance Insights can display blocking sessions and wait events.

Why this answer

Option D is correct because Aurora's Performance Insights with the 'Wait for lock' dimension can identify blocking sessions. Option A is wrong because Amazon Inspector is a security assessment tool. Option B is wrong because CloudWatch Logs Insights can analyze logs but does not directly identify blocking transactions.

Option C is wrong because Enhanced Monitoring provides OS-level metrics, not database lock information.

Practice this question →

184

MCQmedium

A company is using Amazon RDS for SQL Server with Multi-AZ. The application experiences a brief interruption during a planned maintenance window. The DBA wants to verify that the failover occurred correctly. Which set of metrics should the DBA examine?

A.FreeStorageSpace on the primary instance

B.DatabaseConnections and ReplicaLag

C.WriteIOPS and ReadIOPS for both instances

D.NetworkThroughput on the replica instance

AnswerB

Connections drop to zero during failover, and ReplicaLag becomes zero after failover.

Why this answer

Option D is correct because a failover results in a brief downtime (DatabaseConnections drop to zero) followed by a switch from master to standby (ReplicaLag disappears). Option A is wrong because WriteIOPS and ReadIOPS may change but not specifically indicate failover. Option B is wrong because FreeStorageSpace does not change during failover.

Option C is wrong because NetworkThroughput may fluctuate but is not a direct indicator of failover.

Practice this question →

185

Multi-Selectmedium

A company is using Amazon Redshift and has a query that is running slowly. The DBA wants to identify if the query is I/O-bound. Which TWO metrics from Amazon CloudWatch can indicate I/O-bound queries?

Select 2 answers

A.NetworkReceiveThroughput

B.ReadIOPS

C.CPUUtilization

D.WriteIOPS

E.ReadLatency

AnswersB, E

High ReadIOPS indicates many I/O operations.

Why this answer

Options A and D are correct. ReadIOPS measures read operations per second, and ReadLatency measures the time for read operations; both are I/O metrics. WriteIOPS and WriteLatency are also I/O metrics but given the question, the correct ones are ReadIOPS and ReadLatency.

Option C is wrong because CPUUtilization indicates CPU-bound, not I/O-bound. Option E is wrong because NetworkReceiveThroughput measures network throughput.

Practice this question →

186

Multi-Selectmedium

A company is using Amazon DynamoDB for a session management application. The application is experiencing higher latency during peak hours. The DynamoDB table uses on-demand capacity. Which TWO steps should a database specialist take to diagnose the issue?

Select 2 answers

A.Review the table's ReadCapacityUnits and WriteCapacityUnits to ensure they are sufficient.

B.Examine the CloudWatch metric for UserErrors to identify client-side issues.

C.Check the CloudWatch metric for ThrottledRequests to see if any requests are being throttled.

D.Enable DynamoDB Streams to capture all write events for analysis.

E.Analyze the PartitionKey metrics to detect if a single partition is receiving a disproportionate amount of traffic.

AnswersC, E

Throttling can cause increased latency.

Why this answer

Checking ThrottledRequests helps identify if requests are being throttled, even with on-demand. Reviewing PartitionKey metrics helps detect hot partitions. ReadCapacityUnits and WriteCapacityUnits are not applicable for on-demand.

Latency metrics show end-to-end performance but are less diagnostic.

Practice this question →

187

MCQhard

Refer to the exhibit. A database specialist is troubleshooting an automation script that fails when trying to create a snapshot of the RDS DB instance 'mydb' using an IAM role with the attached policy. The error message indicates that the user is not authorized to perform the operation. Which statement best explains the failure?

A.The resource ARN for the snapshot is incorrect; it should specify the DB instance ID.

B.The policy does not allow the rds:DescribeDBInstances action on the snapshot resource.

C.The policy does not allow the rds:CreateDBSnapshot action on the specific snapshot name.

D.The policy does not grant the rds:CreateDBSnapshot permission on the DB instance resource.

AnswerD

CreateDBSnapshot requires permission on the DB instance.

Why this answer

Option C is correct because the CreateDBSnapshot action requires permissions on both the DB instance and the snapshot resource. The policy only grants access to the snapshot resource, not the DB instance. Options A and B are wrong because the actions are allowed.

Option D is wrong because the resource is specified correctly.

Practice this question →

188

MCQeasy

An administrator is troubleshooting an Amazon RDS for PostgreSQL instance that is experiencing high CPU utilization. The administrator has enabled Performance Insights. Which metric should be examined first to identify the queries consuming the most CPU?

A.db.sessions

B.db.cpu.avg

C.db.load.avg

D.db.bytes_sent

AnswerC

This metric shows the average number of active sessions and is key for identifying high-load queries.

Why this answer

Option B is correct because the 'db.load.avg' metric in Performance Insights shows the average number of active sessions, which helps identify queries causing high load. Option A is incorrect because 'db.cpu.avg' is not a standard Performance Insights metric. Option C is incorrect because 'db.sessions' is not a standard metric.

Option D is incorrect because 'db.bytes_sent' relates to network throughput, not CPU.

Practice this question →

189

Multi-Selectmedium

A company is experiencing slow query performance on an Amazon RDS for MySQL database. The DBA wants to identify the most time-consuming queries. Which TWO actions should the DBA take? (Choose two.)

Select 2 answers

A.Enable the audit log.

B.Enable the general log and review it.

C.Use Amazon RDS Enhanced Monitoring.

D.Use Amazon RDS Performance Insights.

E.Enable the slow query log and monitor it in CloudWatch Logs.

AnswersD, E

Performance Insights identifies top SQL by load.

Why this answer

Amazon RDS Performance Insights (Option D) provides a database performance tuning and monitoring feature that visualizes database load and identifies the most time-consuming queries by breaking down wait events, SQL statements, and hosts. It directly helps the DBA pinpoint the specific queries causing performance degradation without additional configuration or overhead.

Exam trap

The trap here is that candidates often confuse general logging (Option B) with slow query logging, or assume Enhanced Monitoring (Option C) provides query-level insights, when in fact only Performance Insights and the slow query log directly identify the most time-consuming queries.

Practice this question →

190

Multi-Selecthard

A company uses Amazon Aurora MySQL for its e-commerce platform. The DB cluster has one writer and two readers. Recently, the application started showing occasional deadlock errors during order processing. The error logs show: 'Transaction (Process ID 123) was deadlocked on lock resources with another process and has been chosen as the deadlock victim. Rerun the transaction.' The application retries three times before failing. The development team wants to reduce the likelihood of deadlocks. Which three actions should the team take? (Choose three.)

Select 3 answers

A.Shorten the duration of transactions by committing frequently.

B.Switch to READ UNCOMMITTED isolation level to reduce locking.

C.Use READ COMMITTED isolation level instead of REPEATABLE READ.

D.Increase the number of retry attempts to 10.

E.Ensure that transactions access tables in the same order.

AnswersA, C, E

Shorter transactions hold locks for less time, reducing the chance of conflicts.

Why this answer

Options A, C, and D are correct. Option A: Reducing transaction duration minimizes the window for lock conflicts. Option C: Accessing tables in a consistent order avoids cycle dependencies.

Option D: Using READ COMMITTED reduces shared locks for reads. Option B is incorrect because increasing retries does not prevent deadlocks; it only delays failure. Option E is incorrect because lowering isolation level to READ UNCOMMITTED can cause dirty reads and is not recommended for e-commerce.

Practice this question →

191

Multi-Selecteasy

Which TWO CloudWatch metrics should be monitored to determine if an Amazon Aurora MySQL DB cluster has sufficient I/O capacity?

Select 2 answers

A.ReadIOPS

B.WriteIOPS

C.FreeableMemory

D.CPUUtilization

E.DatabaseConnections

AnswersA, B

ReadIOPS shows the number of read I/O operations per second.

Why this answer

Options B and D are correct. ReadIOPS and WriteIOPS show actual I/O operations. Option A is wrong because CPUUtilization is about CPU.

Option C is wrong because DatabaseConnections is about connections. Option E is wrong because FreeableMemory is about memory.

Practice this question →

192

MCQhard

A company runs an Amazon Aurora MySQL database cluster with one writer and two readers. The application suddenly fails with 'Too many connections' error. The writer instance's maximum connections is set to 1000. Which configuration change would best resolve the issue while maintaining high availability?

A.Enable Amazon RDS Proxy for the database cluster.

B.Increase the max_connections parameter on the writer instance to 5000.

C.Redirect all write requests to one of the read replicas.

D.Manually kill idle connections from the database.

AnswerA

Correct. RDS Proxy efficiently manages database connections, reducing the number of open connections.

Why this answer

Option D is correct because a multi-threaded application may exhaust connections; using a connection pooler like RDS Proxy reduces connection churn. Option A is wrong because increasing max_connections may overload the instance. Option B is wrong because read replicas do not accept writes.

Option C is wrong because it's not a configuration change.

Practice this question →

193

MCQmedium

A company is experiencing increased latency on their RDS for PostgreSQL instance. The CloudWatch metrics show high ReadIOPS but low CPU utilization. Which action is MOST likely to resolve the issue?

A.Migrate the database to Amazon Aurora PostgreSQL

B.Increase the instance size or switch to a gp3 volume

C.Enable Multi-AZ deployment

D.Enable storage auto-scaling

AnswerB

More memory and I/O capacity reduces wait events.

Why this answer

Option C is correct because increasing the instance size or switching to a storage-optimized instance reduces I/O latency. Option A is wrong because Aurora is not the same as PostgreSQL. Option B is wrong because Multi-AZ does not improve performance.

Option D is wrong because enabling auto-scaling only scales storage, not performance.

Practice this question →

194

MCQeasy

A developer is troubleshooting an issue where an application using Amazon DynamoDB is receiving occasional 'ThrottlingException' errors. The application uses eventually consistent reads. What is the MOST likely cause of this error?

A.The application is using an incorrect table name

B.The read capacity units are set too low for the current traffic pattern

C.The application is using eventually consistent reads instead of strongly consistent reads

D.There is a network connectivity issue between the application and DynamoDB

AnswerB

Throttling happens when traffic exceeds provisioned capacity.

Why this answer

Option B is correct because ThrottlingException occurs when the request rate exceeds the provisioned throughput capacity. Option A is wrong because eventually consistent reads consume half the read capacity units, so they are less likely to cause throttling. Option C is wrong because network issues would cause timeout errors, not throttling.

Option D is wrong because incorrect table name would cause ResourceNotFoundException.

Practice this question →

195

Multi-Selecteasy

A company uses Amazon DynamoDB Global Tables for a multi-region application. The table is configured with on-demand capacity. The application writes data in the us-east-1 region and reads from us-west-2. Users in us-west-2 report that data written in us-east-1 is not appearing in us-west-2 within the expected replication latency of under 5 seconds. Instead, replication sometimes takes up to 30 seconds. Which two factors could be causing this increased replication latency? (Choose two.)

Select 2 answers

A.The application is reading from the us-west-2 table before replication completes.

B.High network latency or packet loss between us-east-1 and us-west-2.

C.The read capacity in us-west-2 is insufficient, causing read throttling.

D.The on-demand capacity mode is causing write throttling in us-east-1.

E.A large volume of writes to the table creates a backlog in the replication stream.

AnswersB, E

Replication relies on network connectivity; latency increases replication time.

Why this answer

Options B and C are correct. Option B: Network latency between regions directly affects replication time. Option C: Large write volumes can cause backpressure on the replication stream.

Option A is incorrect because on-demand capacity does not throttle writes; it scales automatically. Option D is incorrect because Global Tables replicate all data to all regions; us-west-2 must receive all writes. Option E is incorrect because read capacity on a replica does not affect replication; replication uses write capacity in the replica region.

Practice this question →

196

MCQmedium

A company is using Amazon Neptune and notices that some queries are slow. The DBA wants to identify which queries consume the most time. Which feature should be used?

A.Database audit log

B.Query profiler

C.Slow query log

D.Performance Insights

AnswerB

Query profiler captures execution details of queries.

Why this answer

Neptune's query profiler provides detailed information about query execution time. The slow query log only logs queries that exceed a threshold. The audit log is for security events.

The performance insights is for RDS, not Neptune.

Practice this question →

197

MCQmedium

A company uses Amazon ElastiCache for Redis as a caching layer for a web application. They observe a sudden increase in CPU utilization on the cache cluster, and the application experiences higher latency. Which action should be taken to diagnose the issue?

A.Enable query profiling on the ElastiCache cluster.

B.Monitor Amazon CloudWatch metrics for the ElastiCache cluster to identify the cause.

C.Disable persistence to reduce CPU overhead.

D.Immediately scale up to a larger node type.

AnswerB

Correct. CloudWatch metrics provide insight into CPU usage, cache hits/misses, and other performance indicators.

Why this answer

Option A is correct because CloudWatch metrics like CPUUtilization and CacheHits are the first step to diagnose. Option B is wrong because increasing node size is a solution, not a diagnostic step. Option C is wrong because ElastiCache does not have a built-in query profiler.

Option D is wrong because disabling persistence is unrelated.

Practice this question →

198

MCQmedium

An organization is using Amazon DynamoDB with on-demand capacity. They notice that the 'ThrottledRequests' metric is non-zero during a specific hour each day. The table stores session data and has a partition key of 'user_id'. The workload is uniform. What is the most likely cause of throttling?

A.The table has on-demand capacity and cannot throttle.

B.The account-level read/write capacity limit is exceeded.

C.A global secondary index is consuming write capacity.

D.The partition key is not evenly distributed.

AnswerD

Uneven access pattern can cause a hot partition leading to throttling even on on-demand tables.

Why this answer

Option D is correct because the 'ThrottledRequests' metric indicates that requests are being throttled due to partition-level capacity limits. Even with on-demand capacity, DynamoDB imposes a per-partition throughput limit (typically 1,000 WCU or 3,000 RCU per partition). If the partition key 'user_id' is not evenly distributed, a single partition can become a hot key, exceeding its limit and causing throttling despite the table having on-demand capacity.

Exam trap

The trap here is that candidates assume on-demand capacity eliminates all throttling, but DynamoDB still throttles at the partition level when a hot key causes uneven distribution, even with on-demand mode.

How to eliminate wrong answers

Option A is wrong because on-demand capacity can still throttle requests when a single partition exceeds its throughput limit, contrary to the misconception that on-demand mode never throttles. Option B is wrong because account-level read/write capacity limits are separate from DynamoDB throttling; the 'ThrottledRequests' metric reflects partition-level throttling, not account-level limits. Option C is wrong because a global secondary index consuming write capacity would affect the index's own throttling metrics, not the base table's 'ThrottledRequests' metric, and the question specifies the table stores session data with a uniform workload.

Practice this question →

199

MCQhard

A company's DynamoDB table uses provisioned capacity and has a global table for disaster recovery. Write requests to the replica region are failing with ProvisionedThroughputExceededException even though the replica table shows low consumed capacity. What is the MOST likely cause?

A.Auto-scaling is disabled on the replica table

B.The replica table is exhausting its burst capacity

C.The replica table's write capacity is insufficient for replication writes

D.DAX is enabled on the replica table

AnswerC

Global tables replicate writes asynchronously; if replica's write capacity is lower than the replication traffic, throttling occurs.

Why this answer

Option D is correct because replicas have independent capacity; if replication writes exceed provisioned capacity, throttling occurs. Option A is wrong because burst capacity is not relevant to throttling. Option B is wrong because auto-scaling doesn't cause throttling.

Option C is wrong because DAX is for reads.

Practice this question →

200

MCQmedium

A company is using Amazon Aurora MySQL. The database performance has degraded, and the DBA wants to identify the queries that are waiting for locks. Which system table should be queried to find information about locking conflicts?

A.information_schema.TABLES

B.information_schema.PROCESSLIST

C.performance_schema.events_waits_current

D.information_schema.INNODB_LOCKS

AnswerD

INNODB_LOCKS shows current InnoDB locks.

Why this answer

Option C is correct because INNODB_LOCKS shows current locks and InnoDB lock conflicts. Option A is wrong because PERFORMANCE_SCHEMA provides performance data but not specifically locks. Option B is wrong because INFORMATION_SCHEMA.PROCESSLIST shows current processes but not lock details.

Option D is wrong because INFORMATION_SCHEMA.TABLES shows table metadata.

Practice this question →

201

MCQeasy

A database administrator notices that an Amazon RDS for Oracle instance has a high number of connections, causing performance degradation. Which tool can be used to identify the active sessions and their queries?

A.Amazon Inspector

B.Performance Insights

C.RDS Enhanced Monitoring

D.AWS CloudTrail

AnswerB

Performance Insights shows active sessions and their queries.

Why this answer

Option D is correct because Performance Insights provides a dashboard to monitor active sessions and the queries they are running. Option A is wrong because AWS CloudTrail logs API calls, not database sessions. Option B is wrong because RDS Enhanced Monitoring provides OS metrics, not database session details.

Option C is wrong because Amazon Inspector is a security assessment tool.

Practice this question →

202

MCQeasy

A database administrator notices that the /var/log/mysql/error.log file on an Amazon RDS for MySQL DB instance is growing rapidly. The administrator wants to monitor the log file size and receive alerts when it exceeds a certain threshold. Which AWS service should be used to set up this monitoring?

A.AWS CloudTrail

B.Amazon S3

C.Amazon RDS Enhanced Monitoring

D.Amazon CloudWatch Logs

AnswerD

CloudWatch Logs can monitor log file size and trigger alarms.

Why this answer

Option D is correct because CloudWatch Logs can monitor log file sizes and trigger alarms based on metric filters. Option A is wrong because CloudTrail does not monitor log file sizes. Option B is wrong because RDS Enhanced Monitoring provides OS-level metrics, not log file metrics.

Option C is wrong because S3 is a storage service, not a monitoring service.

Practice this question →

203

Multi-Selecthard

A company is using Amazon DynamoDB with global tables. The application is experiencing high write latency in one region. Which THREE factors could contribute to this issue? (Choose three.)

Select 3 answers

A.Replication lag from global tables causing write conflicts.

B.Insufficient read capacity units provisioned for the table.

C.High network latency between the application and the DynamoDB endpoint.

D.The table's auto scaling configuration is set to increase write capacity too aggressively.

E.Insufficient write capacity units provisioned for the table.

AnswersA, C, E

Global tables replicate asynchronously, and conflicts can increase latency.

Why this answer

Option A (network latency), Option C (throttling due to insufficient write capacity), and Option D (replication lag) are correct. Option B is wrong because auto scaling doesn't increase latency; it adds capacity. Option E is wrong because read capacity affects reads, not writes.

Practice this question →

204

Multi-Selecteasy

A company is troubleshooting an Amazon DynamoDB table that is experiencing high latency. The table uses on-demand capacity. Which TWO steps should be taken to diagnose the issue?

Select 2 answers

A.Increase the provisioned read capacity

B.Examine CloudWatch metrics for throttling and latency patterns

C.Consider using DynamoDB Accelerator (DAX) for caching

D.Disable TTL to reduce write overhead

E.Split the table into multiple partitions manually

AnswersB, C

CloudWatch provides key performance metrics.

Why this answer

Option A is correct because CloudWatch metrics can show throttling, latency, and utilization. Option D is correct because DynamoDB Accelerator (DAX) can reduce latency if the issue is high read volume. Option B is wrong because increasing capacity is not needed for on-demand.

Option C is wrong because partitions are managed automatically. Option E is wrong because disabling TTL is unrelated.

Practice this question →

205

MCQmedium

Refer to the exhibit. A database specialist is investigating performance degradation on an Amazon RDS for MySQL DB instance. The BurstBalance metric shows the values above. What does this indicate, and what action should be taken?

A.The instance has exhausted its I/O burst credits; scale up the allocated storage to increase baseline I/O.

B.The instance is experiencing high read load; add a read replica to offload reads.

C.The instance's compute capacity is insufficient; change the instance class to a larger size.

D.The instance is experiencing a memory bottleneck; enable Performance Insights to analyze query performance.

AnswerA

Larger storage volumes have higher baseline I/O and accumulate burst credits faster.

Why this answer

Option B is correct because BurstBalance dropping to 0 indicates that the instance has exhausted its I/O burst credits, causing I/O throttling. Scaling up storage to a larger size increases the baseline I/O performance and burst credit earning rate. Option A is wrong because read replicas do not increase I/O credits.

Option C is wrong because enabling Performance Insights only helps diagnose, not resolve. Option D is wrong because changing the instance class does not affect I/O credits.

Practice this question →

206

MCQmedium

Refer to the exhibit. A DBA sees the above error log entries for an Amazon RDS for MySQL DB instance. Which action should the DBA take to resolve the 'Too many connections' error?

A.Modify the DB parameter group to increase max_connections

B.Reboot the DB instance

C.Modify the DB security group to allow more connections

D.Reset the DB instance master user password

AnswerA

Increasing max_connections allows more connections.

Why this answer

Option B is correct because increasing the max_connections parameter allows more concurrent connections. Option A is wrong because rebooting does not increase the connection limit. Option C is wrong because modifying the security group does not affect database connections.

Option D is wrong because the error is not about authentication.

Practice this question →

207

MCQmedium

A database engineer is monitoring an Amazon RDS for PostgreSQL instance and notices that the 'DiskQueueDepth' metric is consistently above 100. The instance uses gp2 storage with 1000 GB allocated. What is the most likely cause of the high disk queue depth?

A.Replication lag between the primary and standby instance

B.The instance has reached the IOPS limit of the gp2 volume

C.Insufficient memory allocated to the instance

D.Network throughput limit is being exceeded

AnswerB

For a 1000 GB gp2 volume, baseline IOPS is 3000; sustained I/O beyond that causes queuing.

Why this answer

Option B is correct because a consistently high DiskQueueDepth (above 100) on an Amazon RDS for PostgreSQL instance with gp2 storage indicates that the volume is saturating its IOPS limit. gp2 volumes provide a baseline of 3 IOPS per GB (up to 16,000 IOPS), so a 1000 GB gp2 volume has a baseline of 3000 IOPS. When the workload exceeds this baseline, the volume relies on burst credits, and once credits are exhausted, IOPS are throttled to the baseline, causing I/O requests to queue up and the DiskQueueDepth metric to rise.

Exam trap

The trap here is that candidates may confuse DiskQueueDepth with memory or network metrics, or assume that any high queue depth automatically indicates a hardware failure, rather than recognizing it as a symptom of IOPS exhaustion on gp2 storage.

How to eliminate wrong answers

Option A is wrong because replication lag between primary and standby instances is measured by the 'ReplicaLag' metric, not DiskQueueDepth, and it does not directly cause high disk queue depth on the primary instance. Option C is wrong because insufficient memory allocated to the instance would manifest as high swap usage or low FreeableMemory, not as a high DiskQueueDepth, which is a storage I/O metric. Option D is wrong because network throughput limits are tracked by metrics like 'NetworkThroughput' or 'NetworkPacketsIn/Out', and exceeding them would cause packet loss or latency, not a buildup of I/O requests at the disk level.

Practice this question →

208

MCQeasy

A database administrator notices that the CPU utilization on an Amazon RDS for PostgreSQL instance is consistently above 90% during peak hours. Which CloudWatch metric should be checked first to identify the cause of the high CPU usage?

A.DatabaseConnections

B.NetworkThroughput

C.SwapUsage

D.ReadIOPS

AnswerA

High connections can lead to high CPU from session management.

Why this answer

Option A is correct because DatabaseConnections can indicate high concurrent sessions causing CPU spikes. Option B is wrong because ReadIOPS may not directly cause CPU. Option C is wrong because SwapUsage is memory-related.

Option D is wrong because NetworkThroughput is network-related.

Practice this question →

209

MCQmedium

A company's Amazon RDS for PostgreSQL instance is running out of storage. The DB instance has auto-scaling enabled, but the storage did not increase. What is the most likely cause?

A.The DB instance class does not support storage auto-scaling.

B.The DB instance is smaller than the minimum storage size for auto-scaling.

C.The DB instance has exceeded the Maximum Storage Duration setting.

D.The storage usage has not reached the maximum allocated storage.

AnswerD

Auto-scaling only occurs when storage is nearly full.

Why this answer

Option C is correct because auto-scaling only triggers when storage usage reaches the maximum allocated storage and there is at least 10% free space remaining. Option A is wrong because Maximum Storage Duration is the maximum time the instance can run on provisioned storage before scaling. Option B is wrong because there is no minimum storage size for scaling.

Option D is wrong because storage auto-scaling does not require modification to the DB instance class.

Practice this question →

210

MCQeasy

A company runs an Amazon RDS for Oracle DB instance. The database administrator wants to receive an alert when the storage space is below 10% of the allocated storage. Which Amazon CloudWatch metric and alarm threshold should be used?

A.Metric: FreeStorageSpace, Condition: < 10% of 100 GB (10 GB)

B.Metric: BinaryLogUsage, Condition: > 10%

C.Metric: FreeableMemory, Condition: < 10% of total memory

D.Metric: DiskQueueDepth, Condition: > 10

AnswerA

FreeStorageSpace metric with a threshold of 10 GB (assuming 100 GB allocated) would trigger when free space is below 10%.

Why this answer

The metric FreeStorageSpace shows the available storage in bytes. The alarm should be set to when FreeStorageSpace is less than 10% of allocated storage. Option B is correct.

Option A is wrong because it's a percentage of total. Option C is wrong because it's a fixed value. Option D is wrong because BinaryLog usage is not relevant.

Practice this question →

211

MCQmedium

A database administrator is troubleshooting an Amazon RDS for SQL Server instance that is experiencing high 'ReadIOPS' and 'ReadLatency'. The instance uses General Purpose SSD (gp2) storage. The 'BurstBalance' metric is 0%. What should the administrator do to improve performance?

A.Enable Multi-AZ to distribute the load

B.Increase the allocated storage or switch to Provisioned IOPS

C.Create a read replica to offload read traffic

D.Disable automatic backups to reduce I/O

AnswerB

Increasing volume size increases baseline IOPS for gp2; switching to io1/io2 provides consistent IOPS.

Why this answer

Option D is correct because when BurstBalance is 0%, the gp2 volume has exhausted its burst credits and is operating at baseline IOPS. To improve performance, you can increase the volume size (which increases baseline IOPS) or switch to Provisioned IOPS (io1/io2). Option A is wrong because enabling Multi-AZ does not increase I/O performance.

Option B is wrong because disabling backups reduces CPU but not IOPS. Option C is wrong because creating a read replica does not improve write performance.

Practice this question →

212

MCQeasy

A developer executed a DELETE statement without a WHERE clause on an Amazon RDS for PostgreSQL instance. The transaction is still open. Which action should the developer take to undo the DELETE without affecting other operations?

A.Execute COMMIT and then run a recovery script.

B.Use the Point-in-Time Recovery feature to restore the database to a time before the DELETE.

C.Execute ROLLBACK in the same session.

D.Stop the DB instance and restore from the latest snapshot.

AnswerC

ROLLBACK undoes all changes made in the current transaction.

Why this answer

Option A is correct because an open transaction can be rolled back using the ROLLBACK command. Option B is incorrect because Point-in-Time Recovery restores the entire DB instance to a previous timestamp, not just the transaction. Option C is incorrect because ROLLBACK will undo the DELETE.

Option D is incorrect because COMMIT would make the DELETE permanent.

Practice this question →

213

MCQhard

A company is running an Amazon DynamoDB table with provisioned capacity. The table has a partition key of 'user_id' and a sort key of 'timestamp'. The application performs frequent Query operations using the partition key and a range of sort keys. Recently, the 'ThrottledRequests' metric has spiked. The read and write capacity units are consistently at 80% utilization. What is the most effective way to resolve the throttling?

A.Increase the provisioned read and write capacity units

B.Add a global secondary index with a different partition key

C.Change the partition key to a more uniformly distributed attribute

D.Enable DynamoDB Auto Scaling with a higher target utilization

AnswerD

Auto Scaling with adaptivity can handle hot partitions by adjusting throughput.

Why this answer

Option D is correct because using an adaptive capacity add-on allows DynamoDB to automatically manage throughput for hot partitions. Option A is wrong because increasing capacity units may not help if throttling is due to hot partitions. Option B is wrong because GSI does not directly solve hot partition throttling.

Option C is wrong because changing partition key distribution would require table redesign and data migration.

Practice this question →

214

MCQhard

A company is using Amazon DynamoDB Accelerator (DAX) to improve read performance. Recently, the cache hit ratio has dropped significantly. The application uses strongly consistent reads. What is the most likely cause of the low cache hit ratio?

A.The DynamoDB table's write capacity is too low

B.The application is using strongly consistent reads, which bypass the DAX cache

C.The DAX cluster has too few nodes

D.The DAX cluster's TTL is set too low

AnswerB

DAX only caches eventually consistent reads; strongly consistent reads go directly to DynamoDB.

Why this answer

Option C is correct because DAX caches eventually consistent reads, not strongly consistent reads. Option A is wrong because TTL affects item expiration but not the consistency model. Option B is wrong because write capacity does not directly affect cache hits.

Option D is wrong because the DAX cluster size affects performance but not the consistency mismatch.

Practice this question →

215

MCQmedium

A company is running a production Amazon RDS for MySQL DB instance. The application team reports intermittent connection timeouts. The DBA notices that the DB instance's CPU utilization spikes to 100% during these times. Which metric should be monitored to determine if the issue is due to a specific query?

A.DatabaseConnections

B.NetworkThroughput

C.ReadIOPS

D.Queries (engine-specific counter)

AnswerD

Queries reflects the number of queries executed, helping identify query load spikes.

Why this answer

Option D is correct because the RDS for MySQL engine-specific counter 'Queries' reflects the number of queries executed. Option A is wrong because DatabaseConnections shows connections, not query performance. Option B is wrong because ReadIOPS measures disk I/O, not query volume.

Option C is wrong because NetworkThroughput measures network traffic.

Practice this question →

216

MCQhard

A company uses Amazon DynamoDB Global Tables with strong consistent reads. They notice that a write to us-east-1 is not visible in eu-west-1 after several seconds. Which configuration setting is MOST likely causing this behavior?

A.DynamoDB Streams is not enabled on the table

B.Auto Scaling is configured for write capacity

C.Last writer wins (LWW) conflict resolution is disabled

D.Strongly consistent reads are used on a global table

AnswerA

Global Tables require DynamoDB Streams to replicate writes; without it, replication does not occur.

Why this answer

Option A is correct because Global Tables use last writer wins (LWW) conflict resolution by default; if the application writes to both regions simultaneously, the latest write from the region with the highest timestamp wins, but replication is typically sub-second. However, if the 'ReplicationGroup' is misconfigured, replication can be delayed. But the question mentions strong consistency reads, which are not supported for Global Tables (they support eventual consistency only).

Actually, strong consistency reads are not available for Global Tables; using them would cause an error. However, the scenario says they notice a delay, which is expected for eventual consistency. Option A is wrong: strong consistency reads are not supported for Global Tables.

Option B is correct: strong consistency reads are not supported, but the scenario says they are using them; this would cause an error, not delay. Option C: DynamoDB Streams must be enabled for Global Tables; if disabled, writes would not replicate at all. Option D: Auto Scaling does not affect replication.

Given the options, the most likely cause is that DynamoDB Streams is not enabled on the table, which would prevent replication entirely. But the question says 'not visible after several seconds', which suggests eventual consistency delay. Actually, the correct answer is that Global Tables only support eventual consistency, so strong consistency reads are not possible; but the question says they are using them, which might be a misconfiguration.

However, typical exam: strong consistency reads on Global Tables are not supported; using them would cause an error, not a delay. So Option B is plausible: they might have requested strong consistency reads on a global table, which would fail. But the scenario says they notice a delay, not an error.

I think the intended answer is that Global Tables rely on DynamoDB Streams; if streams are disabled, writes won't replicate. So Option C. Let me re-evaluate: DynamoDB Streams must be enabled for the table to be used in Global Tables.

If streams are disabled, writes will not be replicated. That matches the symptom: writes to us-east-1 are not visible in eu-west-1. Option A: LWW is default, but it doesn't cause delay.

Option B: Strong consistency reads are not supported for Global Tables, but the scenario says they are using them; it's unclear if that causes delay or error. Option D: Auto Scaling does not affect replication. So Option C is most likely.

Practice this question →

217

Multi-Selecthard

Which THREE metrics should be monitored in Amazon CloudWatch to detect a potential memory leak in an Amazon RDS for SQL Server instance? (Choose three.)

Select 3 answers

A.DatabaseConnections

B.CPUUtilization

C.ReadIOPS

D.SwapUsage

E.FreeableMemory

AnswersA, D, E

If connections are not released, memory usage may increase.

Why this answer

Options B, C, and E are correct. FreeableMemory shows available memory. SwapUsage indicates memory pressure.

DatabaseConnections can increase if connections are not closed properly. Option A is incorrect because CPUUtilization is not a memory metric. Option D is incorrect because ReadIOPS relates to I/O.

Practice this question →

218

MCQhard

A company is migrating an on-premises Oracle database to Amazon RDS for Oracle. During the migration, the database administrator notices that the CPU utilization on the RDS instance is consistently above 90% during peak hours, even though the on-premises server had similar specifications. The application queries are mostly SELECT statements with occasional DML. The RDS instance is db.r5.large with 500 GB of General Purpose SSD (gp2) storage. Which change would most likely reduce CPU utilization?

A.Create a read replica and redirect all SELECT queries to the replica.

B.Enable Multi-AZ to offload CPU to the standby instance.

C.Increase the allocated storage to 1 TB to improve I/O performance.

D.Upgrade to a larger instance type, such as db.r5.xlarge.

AnswerD

A larger instance provides more CPU cores and better performance, directly addressing high CPU utilization.

Why this answer

The db.r5.large instance type has 2 vCPUs and 16 GiB of memory. Sustained CPU utilization above 90% during peak hours indicates that the instance is compute-bound for the workload. Upgrading to db.r5.xlarge (4 vCPUs, 32 GiB memory) doubles the available CPU capacity, directly reducing CPU utilization for the same query load.

The on-premises server had similar specifications, but RDS instances may have different CPU architectures or hypervisor overhead, making the larger instance the most direct fix.

Exam trap

AWS often tests the misconception that increasing storage or adding a read replica can solve CPU bottlenecks, but the correct answer requires recognizing that CPU saturation is a compute issue best addressed by scaling instance size.

How to eliminate wrong answers

Option A is wrong because creating a read replica and redirecting SELECT queries offloads read traffic from the primary instance, but the primary still handles all DML and writes; if the CPU bottleneck is from both SELECT and DML processing on the primary, the replica does not reduce the primary's CPU load. Option B is wrong because Multi-AZ provides a standby instance for failover only; the standby does not serve read traffic or offload CPU from the primary—it is a synchronous replica that is not active for queries. Option C is wrong because increasing gp2 storage to 1 TB increases baseline IOPS from 1500 to 3000, which improves I/O throughput, but the problem is CPU utilization, not I/O latency or throughput; the instance is compute-bound, not storage-bound.

Practice this question →

219

MCQhard

A company is running an Amazon RDS for SQL Server instance with Multi-AZ deployment. The database is used by a critical application. During a recent failover test, the application experienced a 2-minute downtime. The application's connection string uses the DB instance endpoint, not the cluster endpoint. Which change would minimize downtime during future failovers?

A.Modify the application to use the cluster endpoint instead of the instance endpoint

B.Increase the DB instance class size

C.Create a read replica in a different Availability Zone

D.Enable Multi-AZ on the DB instance

AnswerA

The cluster endpoint points to the current primary and updates automatically after failover, reducing downtime.

Why this answer

Option D is correct because using the cluster endpoint (CNAME) automatically redirects to the current primary, reducing downtime. Option A is wrong because enabling Multi-AZ is already done. Option B is wrong because increasing instance size does not reduce failover time.

Option C is wrong because read replicas are for read scaling, not failover.

Practice this question →

220

Multi-Selectmedium

A company is using Amazon DynamoDB and wants to monitor the read/write capacity utilization of a table. Which TWO AWS services can be used to set up alarms for capacity consumption?

Select 2 answers

A.Amazon DynamoDB Auto Scaling

B.AWS CloudTrail

C.Amazon CloudWatch Logs

D.Amazon CloudWatch

E.AWS Config

AnswersC, D

CloudWatch Logs can be used with metric filters to monitor log entries for capacity events.

Why this answer

Options A and B are correct. CloudWatch provides metrics like ConsumedReadCapacityUnits and can trigger alarms. CloudWatch Logs can capture log events but is not directly used for capacity alarms.

CloudTrail records API calls, not capacity metrics. DynamoDB Auto Scaling adjusts capacity but does not monitor. AWS Config tracks configuration changes, not capacity.

Practice this question →

221

Multi-Selectmedium

A company is using Amazon Redshift and notices that queries are running slowly. Which TWO system views should be used to identify the cause of the slow queries? (Choose TWO.)

Select 2 answers

A.STV_TBL_PERM

B.PG_TABLE_DEF

C.STL_QUERY

D.STL_LOAD_COMMITS

E.SVV_QUERY_STATE

AnswersC, E

STL_QUERY records completed queries with duration and resource usage.

Why this answer

Options A and C are correct. STL_QUERY contains query execution details. SVV_QUERY_STATE shows currently running queries.

Option B is wrong because STV_TBL_PERM shows permissions. Option D is wrong because PG_TABLE_DEF shows table definitions. Option E is wrong because STL_LOAD_COMMITS shows load commit information.

Practice this question →

222

MCQeasy

A developer is receiving timeout errors when connecting to an Amazon ElastiCache for Redis cluster from an Amazon EC2 instance. The security group for the EC2 instance allows outbound traffic to the Redis cluster's security group on port 6379. The Redis cluster's security group does not allow inbound traffic from the EC2 instance. What is the most likely cause of the timeout?

A.The network ACL for the Redis subnet is blocking inbound traffic on port 6379

B.The Redis cluster security group does not have an inbound rule allowing traffic from the EC2 security group on port 6379

C.The subnet route table does not have a route to the Redis cluster

D.The Redis cluster is not accessible from within the same VPC

AnswerB

Inbound rules are required for the target security group.

Why this answer

Option A is correct because security groups are stateful; for inbound traffic, the target security group must allow the incoming connection. Option B is incorrect because the subnet routing does not block specific ports. Option C is incorrect because NACLs are stateless but the issue is with security groups.

Option D is incorrect because the Redis cluster is in a VPC, not on-premises.

Practice this question →

223

Multi-Selecthard

A company is using Amazon DynamoDB with a global table in two regions. The application is experiencing high write latency on the replica table in the secondary region. Which THREE factors could contribute to this issue?

Select 3 answers

A.Large item sizes being written to the table.

B.Network latency between the primary and secondary regions.

C.Auto scaling configuration on the replica table.

D.Low read capacity on the replica table.

E.Insufficient write capacity on the replica table.

AnswersA, B, E

Larger items take longer to replicate.

Why this answer

Global tables replicate writes asynchronously; network latency between regions affects replication. The write capacity on the replica table must be sufficient to handle replication writes. Item size affects write latency.

Option D (read capacity) does not affect write latency. Option E (auto scaling) is not a direct cause of high latency.

Practice this question →

224

MCQhard

A team is troubleshooting a DynamoDB table that has high read latency. The table uses on-demand capacity and has a global secondary index (GSI). Which configuration is MOST likely causing the issue?

A.The GSI has provisioned capacity set too low

B.Time-to-Live (TTL) is enabled

C.DAX is enabled for the table

D.The table uses on-demand capacity

AnswerA

GSIs have independent capacity; throttling on GSI causes high latency.

Why this answer

Option A is correct because an under-provisioned GSI can throttle reads even if the base table uses on-demand. Option B is wrong because on-demand handles bursts. Option C is wrong because TTL does not affect read latency.

Option D is wrong because DynamoDB Accelerator (DAX) reduces latency, not increases.

Practice this question →

225

MCQhard

A company has an Amazon Aurora MySQL DB cluster with a primary instance and two Aurora Replicas. The application is experiencing high write latency. The primary instance's CPU utilization is at 90%, while the replicas are at 30%. The DB cluster parameter group has the default values. Which change is most likely to reduce write latency?

A.Increase the DB instance class of the primary instance.

B.Add more Aurora Replicas to distribute the write load.

C.Disable the binary log (binlog) on the DB cluster.

D.Increase the allocated storage of the cluster.

AnswerC

Binary logging adds CPU overhead; disabling it can reduce write latency.

Why this answer

High CPU on the primary could be due to the binary log (binlog) being enabled by default for Aurora MySQL. Option B is correct because disabling binlog reduces CPU overhead. Option A is wrong because increasing instance size would help but is not the most likely fix.

Option C is wrong because read replicas don't affect write latency. Option D is wrong because increasing storage doesn't reduce CPU.

Practice this question →

← PreviousPage 3 of 4 · 300 questions totalNext →

Ready to test yourself?

Try a timed practice session using only Monitoring and Troubleshooting questions.

Start 20-question session