Knowledge + Practice

CCNA Soa Reliability Bc Questions

75 of 240 questions · Page 3/4 · Soa Reliability Bc topic · Answers revealed

Practice these questions Exam hub All questions

151

Multi-Selectmedium

A company is designing a disaster recovery strategy for its production database hosted on Amazon RDS for MySQL. The primary database is in us-east-1. The company requires an RPO of less than 5 minutes and an RTO of less than 1 hour in the event of a Regional failure. Which TWO actions should the company take to meet these requirements?

Select 2 answers

A.Take daily manual snapshots and copy them to us-west-2.

B.Create a cross-Region Read Replica in us-west-2 and promote it during a disaster.

C.Enable cross-Region automated backups.

D.Enable Multi-AZ deployment for the RDS instance.

E.Use a single-AZ RDS instance with automated backups.

AnswersB, C

Correct: A cross-Region Read Replica can be promoted to a primary, providing fast failover with low RTO.

Why this answer

Options B and D are correct. Enabling cross-Region automated backups (Option B) provides continuous backup to another Region with low RPO. Option D, creating a cross-Region Read Replica and promoting it, allows fast failover with low RTO.

Option A (Multi-AZ) is for AZ-level failure, not regional. Option C (snapshots) has higher RPO. Option E (single-AZ) does not provide failover.

Practice this question →

152

Multi-Selecthard

A company is designing a disaster recovery strategy for its AWS environment. The primary Region is us-east-1, and the secondary Region is us-west-2. The application uses Amazon RDS for MySQL, Amazon S3 for static assets, and EC2 instances in an Auto Scaling group. The RTO is 30 minutes, and the RPO is 15 minutes. Which TWO actions should the SysOps administrator take to meet these requirements? (Choose two.)

Select 2 answers

A.Configure a cross-Region read replica for the RDS instance in us-west-2.

B.Deploy a single RDS instance in us-west-2 as a standby.

C.Create AMI backups of EC2 instances every hour and copy to us-west-2.

D.Take daily EBS snapshots and copy them to us-west-2.

E.Enable S3 cross-Region replication for the static assets bucket.

AnswersA, E

A read replica can be promoted to a standalone DB in minutes, meeting RTO and RPO.

Why this answer

Options A and D are correct. A cross-Region read replica in us-west-2 can be promoted to a primary in case of disaster, providing an RPO of seconds and RTO of minutes. S3 cross-Region replication ensures assets are available in the secondary Region with minimal RPO.

Option B is incorrect because EBS snapshots are incremental and copying a snapshot would not meet RPO of 15 minutes if done daily. Option C is incorrect because AMI backups are for EC2 instances, not for database recovery. Option E is incorrect because a single RDS instance does not provide cross-Region failover.

Practice this question →

153

MCQeasy

A company runs a static website on Amazon S3 with a custom domain name (www.example.com). The website is accessed via Amazon CloudFront. The company's marketing team recently updated the website content, but users are reporting that they still see the old content. The SysOps administrator checks the S3 bucket and confirms that the new files are present. The administrator also checks CloudFront and finds that the default TTL for the cache behavior is 24 hours. The marketing team needs the new content to be visible immediately. What should the administrator do to make the new content available to users as quickly as possible?

A.Disable the CloudFront distribution and re-enable it after 5 minutes.

B.Change the default TTL for the CloudFront cache behavior to 0 seconds.

C.Create a CloudFront invalidation for the path '/*' to remove all cached files.

D.Change the S3 bucket's lifecycle policy to expire objects after 1 day.

AnswerC

Invalidation immediately removes cached objects, forcing CloudFront to fetch updated content.

Why this answer

Option A is correct because creating a CloudFront invalidation for the path '/*' removes all cached objects, so subsequent requests fetch the new content from the origin. Option B is wrong because disabling and re-enabling the distribution would cause downtime. Option C is wrong because changing the default TTL to 0 would require waiting for TTL to expire or invalidations.

Option D is wrong because S3 lifecycle policies do not affect cache.

Practice this question →

154

MCQmedium

A company runs a stateful web application on EC2 instances behind a Network Load Balancer. The application requires that client requests from a particular session are always sent to the same target instance. Which feature should the SysOps administrator configure on the NLB to meet this requirement?

A.Configure path-based routing rules

B.Enable health checks on the target group

C.Configure sticky sessions using a cookie

D.Enable cross-zone load balancing

AnswerC

Sticky sessions ensure client requests go to the same target.

Why this answer

Option B is correct because NLB supports sticky sessions (session affinity) using a cookie or source IP. Option A is wrong because cross-zone load balancing distributes traffic across AZs, not stickiness. Option C is wrong because NLB does not support path-based routing; that's ALB.

Option D is wrong because health checks do not affect stickiness.

Practice this question →

155

MCQeasy

A company wants to ensure that its S3 bucket is accessible only from a VPC. Which configuration should the SysOps Administrator implement?

A.Create an S3 VPC endpoint and attach a bucket policy that restricts access to that endpoint.

B.Configure a bucket policy that allows access from the public internet.

C.Make the bucket public and rely on IAM roles.

D.Attach a security group to the S3 bucket.

AnswerA

Ensures access only from within the VPC.

Why this answer

Option B is correct because S3 bucket policies can restrict access to a specific VPC or VPC endpoint. Option A is wrong because it does not restrict to a VPC. Option C is wrong because it grants public access.

Option D is wrong because S3 does not support security group associations.

Practice this question →

156

MCQhard

A company runs a critical microservices application on Amazon ECS with Fargate launch type. The application consists of several services that communicate via internal HTTP calls. The SysOps Administrator notices that during periods of increased load, some services become unresponsive and the health checks fail. The ECS service auto scaling is configured based on CPU utilization, but it does not scale quickly enough. The administrator needs to improve the reliability and responsiveness of the application. The services are stateless and can be scaled horizontally. The current architecture uses a single Application Load Balancer for each service. The ALB health checks are set to a 30-second interval with a 5-second timeout and 2 unhealthy thresholds. The administrator has observed that when a service instance becomes unhealthy, it takes too long for the ALB to stop sending traffic to it, causing errors. What should the SysOps Administrator do to improve the reliability and responsiveness of the application?

A.Increase the ALB health check interval to 60 seconds and unhealthy threshold to 5.

B.Configure the ALB health check to have a 5-second interval, 2-second timeout, and 2 unhealthy threshold.

C.Increase the ECS service auto scaling target CPU utilization to 90%.

D.Replace the ALB with a Network Load Balancer and use TCP health checks.

AnswerB

Faster health checks detect failures quickly and reduce error rates.

Why this answer

Option A is correct because reducing the health check interval and unhealthy threshold will detect failures faster and stop routing traffic sooner, improving responsiveness. Option B is wrong because increasing the interval and threshold would make detection slower. Option C is wrong because using a Network Load Balancer does not change health check behavior and may not support HTTP health checks.

Option D is wrong because increasing CPU threshold delays scaling, which would worsen the problem.

Practice this question →

157

MCQhard

A company runs a production workload on a fleet of EC2 instances in an Auto Scaling group (ASG). The ASG spans three Availability Zones. To avoid regional failure, the company wants to replicate the infrastructure in a second AWS Region and be able to fail over within 30 minutes. The application state is stored in an RDS MySQL database. What is the MOST cost-effective and reliable solution?

A.Create a cross-Region read replica of the RDS database. In the secondary Region, deploy a duplicate ASG and ALB. In a disaster, promote the read replica to a standalone instance and update Route 53 DNS.

B.Use an Application Load Balancer with cross-Region load balancing to distribute traffic to both Regions.

C.Use RDS Multi-AZ in both Regions and configure synchronous replication between them.

D.Take daily snapshots of the RDS database and copy them to the secondary Region. In the event of a failure, restore the latest snapshot.

AnswerA

Cross-Region read replicas provide near real-time replication and can be promoted in minutes.

Why this answer

RDS cross-Region read replicas can be promoted to a standalone instance in the secondary Region during failover, providing a recovery point objective (RPO) of seconds. Option A is wrong because snapshot restore takes longer than 30 minutes. Option C is wrong because synchronous replication is not possible cross-Region.

Option D is wrong because it does not address the database replication.

Practice this question →

158

MCQhard

A SysOps administrator runs the above command for an EC2 instance. The instance is running but the system status check is impaired. What does this indicate?

A.The instance is still running but the application is not responding.

B.The instance is unreachable due to a misconfigured security group.

C.There is a problem with the underlying physical host that requires stopping and starting the instance.

D.The operating system on the instance has crashed.

AnswerC

System status checks detect problems like loss of network connectivity, power loss, or hardware failure on the host.

Why this answer

Option C is correct because a system status check failure indicates a problem with the underlying physical host or AWS systems. Option A is wrong because instance status checks are separate. Option B is wrong because system status checks cover the host, not the OS.

Option D is wrong because the instance status is 'ok', so the OS is reachable.

Practice this question →

159

MCQmedium

An application uses an Amazon DynamoDB table with on-demand capacity. The SysOps administrator needs to ensure the table remains available during an AWS regional outage. Which strategy should be used?

A.Enable DynamoDB Accelerator (DAX).

B.Create a read replica in another region.

C.Use DynamoDB global tables.

D.Increase read and write capacity units.

AnswerC

Correct. Global tables replicate data across multiple AWS Regions and provide automatic failover for high availability.

Why this answer

DynamoDB global tables provide multi-region, multi-active replication, ensuring the table remains available during an AWS regional outage by automatically replicating data across selected AWS Regions. This is the only option that addresses regional fault tolerance by design, as it uses DynamoDB's built-in replication to maintain availability and data durability across regions.

Exam trap

The trap here is that candidates often confuse read replicas (an RDS concept) with DynamoDB's global tables, or assume that DAX or scaling capacity can provide regional resilience, when in fact only global tables offer multi-region active-active replication for DynamoDB.

How to eliminate wrong answers

Option A is wrong because DynamoDB Accelerator (DAX) is an in-memory cache that improves read performance but operates within a single region and does not provide any cross-region availability or disaster recovery. Option B is wrong because DynamoDB does not support read replicas in the traditional RDS sense; the correct multi-region replication feature is global tables, not read replicas. Option D is wrong because increasing read and write capacity units (even with on-demand scaling) only affects performance within a single region and cannot protect against a regional outage.

Practice this question →

160

MCQeasy

A company runs a stateless web application on EC2 instances in an Auto Scaling group. The application is deployed across multiple Availability Zones. The SysOps administrator wants to ensure that the application remains available even if an entire Availability Zone fails. What is the MOST effective way to achieve this?

A.Configure the Auto Scaling group to launch instances in at least two Availability Zones.

B.Create a CloudWatch alarm to reboot instances when they become unhealthy.

C.Use a single Availability Zone to reduce complexity.

D.Use a larger instance type to handle more traffic.

AnswerA

Distributing instances across AZs provides fault tolerance.

Why this answer

Configuring the Auto Scaling group to use multiple Availability Zones ensures that instances are distributed, so failure of one AZ does not affect all instances. Option A is wrong because it does not provide redundancy. Option B is wrong because it does not automatically recover.

Option D is wrong because a single AZ does not protect against AZ failure.

Practice this question →

161

Multi-Selectmedium

A company runs a web application on EC2 instances in an Auto Scaling group behind an ALB. The application uses an RDS MySQL database. The SysOps administrator needs to improve the reliability of the database layer. Which TWO actions should the administrator take? (Choose two.)

Select 2 answers

A.Enable Multi-AZ on the RDS instance.

B.Take a manual snapshot every hour.

C.Create a read replica in a different Region.

D.Configure automated backups with a retention period of 30 days.

E.Increase the DB instance class to the largest available.

AnswersA, D

Multi-AZ provides automatic failover to a standby in another AZ.

Why this answer

Multi-AZ provides automatic failover, and automated backups enable point-in-time recovery. Read replicas improve read performance but not high availability. Manual snapshots are not sufficient for automatic recovery.

Larger instance class improves performance, not reliability.

Practice this question →

162

MCQmedium

A company runs a critical web application on EC2 instances behind an Application Load Balancer (ALB) across three Availability Zones. The application stores session data in memory on the EC2 instances. During a deployment, a new version of the application is released by terminating and replacing instances. Users report that they are unexpectedly logged out during the deployment. What should a SysOps administrator do to improve the reliability of the application during deployments?

A.Reduce the deployment to a single Availability Zone to minimize instance churn.

B.Store session data in an RDS Multi-AZ database.

C.Enable sticky sessions (session affinity) on the ALB.

D.Use an ElastiCache cluster to store session state externally.

AnswerD

Externalizing session state to ElastiCache ensures sessions survive instance replacements.

Why this answer

Using an ElastiCache cluster for session state decouples session data from EC2 instances, so instance replacements do not cause session loss. Option A is wrong because sticky sessions still lose sessions when instances are terminated. Option C is wrong because RDS is not designed for transient session data.

Option D is wrong because it reduces reliability by removing AZs.

Practice this question →

163

Multi-Selecthard

A SysOps administrator is designing a highly available architecture for a web application using an Application Load Balancer and an Auto Scaling group across three Availability Zones. The application must be able to withstand the loss of an entire AZ. Which THREE components are necessary to meet this requirement? (Choose THREE.)

Select 3 answers

A.Use a single NAT gateway to provide internet access.

B.Launch EC2 instances in at least two Availability Zones.

C.Configure health checks on the ALB target group.

D.Use a cluster placement group for EC2 instances.

E.Enable cross-zone load balancing on the ALB.

AnswersB, C, E

Multi-AZ deployment is essential for AZ failure tolerance.

Why this answer

Options A, C, and E are correct. Option A: Multi-AZ deployment ensures instances are in multiple AZs. Option C: Cross-zone load balancing ensures ALB can route to instances in any AZ.

Option E: Health checks ensure only healthy instances receive traffic. Option B is wrong because a single NAT gateway in one AZ is a single point of failure. Option D is wrong because placement groups are for low latency within a single AZ, not high availability across AZs.

Practice this question →

164

Multi-Selectmedium

A company runs a stateless web application on EC2 instances behind an Application Load Balancer. The company wants to improve the application's availability and fault tolerance. Which TWO actions should the SysOps administrator take?

Select 2 answers

A.Configure Auto Scaling to maintain a minimum number of instances.

B.Use Amazon CloudFront as an origin for the ALB.

C.Deploy EC2 instances across multiple Availability Zones.

D.Disable termination protection on EC2 instances.

E.Use larger EC2 instance types.

AnswersA, C

Auto Scaling ensures capacity.

Why this answer

Options B and C are correct. Multi-AZ deployment ensures availability if one AZ fails, and Auto Scaling maintains desired capacity. Option A is wrong because larger instances do not improve fault tolerance.

Option D is wrong because CloudFront is for content delivery, not compute fault tolerance. Option E is wrong because stopping termination protection does not improve availability.

Practice this question →

165

MCQmedium

A company runs a web application on EC2 instances in an Auto Scaling group behind an Application Load Balancer. The application stores session data locally on each instance. During a traffic spike, the Auto Scaling group launches new instances, but users report that they are logged out and lose session data. Which solution addresses this issue without modifying the application?

A.Modify the application to use ElastiCache for session storage.

B.Enable sticky sessions (session affinity) on the Application Load Balancer.

C.Increase the cooldown period for the Auto Scaling group.

D.Use larger EC2 instance types to handle the traffic spike.

AnswerB

Sticky sessions route users to the same instance, maintaining session data during scaling.

Why this answer

Option D is correct because enabling sticky sessions on the ALB ensures that users are routed to the same instance, preserving session data even as instances are added or removed. Option A is wrong because increasing the cooldown period does not preserve sessions during scaling. Option B is wrong, while it offloads state to ElastiCache, it requires application changes.

Option C is wrong because a larger instance size reduces scaling frequency but does not prevent session loss when instances are terminated.

Practice this question →

166

MCQhard

A company has an Amazon RDS for PostgreSQL DB instance with Multi-AZ deployment in us-east-1. The SysOps administrator must design a disaster recovery strategy to recover from a regional outage. The Recovery Time Objective (RTO) is 1 hour and the Recovery Point Objective (RPO) is 5 minutes. Which solution meets these requirements at the lowest cost?

A.Create a Read Replica in a different region and promote it during a disaster.

B.Take daily snapshots and copy them to another region.

C.Use cross-region automated backups.

D.Deploy a second Multi-AZ DB instance in another region.

AnswerA

A cross-region Read Replica provides low replication lag (often <5 minutes) and can be promoted quickly, meeting both RTO and RPO at a lower cost than a full standby instance.

Why this answer

A cross-region Read Replica meets the RPO of 5 minutes because replication is continuous (asynchronous) with minimal lag, and the RTO of 1 hour is achievable by promoting the replica during a disaster. This is the lowest-cost option because it uses a single standby instance in another region without the overhead of a full Multi-AZ deployment or frequent snapshot transfers.

Exam trap

The trap here is that candidates confuse 'cross-region automated backups' (which do not exist as a native feature) with automated snapshot copying, or assume that daily snapshots can meet a 5-minute RPO by increasing snapshot frequency, ignoring the fundamental limitation of snapshot scheduling and transfer time.

How to eliminate wrong answers

Option B is wrong because daily snapshots cannot achieve an RPO of 5 minutes (snapshots are taken at most every 24 hours, and copying to another region adds latency). Option C is wrong because cross-region automated backups are not a native RDS feature; automated backups are region-specific and cannot be automatically copied to another region without manual or scripted snapshot copy operations. Option D is wrong because deploying a second Multi-AZ DB instance in another region incurs the cost of a full primary and standby pair, which is significantly more expensive than a single Read Replica, and does not provide a faster RTO/RPO than a promoted Read Replica.

Practice this question →

167

Multi-Selecthard

A company runs a web application on EC2 instances behind an Application Load Balancer. The instances are in an Auto Scaling group. The SysOps administrator wants to ensure that the application can handle a sudden increase in traffic without downtime. Which THREE actions should be taken?

Select 3 answers

A.Configure the Auto Scaling group to launch instances in multiple Availability Zones.

B.Configure a target tracking scaling policy based on the ALB's RequestCountPerTarget metric.

C.Configure the Auto Scaling group with a dynamic scaling policy.

D.Configure a scheduled scaling policy to add instances during known peak hours.

E.Use Spot Instances to reduce costs.

AnswersA, B, C

Correct: Multiple AZs provide high availability and fault tolerance.

Why this answer

Options B, C, and E are correct. Configuring the Auto Scaling group with a dynamic scaling policy (Option B) allows it to scale based on demand. Placing instances in multiple Availability Zones (Option C) provides high availability.

Configuring a target tracking scaling policy based on ALB RequestCountPerTarget (Option E) ensures the group scales based on traffic. Option A (scheduled scaling) is not reactive. Option D (spot instances) can be interrupted, causing potential downtime.

Practice this question →

168

MCQmedium

A SysOps administrator is designing a disaster recovery strategy for a critical application that runs on EC2 instances. The application data is stored on EBS volumes. The recovery point objective (RPO) is 15 minutes, and the recovery time objective (RTO) is 1 hour. Which solution meets these requirements MOST cost-effectively?

A.Use AWS CloudEndure to continuously replicate the EC2 instances to another Region.

B.Use AWS Backup to back up the application data to Amazon S3 every 15 minutes.

C.Take hourly AMIs of the instances and copy them to another Region.

D.Take EBS snapshots every 15 minutes and copy them to another Region using cross-region snapshot copy.

AnswerD

Meets RPO of 15 minutes and RTO of 1 hour cost-effectively.

Why this answer

Option D is correct because EBS snapshots can be taken every 15 minutes (or via Data Lifecycle Manager) and restored quickly to new volumes in another region. Option A is wrong because cross-region replication of EC2 instances is more expensive and complex. Option B is wrong because AMIs do not capture data changes frequently enough.

Option C is wrong because S3 is not suitable for live application data.

Practice this question →

169

MCQmedium

A SysOps administrator is troubleshooting an issue where an Auto Scaling group is not launching EC2 instances despite having a scaling policy that should trigger when CPU utilization exceeds 80%. The CloudWatch alarm shows that the metric is breaching the threshold, but no instances are launched. What is the most likely cause?

A.The scaling policy is incorrectly configured to use a simple scaling policy instead of a step scaling policy.

B.The health check grace period is too long.

C.The Auto Scaling group has reached its maximum size.

D.The CloudWatch alarm is in insufficient data state.

AnswerC

Max size prevents scaling out.

Why this answer

Option A is correct because if the Auto Scaling group has reached its maximum size, it cannot launch new instances. Option B is wrong because the alarm is breaching, so it's not in insufficient data. Option C is wrong because health check grace period affects instance health, not launching.

Option D is wrong because if the policy is properly configured and alarm triggers, instances should launch unless max size is reached.

Practice this question →

170

MCQmedium

A company has an AWS Lambda function that processes S3 events. The function is critical and must be available even if one Availability Zone fails. How can a SysOps administrator ensure high availability for the Lambda function?

A.Use an Application Load Balancer to distribute events to multiple Lambda functions.

B.No action is required; Lambda functions are inherently highly available within a region.

C.Configure the Lambda function to run in two subnets in different Availability Zones.

D.Deploy the Lambda function in two separate regions and use Route 53 failover.

AnswerB

Lambda runs across multiple AZs automatically.

Why this answer

Option D is correct because AWS Lambda automatically runs across multiple Availability Zones in a region; no additional configuration is needed. Option A is wrong because Lambda is inherently multi-AZ. Option B is wrong because Lambda functions are not tied to AZs.

Option C is wrong because the Lambda service itself handles availability.

Practice this question →

171

MCQmedium

A company runs a critical database on an Amazon RDS for MySQL DB instance. The SysOps administrator needs to ensure that the database can survive a single Availability Zone failure with minimal downtime. Which configuration should the administrator implement?

A.Enable automatic backups.

B.Deploy a read replica in a different AZ.

C.Enable Multi-AZ deployment.

D.Take a manual snapshot and copy it to another AZ.

AnswerC

Correct. Multi-AZ provides a synchronous standby in another AZ and automatic failover, minimizing downtime during an AZ failure.

Why this answer

Multi-AZ deployment for Amazon RDS automatically provisions and maintains a synchronous standby replica in a different Availability Zone. If the primary AZ fails, Amazon RDS automatically fails over to the standby, typically within 60–120 seconds, minimizing downtime without manual intervention. This meets the requirement for surviving a single AZ failure with minimal downtime.

Exam trap

The trap here is that candidates often confuse read replicas (which require manual promotion and are for read scaling) with Multi-AZ (which provides automatic failover for high availability), leading them to select Option B incorrectly.

How to eliminate wrong answers

Option A is wrong because automatic backups only provide point-in-time recovery to restore data to a new instance, not automatic failover or high availability during an AZ failure. Option B is wrong because a read replica in a different AZ is designed for read scaling and can be promoted to a primary, but promotion is a manual process that takes time and does not provide automatic failover with minimal downtime. Option D is wrong because taking a manual snapshot and copying it to another AZ requires manual restoration steps, which results in significant downtime and does not provide automatic failover.

Practice this question →

172

MCQhard

A company runs a stateful application on a single Amazon EC2 instance with a 200 GB EBS volume. The application data changes frequently. The SysOps administrator needs a disaster recovery (DR) plan with a Recovery Point Objective (RPO) of 1 hour and a Recovery Time Objective (RTO) of 15 minutes. The DR region is us-west-2. Which combination of steps should the administrator implement?

A.Take hourly EBS snapshots and copy them to us-west-2. In a disaster, launch a new instance from the snapshot and attach it.

B.Use Amazon Data Lifecycle Manager (DLM) to take snapshots every hour and use AWS Backup to copy them to us-west-2. In a disaster, restore the snapshot as a volume and attach to a new instance in us-west-2.

C.Enable Amazon S3 Cross-Region Replication for the EBS volume's snapshot repository.

D.Attach an additional EBS volume configured with RAID 1, and use rsync to replicate data every hour to a volume in us-west-2.

AnswerB

DLM automates hourly snapshot creation, and AWS Backup can automate cross-region copy. This meets RPO of 1 hour. With a pre-prepared instance, RTO of 15 minutes is achievable.

Why this answer

Option B is correct because AWS Backup can copy EBS snapshots across regions, meeting the RPO of 1 hour via hourly snapshots, and restoring a volume from a snapshot in us-west-2 and attaching it to a new instance can achieve the RTO of 15 minutes. Amazon Data Lifecycle Manager (DLM) automates the snapshot creation, while AWS Backup handles cross-region copy, ensuring the DR region has the latest data within the RPO window.

Exam trap

The trap here is that candidates may assume EBS snapshots can be directly replicated across regions using S3 Cross-Region Replication, but snapshots are managed through EC2 and require explicit cross-region copy actions via AWS Backup or the EC2 API.

How to eliminate wrong answers

Option A is wrong because while hourly EBS snapshots can be taken and copied to us-west-2, the process of copying snapshots manually or via scripting is not automated and reliable for meeting the RPO/RTO as described; AWS Backup provides managed cross-region copy with lifecycle policies, which is more robust. Option C is wrong because Amazon S3 Cross-Region Replication applies to S3 buckets, not to EBS snapshots; EBS snapshots are stored in S3 but are managed via the EC2 snapshot API, and S3 replication cannot be directly enabled for the snapshot repository. Option D is wrong because using rsync to replicate data every hour to a volume in us-west-2 would require a separate EC2 instance in the DR region to receive the data, and RAID 1 on the source instance does not provide cross-region replication; this approach also introduces complexity and potential latency that may not meet the 15-minute RTO.

Practice this question →

173

MCQeasy

A SysOps administrator is designing a backup strategy for an Amazon EFS file system. The file system stores critical data that must be recoverable within 15 minutes of a failure. Which solution meets these requirements?

A.Use AWS Backup to create automated backups of the EFS file system and restore to a new file system if needed.

B.Configure EFS lifecycle management to move files to Infrequent Access storage class.

C.Take periodic EBS snapshots of the EC2 instance that mounts the EFS volume.

D.Create a Lambda function that copies files to an S3 bucket every hour.

AnswerA

AWS Backup supports EFS and allows quick restore.

Why this answer

Option A is correct because EFS-to-EFS replication using AWS Backup allows point-in-time recovery and meets the 15-minute RTO. Option B is wrong because lifecycle policies do not create backups. Option C is wrong because Lambda functions must be custom-built.

Option D is wrong because EBS snapshots cannot be used for EFS directly.

Practice this question →

174

MCQmedium

A SysOps administrator is designing a disaster recovery plan for a critical RDS MySQL database. The database must be available with a Recovery Point Objective (RPO) of less than 1 hour and a Recovery Time Objective (RTO) of less than 2 hours. The primary region is us-east-1. Which solution meets these requirements?

A.Enable Multi-AZ deployment in the primary region

B.Enable automated backups and restore to a new region when needed

C.Take daily manual snapshots and copy them to another region

D.Create a cross-Region read replica in us-west-2 and promote it during a disaster

AnswerD

Replica replication is fast and promotion meets RTO.

Why this answer

Correct answer is C. Cross-Region read replicas provide asynchronous replication with RPO typically seconds to minutes, and promoting a replica can be done in minutes, meeting both RPO and RTO. Option A is wrong because automated backups have RPO of up to 5 minutes but restoring in another region takes longer than 2 hours.

Option B is wrong because Multi-AZ is for high availability within a region, not cross-region DR. Option D is wrong because manual snapshots require manual intervention and restore time is unpredictable.

Practice this question →

175

Multi-Selectmedium

A company is designing a highly available architecture for a web application. The application uses an Application Load Balancer (ALB) and an Auto Scaling group of EC2 instances. Which TWO steps should the company take to ensure the architecture is resilient to an Availability Zone failure? (Select TWO.)

Select 2 answers

A.Set the Auto Scaling group's desired capacity to a high number.

B.Create a CloudWatch alarm that triggers if the ALB has elevated 5xx errors.

C.Configure the Auto Scaling group to launch instances in at least two Availability Zones.

D.Use a single EC2 instance type for all instances.

E.Configure the ALB to be internet-facing and enable cross-zone load balancing.

AnswersC, E

Distributing instances across AZs ensures availability if one AZ fails.

Why this answer

Options A and C are correct. Option A ensures the ALB can route traffic to targets in other AZs. Option C ensures EC2 instances are distributed across AZs.

Option B is wrong because a single instance type does not improve availability. Option D is wrong because increasing min size in one AZ does not protect against AZ failure. Option E is wrong because CloudWatch alarms do not provide AZ redundancy.

Practice this question →

176

Multi-Selecteasy

A company wants to ensure that its Amazon S3 data is resilient to an accidental deletion of an entire bucket. Which two actions should the company take? (Choose TWO.)

Select 2 answers

A.Configure Cross-Region Replication to replicate objects to a bucket in another AWS Region.

B.Configure a lifecycle policy to transition objects to S3 Glacier after 30 days.

C.Enable MFA Delete on the bucket.

D.Enable S3 Versioning on the bucket.

E.Enable S3 Transfer Acceleration on the bucket.

AnswersA, D

CRR provides a copy of data in a different region, protecting against region-wide failures.

Why this answer

Options A and D are correct. Cross-Region Replication (A) copies objects to another bucket, ensuring data survives regional issues. S3 Versioning (D) allows recovery of deleted objects within the same bucket.

Option B is wrong because MFA Delete requires multi-factor authentication to delete objects, but does not protect against bucket deletion. Option C is wrong because lifecycle policies delete objects, not protect them. Option E is wrong because Transfer Acceleration is for speed, not durability.

Practice this question →

177

MCQmedium

A company has a production AWS account with a single VPC and multiple subnets across two Availability Zones. The company hosts a web application on EC2 instances in an Auto Scaling group. The application uses an Amazon Aurora MySQL database cluster with one writer and two reader instances in the same VPC. The SysOps administrator configured AWS CloudTrail to log API calls and Amazon CloudWatch alarms for operational monitoring. After a recent network partition event in one Availability Zone, the application became unavailable for several minutes. The administrator wants to improve the application's resilience to such events without changing the database cluster configuration. The administrator has budget for additional resources but wants to minimize costs. What should the administrator do?

A.Enable Multi-AZ for the Aurora cluster, which is already enabled by default. Additionally, increase the instance size of the writer instance.

B.Configure a cross-region read replica for the Aurora cluster and promote it to primary if the primary AZ fails.

C.Create a second VPC in a different AWS Region and set up a second Aurora cluster. Use Route 53 with failover routing to direct traffic to the secondary region if the primary fails.

D.Ensure the Auto Scaling group launches instances in both Availability Zones. Configure the Application Load Balancer to be cross-zone load balancing enabled.

AnswerD

This distributes traffic across AZs, so if one AZ fails, the ALB routes to the other AZ's instances.

Why this answer

Option C is correct because distributing EC2 instances across multiple AZs and configuring the ALB as cross-zone enabled ensures that traffic is routed to healthy instances in any AZ. This provides high availability for the application tier without changing the database. Option A is wrong because adding a second VPC with its own database cluster would be expensive and complex.

Option B is wrong because creating a read replica in another region does not help with AZ failure within the same region. Option D is wrong because Multi-AZ for Aurora is already inherent; the database cluster already has instances in multiple AZs by design.

Practice this question →

178

Multi-Selecthard

A company wants to back up its on-premises file servers to AWS for disaster recovery. The data changes frequently, and the company needs to minimize data loss. Which THREE steps should the company take? (Select THREE.)

Select 3 answers

A.Use AWS Backup to create backup plans for the Storage Gateway.

B.Use S3 Transfer Acceleration for uploads.

C.Configure the file gateway to cache data locally for frequently accessed files.

D.Set up S3 Cross-Region Replication from the backup bucket.

E.Deploy an AWS Storage Gateway file gateway on-premises.

AnswersA, C, E

AWS Backup automates backups of gateway data to S3.

Why this answer

Options A, C, and E are correct. AWS Storage Gateway with cached volumes stores data in AWS and caches frequently accessed data locally. Option B is wrong because S3 Transfer Acceleration is for upload speed, not backup.

Option D is wrong because S3 Replication is for copying between buckets, not for initial backup.

Practice this question →

179

MCQeasy

A company is using Amazon S3 to store critical data with versioning enabled. The SysOps administrator needs to implement a solution that automatically transitions objects to S3 Glacier Deep Archive after 90 days and permanently deletes them after 7 years. Which S3 feature should be used?

A.S3 Lifecycle policies

B.S3 Intelligent-Tiering

C.S3 Object Lock

D.S3 Cross-Region Replication

AnswerA

Lifecycle policies can transition and expire objects.

Why this answer

Option C is correct because S3 Lifecycle policies can transition objects to different storage classes and expire (delete) objects after a specified number of days. Option A is wrong because S3 Object Lock is for write-once-read-many (WORM) protection. Option B is wrong because S3 Intelligent-Tiering moves data between access tiers but does not delete.

Option D is wrong because S3 Replication replicates objects, not lifecycle management.

Practice this question →

180

Multi-Selectmedium

A company runs a critical application on EC2 instances in an Auto Scaling group. The group uses a dynamic scaling policy based on CPU utilization. The SysOps administrator wants to ensure that the application remains available during a planned maintenance event that will take down one of the Availability Zones. Which TWO actions should the administrator take? (Choose two.)

Select 2 answers

A.Update the Auto Scaling group to remove the affected Availability Zone from the list of enabled AZs.

B.Manually terminate all instances in the affected Availability Zone.

C.Increase the desired capacity of the Auto Scaling group to account for the lost capacity.

D.Create a new launch configuration with a different AMI.

E.Disable the dynamic scaling policy to prevent scaling.

AnswersA, C

Removing the AZ prevents the ASG from launching instances in the affected AZ.

Why this answer

Updating the ASG to use only the remaining AZs and rebalancing ensures capacity is maintained. Increasing desired capacity preemptively provides a buffer. Terminating instances prematurely reduces capacity.

Updating the AMI does not help with AZ maintenance.

Practice this question →

181

MCQmedium

A SysOps administrator is testing the failover of an Amazon RDS for PostgreSQL Multi-AZ DB instance. The application currently writes to the primary instance in us-east-1a. Which action will manually trigger a failover to the standby instance in us-east-1b?

A.Reboot the DB instance and select 'Reboot with failover'.

B.Modify the DB instance to Single-AZ and then back to Multi-AZ.

C.Reboot the DB instance without selecting any failover option.

D.Promote the standby instance using the Amazon RDS console.

AnswerA

Correct. This explicitly triggers a failover to the standby instance.

Why this answer

Option A is correct because the 'Reboot with failover' option in the Amazon RDS console explicitly triggers a failover by rebooting the primary DB instance and forcing the Multi-AZ configuration to promote the standby instance in us-east-1b to become the new primary. This is the designed method for manually testing or initiating a failover in a Multi-AZ deployment.

Exam trap

The trap here is that candidates confuse Amazon RDS Multi-AZ failover with Amazon Aurora's reader promotion, where you can explicitly promote a read replica to primary, leading them to incorrectly select Option D.

How to eliminate wrong answers

Option B is wrong because modifying the DB instance to Single-AZ and back to Multi-AZ would delete the standby instance and then create a new one, which is not a failover but a reconfiguration that causes downtime and does not test the existing standby. Option C is wrong because rebooting the DB instance without selecting 'Reboot with failover' will simply restart the primary instance without promoting the standby, so no failover occurs. Option D is wrong because Amazon RDS does not support manually promoting a standby instance via the console; the standby is not directly accessible and failover is controlled only through the primary instance's reboot with failover option or an automatic failure.

Practice this question →

182

MCQhard

A company runs a critical MySQL database on an Amazon RDS DB instance in a single Availability Zone. The SysOps administrator needs to implement a disaster recovery solution with a Recovery Point Objective (RPO) of 5 minutes and a Recovery Time Objective (RTO) of 1 hour, while minimizing costs. Which solution meets these requirements?

A.Enable Multi-AZ deployment with a synchronous standby replica in another Availability Zone

B.Create a cross-Region read replica and promote it to a standalone DB instance during a disaster

C.Enable cross-Region automated backups to another Region

D.Take daily automated snapshots and copy them to another Region manually

AnswerC

Cross-Region automated backups replicate snapshots and transaction logs to another region, achieving an RPO of about 5 minutes and RTO of under 1 hour, with no extra compute cost until recovery.

Why this answer

Option C is correct because cross-Region automated backups replicate transaction logs to another AWS Region with a typical lag of a few minutes, enabling point-in-time recovery (PITR) that can meet an RPO of 5 minutes. When a disaster occurs, you can restore the automated backup to a new DB instance in the destination Region, and the RTO depends on the restore time, which can be under 1 hour for a properly sized instance. This solution minimizes costs by avoiding the continuous compute and storage overhead of a standby replica or read replica.

Exam trap

The trap here is that candidates often confuse cross-Region read replicas (asynchronous, higher RPO) with cross-Region automated backups (log-based, lower RPO), or assume Multi-AZ provides cross-Region disaster recovery when it only covers AZ failures within a single Region.

How to eliminate wrong answers

Option A is wrong because Multi-AZ with a synchronous standby replica only protects against an Availability Zone failure within the same Region, not a cross-Region disaster, and it incurs the cost of a full standby instance. Option B is wrong because a cross-Region read replica is asynchronous and can have replication lag exceeding 5 minutes, making it unable to guarantee an RPO of 5 minutes; additionally, promoting a read replica to a standalone instance can take longer than 1 hour due to the need to stop replication and apply pending changes. Option D is wrong because daily automated snapshots provide an RPO of up to 24 hours, far exceeding the required 5-minute RPO, and manual copying adds operational overhead and delay.

Practice this question →

183

MCQeasy

A company runs a critical web application on a single Amazon EC2 instance with a 100 GiB gp2 EBS volume. The SysOps administrator needs to ensure data durability by taking automated snapshots of the root volume every hour. The snapshots should be retained for 7 days. Which AWS service can be used to automate this task with minimal configuration?

A.Amazon Data Lifecycle Manager (DLM)

B.AWS Backup

C.Amazon CloudWatch Events

D.AWS Systems Manager

AnswerA

DLM provides built-in automation for creating and retaining EBS snapshots according to a schedule, making it the ideal choice.

Why this answer

Amazon Data Lifecycle Manager (DLM) is the correct choice because it is specifically designed to automate the creation, retention, and deletion of EBS snapshots with minimal configuration. It supports custom schedules (e.g., every hour) and retention policies (e.g., 7 days) directly on EBS volumes, making it ideal for this task without requiring additional scripting or infrastructure.

Exam trap

The trap here is that candidates often choose AWS Backup because it is a centralized backup service, but they overlook that DLM is the simpler, purpose-built service for EBS snapshot lifecycle automation with minimal configuration, especially for a single volume and a straightforward retention policy.

How to eliminate wrong answers

Option B (AWS Backup) is wrong because while it can automate EBS snapshots, it requires setting up a backup plan and vault, which introduces unnecessary overhead for a simple hourly snapshot retention policy that DLM handles natively. Option C (Amazon CloudWatch Events) is wrong because it can trigger a Lambda function or Systems Manager Automation to create snapshots, but it does not natively manage snapshot retention or deletion, requiring custom code and additional configuration. Option D (AWS Systems Manager) is wrong because it is primarily for operational management (e.g., patching, inventory) and does not provide a built-in, automated snapshot lifecycle management feature; any snapshot automation would require custom Automation documents or scripts.

Practice this question →

184

MCQmedium

A company runs a critical application on a single Amazon EC2 instance. The SysOps administrator needs to ensure that if the instance fails, a new instance is automatically provisioned in a different Availability Zone. Which configuration should the administrator implement?

A.Create an Auto Scaling group with the instance in multiple Availability Zones

B.Create a placement group and launch the instance in it

C.Place the instance behind an Elastic Load Balancer

D.Configure an Amazon Route 53 health check with failover routing

AnswerA

An Auto Scaling group with instances in multiple AZs and a desired capacity of 1 will automatically launch a new instance in another AZ if the existing one fails.

Why this answer

Option A is correct because an Auto Scaling group (ASG) can be configured with a minimum, desired, and maximum size of 1, spanning multiple Availability Zones (AZs). When the EC2 instance fails, the ASG health check replacement mechanism automatically terminates the unhealthy instance and launches a new one in a different AZ, ensuring the application remains available across AZ boundaries without manual intervention.

Exam trap

The trap here is that candidates often confuse the health-check and failover capabilities of Route 53 or ELB with the automatic instance provisioning provided by Auto Scaling groups, mistakenly thinking DNS or load balancer health checks alone can replace a failed instance.

How to eliminate wrong answers

Option B is wrong because a placement group is designed to influence the physical placement of instances for low-latency or high-throughput networking (e.g., cluster, spread, or partition groups), but it does not provide automatic instance replacement or cross-AZ failover. Option C is wrong because an Elastic Load Balancer (ELB) distributes traffic across healthy instances but does not automatically provision a new instance when an existing one fails; it only routes traffic away from unhealthy targets. Option D is wrong because an Amazon Route 53 health check with failover routing can redirect DNS traffic to a different endpoint (e.g., a static website or another resource) but does not automatically launch a new EC2 instance; it requires a pre-provisioned secondary resource.

Practice this question →

185

MCQhard

A company runs a containerized application on Amazon ECS with the Fargate launch type. The application is deployed across two Availability Zones and uses an Application Load Balancer (ALB) to distribute traffic. The application stores state in an Amazon ElastiCache Redis cluster. Recently, the development team deployed a new version of the application that introduced a memory leak. The leak causes the ECS tasks to consume more memory over time until they are OOM-killed. The SysOps administrator is asked to ensure that the application remains available and responsive. The administrator has configured an ECS service auto scaling based on average CPU utilization, but the memory leak causes tasks to fail before CPU utilization triggers scaling. Which action should the administrator take to automatically detect and replace failing tasks?

A.Increase the task memory limit to the maximum allowed for the Fargate task size. This will prevent OOM-kills.

B.Configure an ECS service auto scaling with a target tracking policy based on average CPU utilization. Set the target value to 50%.

C.Configure the ALB health check to use a more sensitive path that fails when memory is low. The ECS service will automatically replace unhealthy tasks.

D.Create a custom CloudWatch metric for memory utilization and configure an ECS service auto scaling with a step scaling policy that adds tasks when memory exceeds a threshold.

AnswerD

This proactively scales out before tasks fail, maintaining availability.

Why this answer

Option B is correct because configuring an ECS service auto scaling with a step scaling policy based on a custom CloudWatch metric for memory utilization (e.g., from CloudWatch agent) allows scaling out before tasks are OOM-killed. Option A is wrong because increasing task memory limit does not prevent the leak; it only delays failure. Option C is wrong because using a target tracking policy based on CPU does not address memory issues.

Option D is wrong because an ALB health check only detects HTTP failures, not memory exhaustion if the process is still responding.

Practice this question →

186

MCQhard

A company's S3 bucket contains critical data. The bucket policy accidentally allowed public write access, and a malicious actor uploaded several objects. The company needs to recover the bucket to a known good state as quickly as possible. What should the SysOps administrator do?

A.Enable S3 Object Lock on the bucket to prevent further modifications.

B.Enable MFA Delete on the bucket to secure delete operations.

C.Use S3 Versioning to restore the bucket to a previous version.

D.Configure S3 Cross-Region Replication to replicate data to another region.

AnswerC

Versioning retains old versions for restoration.

Why this answer

Option B is correct because S3 Versioning, when enabled, retains all object versions, allowing restoration to a previous state by deleting the current version. Option A is wrong because S3 Object Lock prevents deletion but does not allow rollback to a prior version. Option C is wrong because MFA Delete does not help with restoring prior versions.

Option D is wrong because S3 Cross-Region Replication does not provide version rollback capability.

Practice this question →

187

MCQmedium

Regulatory requirements mandate that all RDS and EBS backups are replicated to a secondary AWS region within 24 hours of creation. The company has workloads in us-east-1 and must replicate backups to eu-west-1. Restoring from the secondary region must be possible without manual copying steps during a disaster. What service and configuration implements this requirement?

A.Create an AWS Backup plan with a cross-Region copy rule that replicates recovery points to a backup vault in eu-west-1 within 24 hours

B.Schedule a Lambda function that calls CreateDBSnapshot and CopyDBSnapshot to replicate RDS snapshots, and CreateSnapshot and CopySnapshot for EBS volumes to eu-west-1

C.Enable RDS automated backups with cross-region replication and configure EBS snapshot copy separately using Data Lifecycle Manager

D.Use S3 Cross-Region Replication to replicate the backup bucket containing RDS and EBS snapshots to eu-west-1

AnswerA

AWS Backup's cross-Region copy rule runs automatically after each successful backup job. The copy is encrypted with the destination vault's KMS key. In a disaster, operators restore directly from the eu-west-1 vault — no manual cross-region data transfer is needed. A single backup plan can cover multiple resource types (RDS and EBS), satisfying the consolidated requirement.

Why this answer

AWS Backup is the correct service because it natively supports cross-Region copy rules that automatically replicate recovery points (including RDS snapshots and EBS snapshots) to a backup vault in a secondary Region within a specified time window. This meets the 24-hour replication requirement and enables direct restores from the secondary Region without manual copying, as the backup vault in eu-west-1 contains the replicated recovery points ready for use.

Exam trap

The trap here is that candidates often assume they need to use separate services (like Lambda or DLM) for each resource type, missing that AWS Backup provides a unified, managed solution that handles both RDS and EBS snapshots with cross-Region replication and direct restore capabilities.

How to eliminate wrong answers

Option B is wrong because while a Lambda function could technically replicate snapshots, it requires custom code, error handling, and scheduling, and does not provide the native, managed cross-Region restore capability without manual steps; it also lacks the built-in compliance tracking of AWS Backup. Option C is wrong because RDS automated backups with cross-Region replication only apply to RDS, not EBS volumes, and Data Lifecycle Manager (DLM) for EBS snapshots does not support cross-Region copy natively; DLM only copies within the same Region, so EBS snapshots would not be replicated to eu-west-1. Option D is wrong because S3 Cross-Region Replication replicates objects in an S3 bucket, but RDS and EBS snapshots are not stored as S3 objects by default; they are stored in AWS-managed snapshot storage, and even if you manually copy snapshots to S3, the replication would not create usable snapshots in the secondary Region for direct restore.

Practice this question →

188

Multi-Selecteasy

A company wants to protect its data in Amazon S3 from accidental deletion. Which TWO methods should the SysOps administrator use? (Choose TWO.)

Select 2 answers

A.Set up cross-Region replication.

B.Enable S3 Transfer Acceleration.

C.Configure S3 event notifications.

D.Enable S3 Versioning on the bucket.

E.Enable MFA Delete on the bucket.

AnswersD, E

Preserves previous versions.

Why this answer

Options A and C are correct. S3 Versioning preserves previous versions of objects, allowing recovery from accidental deletes. MFA Delete adds an extra layer of protection.

Option B is wrong because S3 Transfer Acceleration is for upload speed. Option D is wrong because S3 cross-region replication does not prevent deletion; it replicates objects. Option E is wrong because S3 event notifications do not prevent deletion.

Practice this question →

189

MCQhard

A company runs a stateful application on EC2 instances behind a Network Load Balancer (NLB). The application uses sticky sessions (session affinity) to maintain client state. During a deployment, the SysOps administrator needs to replace instances without disrupting active sessions. Which approach should be used?

A.Stop the NLB, replace instances, and restart the NLB

B.Deregister the old instances from the target group with connection draining enabled, then register new instances

C.Update the target group health check to remove old instances faster

D.Terminate the old instances immediately and launch new ones

AnswerB

Connection draining allows existing sessions to finish.

Why this answer

Correct answer is A. By deregistering instances with a connection draining timeout, the NLB will stop sending new traffic but allow existing connections to complete. Then new instances can be registered.

Option B is wrong because terminating instances immediately will drop active sessions. Option C is wrong because stopping the NLB would affect all traffic. Option D is wrong because sticky sessions are not configured on target group health checks; health checks are separate.

Practice this question →

190

MCQmedium

An organization is using AWS CloudFormation to deploy infrastructure. The SysOps administrator needs to ensure that if a stack update fails, the stack automatically rolls back to the last known good state. Which stack update option should be configured?

A.Disable rollback

B.Change sets

C.Rollback on failure

D.Stack policy

AnswerC

This setting ensures automatic rollback to the previous state on update failure.

Why this answer

Option A is correct because the Rollback on failure setting causes CloudFormation to automatically revert to the previous state if the update fails. Option B is incorrect because Disable rollback is the opposite. Option C is incorrect because Stack policy controls updates to specific resources, not rollback behavior.

Option D is incorrect because Change sets allow preview of changes but do not control automatic rollback.

Practice this question →

191

MCQeasy

A SysOps administrator deploys the above CloudFormation template. The stack creation fails with an error. What is the most likely reason?

A.The EBS volume must specify a SnapshotId.

B.The template uses a deprecated AWSTemplateFormatVersion.

C.The instance and volume are in different Availability Zones.

D.The VolumeAttachment resource is missing the Device property.

AnswerD

The Device property (e.g., /dev/sdh) is required when attaching a volume.

Why this answer

Option B is correct because the VolumeAttachment resource requires the Device property, which is missing. Option A is wrong because AWS::EC2::Volume does not require a snapshot. Option C is wrong because the format is correct.

Option D is wrong because the AvailabilityZone is specified for both the instance and volume.

Practice this question →

192

MCQmedium

A company runs a critical application on EC2 instances in an Auto Scaling group. The application processes messages from an Amazon SQS queue. The SysOps administrator notices that during periods of high load, the SQS queue depth increases significantly, and the application takes a long time to recover. The administrator wants to improve the application's ability to handle spikes in traffic without over-provisioning resources. The application is stateless and can scale horizontally. What should the administrator do?

A.Change the SQS queue from standard to FIFO to ensure messages are processed in order.

B.Configure an auto scaling policy for the Auto Scaling group based on the SQS queue depth (ApproximateNumberOfMessagesVisible).

C.Use a larger EC2 instance type with enhanced networking to process messages faster.

D.Increase the EC2 instance size to a larger type with more CPU and memory.

AnswerB

This allows the number of instances to scale with the workload, handling spikes efficiently.

Why this answer

Option B is correct because configuring auto scaling based on the SQS queue depth (e.g., ApproximateNumberOfMessagesVisible) allows the Auto Scaling group to scale out proactively as the queue grows and scale in as it shrinks. Option A is wrong because increasing instance size (vertical scaling) is less elastic than horizontal scaling. Option C is wrong because using a larger instance type with more vCPUs still has limits.

Option D is wrong because changing to a FIFO queue does not affect throughput scaling; FIFO queues have lower throughput.

Practice this question →

193

MCQeasy

A company is designing a highly available web application using an Application Load Balancer (ALB) with EC2 instances in an Auto Scaling group across two Availability Zones. Which configuration ensures that the application remains available if one Availability Zone fails?

A.Use a single large EC2 instance instead of multiple instances

B.Disable health checks on the ALB to avoid false positives

C.Configure the Auto Scaling group to launch instances in at least two Availability Zones

D.Launch all EC2 instances in a single Availability Zone

AnswerC

Distributes instances across AZs for high availability.

Why this answer

Correct answer is B. Distributing instances across multiple Availability Zones ensures that a failure in one zone does not affect the other. Option A is wrong because placing all instances in a single AZ creates a single point of failure.

Option C is wrong because using a single instance is not highly available. Option D is wrong because disabling health checks would prevent the ALB from routing traffic away from unhealthy instances.

Practice this question →

194

MCQeasy

A company uses Amazon S3 to store sensitive financial documents. The company's compliance team requires that all objects be encrypted at rest using server-side encryption with AWS KMS managed keys (SSE-KMS). Additionally, the compliance team requires that if an object is not accessed for 90 days, it should be automatically moved to Amazon S3 Glacier to reduce costs. The SysOps administrator is tasked with implementing these requirements. The administrator creates an S3 bucket and enables default encryption with SSE-KMS. Then, the administrator creates a lifecycle policy with a transition action to Glacier after 90 days. During testing, the administrator notices that objects uploaded to the bucket are not being transitioned to Glacier after 90 days. What is the most likely cause of this issue?

A.The bucket has S3 Versioning enabled, and the lifecycle policy does not apply to previous versions.

B.The bucket's default encryption setting overwrites the lifecycle policy.

C.The bucket has S3 Object Lock enabled, which prevents objects from being transitioned to Glacier.

D.The lifecycle policy was created after the test objects were uploaded, and the policy does not apply to existing objects.

AnswerD

Lifecycle policies apply to newly uploaded objects by default. To apply to existing objects, the policy must be configured with a filter that includes all objects.

Why this answer

Option D is correct because lifecycle policies only apply to objects that are uploaded after the policy is created, unless the policy is applied to the entire bucket and includes existing objects. If the lifecycle policy was created after the test objects were uploaded, those objects are not affected unless the policy is configured to cover existing objects. Option A is wrong because SSE-KMS does not prevent lifecycle transitions.

Option B is wrong because S3 Versioning does not block transitions. Option C is wrong because S3 Object Lock can prevent deletion but not transitions to Glacier.

Practice this question →

195

MCQmedium

A company is running a critical application on EC2 instances in an Auto Scaling group. The application experiences occasional CPU spikes. The SysOps administrator needs to configure a scaling policy that reacts quickly to increased load but avoids unnecessary scaling actions due to short bursts. Which scaling policy type should be used?

A.Manual scaling

B.Scheduled scaling policy

C.Simple scaling policy

D.Target tracking scaling policy with a CPU utilization target of 70%

AnswerD

Target tracking adjusts capacity to maintain target, with built-in cooldown to avoid oscillation.

Why this answer

Correct answer is B. A target tracking policy with a higher CPU target (e.g., 70%) and a warm-up time can handle spikes smoothly. Step scaling can also be used but requires more configuration.

Option A is wrong because simple scaling has cooldown periods that can react slowly. Option C is wrong because scheduled scaling is for predictable load, not spikes. Option D is wrong because manual scaling is not automated.

Practice this question →

196

MCQeasy

A company uses AWS CloudTrail to log API activity. The SysOps administrator needs to ensure that log files are protected from accidental deletion and are available for compliance audits for at least 7 years. Which service should be used to meet these requirements?

A.Enable S3 Object Lock in Compliance mode.

B.Enable S3 Versioning on the CloudTrail S3 bucket.

C.Move CloudTrail logs to Amazon S3 Glacier after 90 days.

D.Store logs in Amazon CloudWatch Logs with an expiration policy of 7 years.

AnswerA

Compliance mode prevents any user from deleting or overwriting objects for the retention period.

Why this answer

Option B is correct because S3 Object Lock with Compliance mode prevents deletion and ensures immutability. Option A is incorrect because versioning alone does not prevent deletion. Option C is incorrect because Glacier has retrieval delays and no write-once-read-many (WORM) protection.

Option D is incorrect because CloudWatch Logs can store logs but does not provide WORM protection natively.

Practice this question →

197

MCQeasy

A SysOps administrator is setting up a backup plan for an RDS MySQL database. The database is 500 GB in size and is used for a critical application. The company requires a Recovery Point Objective (RPO) of 5 minutes and a Recovery Time Objective (RTO) of 1 hour. Which solution meets these requirements?

A.Deploy the RDS instance in a Multi-AZ configuration with automatic failover.

B.Use AWS Backup to copy snapshots to a different AWS Region.

C.Take manual snapshots every 5 minutes and store them in Amazon S3.

D.Configure a cross-region read replica and promote it during failover.

AnswerA

Multi-AZ provides synchronous standby and automatic failover, meeting both RPO and RTO targets.

Why this answer

Option A is correct because Multi-AZ with automatic failover provides a low RTO (typically under 1 minute) and synchronous replication ensures RPO of seconds. Option B is wrong because manual snapshots have higher RPO and RTO. Option C is wrong because cross-region read replicas are asynchronous and may have higher RPO.

Option D is wrong because S3 backups would have high RTO.

Practice this question →

198

MCQmedium

A company runs a web application on EC2 instances behind an Application Load Balancer. The application stores session data in a DynamoDB table. To improve reliability, the SysOps administrator needs to ensure that session data is preserved if an EC2 instance fails. Which action should the administrator take?

A.Enable sticky sessions (session affinity) on the ALB.

B.Mount the same EBS volume to all EC2 instances to share session data.

C.Store session data in an ElastiCache Memcached cluster.

D.Use DynamoDB to store session data, making it accessible from any instance.

AnswerD

DynamoDB is a durable, external session store that survives instance failure.

Why this answer

Option A is correct because storing session data externally (e.g., in DynamoDB) ensures it survives instance failure. Option B is incorrect because EBS volumes are tied to a single AZ; mounting the same volume to multiple instances is not supported. Option C is incorrect because ElastiCache sessions are ephemeral and lost on cache node failure.

Option D is incorrect because ELB sticky sessions cause uneven load and do not preserve session data on instance failure.

Practice this question →

199

MCQmedium

A company runs a web application on Amazon EC2 instances in an Auto Scaling group behind an Application Load Balancer (ALB). The application stores session state in memory on each instance. The SysOps administrator wants to make the application highly available across multiple Availability Zones without losing session data when instances are terminated or replaced. The solution must minimize application changes. Which approach should the administrator take?

A.Use sticky sessions (session affinity) on the ALB and configure the Auto Scaling group with a larger min size.

B.Store session data in a shared Amazon ElastiCache cluster and modify the application to read/write session state to ElastiCache.

C.Deploy the application in multiple AWS Regions and use Amazon Route 53 with latency-based routing.

D.Store session data in an Amazon RDS for MySQL database and configure the application to read/write session state to the database.

AnswerB

ElastiCache provides a centralized, highly available in-memory cache that decouples session state from individual EC2 instances. Instances can be replaced or scaled without losing session data.

Why this answer

Option B is correct because storing session state in a shared Amazon ElastiCache cluster decouples session data from individual EC2 instances, allowing any instance in the Auto Scaling group to serve any user request without losing session data when instances are terminated or replaced. This approach requires minimal application changes (only modifying the session handler to point to ElastiCache) and supports high availability across multiple Availability Zones by using a replicated ElastiCache cluster (e.g., Redis with replication).

Exam trap

The trap here is that candidates often choose sticky sessions (Option A) because they seem to solve session affinity without code changes, but they fail to realize that sticky sessions do not persist session data across instance terminations, which is the core requirement for high availability without data loss.

How to eliminate wrong answers

Option A is wrong because sticky sessions (session affinity) bind a user's session to a specific EC2 instance; if that instance is terminated or replaced, the session data stored in memory is lost, violating the requirement to not lose session data. Option C is wrong because deploying across multiple AWS Regions with Route 53 latency-based routing does not address session state persistence within a single region; it introduces cross-region latency and complexity without solving the fundamental issue of in-memory session loss on instance termination. Option D is wrong because while storing session data in Amazon RDS for MySQL would persist session state, it introduces significant overhead (e.g., database connection management, schema design, and slower read/write compared to in-memory caching) and requires more extensive application changes than using ElastiCache, which is purpose-built for session storage.

Practice this question →

200

MCQmedium

Refer to the exhibit. An IAM policy is attached to a group. A SysOps Administrator in that group tries to terminate an EC2 instance with the tag 'Environment=production'. The action fails. What is the most likely reason?

A.The condition key 'ec2:ResourceTag/Environment' is invalid.

B.The policy does not allow the ec2:TerminateInstances action.

C.The instance does not have the tag 'Environment=production'.

D.The condition uses 'StringEquals' instead of 'StringLike'.

AnswerC

The policy requires the tag to be present for the action to be allowed.

Why this answer

Option A is correct because the policy allows termination only for instances with the tag 'Environment=production', but there might be an explicit deny elsewhere or the instance does not have the tag. Option B is wrong because the policy allows the action. Option C is wrong because the condition uses StringEquals, which is case-sensitive.

Option D is wrong because the condition key is valid.

Practice this question →

201

MCQeasy

A company wants to ensure that its Amazon RDS database can withstand the loss of an entire Availability Zone. Which feature should the SysOps administrator enable?

A.Enable automated backups with a retention period of 35 days.

B.Enable Multi-AZ deployment.

C.Take manual snapshots and copy them to another Region.

D.Create a read replica in a different Availability Zone.

AnswerB

Multi-AZ provides automatic failover to a standby in another AZ.

Why this answer

Option B is correct: Multi-AZ deployment creates a standby in a different AZ; if the primary AZ fails, RDS automatically fails over to the standby. Option A is incorrect because read replicas can be in a different AZ but are not used for automatic failover. Option C is incorrect because manual snapshots require manual restoration.

Option D is incorrect because automated backups are stored in S3 and do not provide automatic failover.

Practice this question →

202

MCQmedium

A company is running a critical web application on EC2 instances behind an Application Load Balancer. The instances are in an Auto Scaling group across two Availability Zones. The company needs to ensure that if an entire Availability Zone fails, the application remains available. Which configuration meets this requirement?

A.Use larger EC2 instance types to handle the load during a failure.

B.Use Amazon CloudFront to distribute traffic across multiple regions.

C.Launch all instances in the same Availability Zone to reduce latency.

D.Configure the Auto Scaling group to launch instances in at least two Availability Zones.

AnswerD

Multiple AZs provide redundancy if one AZ fails.

Why this answer

Spreading instances across multiple Availability Zones ensures that if one AZ fails, the other AZ continues to serve traffic. Option B is correct because it explicitly uses multiple AZs. Option A is wrong because using a single AZ does not provide AZ-level fault tolerance.

Option C is wrong because Amazon CloudFront is a CDN, not a solution for AZ failure. Option D is wrong because increasing instance size does not provide AZ redundancy.

Practice this question →

203

MCQmedium

A company hosts a critical web application on Amazon EC2 instances in a single AWS Region (us-east-1). The SysOps administrator needs to implement a Disaster Recovery (DR) solution using a different AWS Region (us-west-2). The DR plan requires a Recovery Time Objective (RTO) of 1 hour and a Recovery Point Objective (RPO) of 15 minutes. The application uses an Amazon Aurora MySQL DB cluster and static assets stored in an Amazon S3 bucket. Which combination of actions should the administrator take to meet these requirements?

A.Create an Aurora cross-Region read replica in us-west-2. Configure S3 Cross-Region Replication from the source bucket to a destination bucket in us-west-2. During DR, promote the read replica to a primary cluster and update DNS.

B.Take a manual snapshot of the Aurora DB cluster every 15 minutes and copy it to us-west-2. Use S3 batch operations to copy assets to us-west-2 daily.

C.Enable Aurora Multi-AZ in us-east-1 and configure S3 transfer acceleration to us-west-2.

D.Use AWS Database Migration Service (DMS) for continuous replication to a DB instance in us-west-2. Use S3 versioning to keep previous object versions.

AnswerA

A cross-Region read replica provides continuous replication for the database, achieving RPO seconds. Promoting it can be done in minutes, meeting RTO of 1 hour. S3 CRR replicates objects asynchronously, typically within minutes, satisfying the RPO.

Why this answer

Option A is correct because Aurora cross-Region read replicas provide asynchronous replication with an RPO typically under 1 second, easily meeting the 15-minute RPO requirement. Promoting the read replica to a primary cluster in us-west-2 can be completed within minutes, satisfying the 1-hour RTO. S3 Cross-Region Replication (CRR) automatically replicates static assets to the destination bucket in us-west-2 with near-real-time latency, ensuring the S3 data is also current within the RPO window.

Exam trap

The trap here is that candidates often confuse Multi-AZ (which provides high availability within a single region) with cross-region disaster recovery, or they assume manual snapshots and DMS are simpler alternatives without considering the RPO/RTO constraints and operational overhead.

How to eliminate wrong answers

Option B is wrong because taking manual snapshots every 15 minutes is operationally impractical and cannot guarantee an RPO of 15 minutes due to snapshot creation and copy latency; also, copying assets daily via S3 batch operations far exceeds the 15-minute RPO. Option C is wrong because Aurora Multi-AZ in us-east-1 only provides high availability within a single region, not cross-region disaster recovery, and S3 Transfer Acceleration only improves upload speed to a single bucket, not replication to another region. Option D is wrong because AWS DMS for continuous replication to a DB instance in us-west-2 introduces additional complexity and potential lag that may not meet the 15-minute RPO as reliably as Aurora native replication; S3 versioning alone does not replicate objects to another region, so it fails to provide cross-region DR for static assets.

Practice this question →

204

MCQmedium

A company runs a production database on Amazon RDS for MySQL with Multi-AZ enabled. During a recent Availability Zone outage, the database experienced a failover. After the failover, the application team notices that the database endpoint in the connection string no longer works. What is the most likely cause?

A.The application is using the IP address of the database instance instead of the DNS endpoint.

B.The DB instance identifier changed after the failover.

C.The security group for the RDS instance was modified during the failover.

D.The DNS CNAME record for the RDS endpoint was manually changed.

AnswerA

Using the IP address bypasses DNS updates; after failover, the IP changes and the application cannot connect.

Why this answer

Option A is correct because the DNS name for an RDS instance automatically resolves to the primary instance; after failover, the DNS is updated to point to the new primary, so the endpoint should still work. Option B is wrong because the CNAME is automatically updated by AWS. Option C is wrong because the DB instance identifier does not change.

Option D is wrong because the security group rules apply to the RDS instance, not the connection string.

Practice this question →

205

MCQmedium

A company uses an RDS for MySQL Multi-AZ DB instance. They want to minimize downtime during a planned maintenance update that requires a database engine version upgrade. What should the SysOps administrator do?

A.Use the AWS Console to apply the maintenance update immediately; the Multi-AZ configuration will minimize downtime.

B.Modify the DB instance to be a Single-AZ deployment, then apply the upgrade.

C.Take a snapshot, restore it as a new instance, and upgrade that instance.

D.Create a read replica in the same region, upgrade the replica, and promote it.

AnswerA

RDS applies the upgrade to the standby first, then performs a failover, so the primary is only briefly unavailable.

Why this answer

Option D is correct because Multi-AZ updates are applied to the standby first, then a failover occurs, minimizing downtime. Option A is wrong because a snapshot restore takes longer. Option B is wrong because a read replica is not used for writes and promotion takes time.

Option C is wrong because Multi-AZ already provides high availability; disabling it increases downtime.

Practice this question →

206

MCQeasy

A company processes orders using an Amazon SQS standard queue. The order processing application occasionally fails to process a message. The SysOps administrator wants to ensure that any message that fails to be successfully processed after three attempts is automatically moved to a separate queue for manual review. Which SQS feature should be configured?

A.Configure a Dead Letter Queue (DLQ) with a redrive policy

B.Increase the visibility timeout of the queue

C.Convert the queue to a FIFO queue

D.Enable redrive allow policy on the queue

AnswerA

DLQs automatically move messages to a separate queue after the specified number of receive attempts.

Why this answer

A Dead Letter Queue (DLQ) with a configured redrive policy is the correct SQS feature to automatically move messages that have failed processing after a specified number of attempts (in this case, three) to a separate queue for manual review. The redrive policy defines the source queue, the DLQ, and the maximum receive count (maxReceiveCount) threshold. When a message is received from the source queue more times than the maxReceiveCount, SQS automatically redirects it to the DLQ, isolating problematic messages without manual intervention.

Exam trap

The trap here is that candidates may confuse increasing the visibility timeout (which only delays reprocessing) with the automatic isolation provided by a Dead Letter Queue, or think that converting to FIFO or enabling a redrive allow policy alone solves the problem, when the core requirement is a DLQ with a configured redrive policy that specifies the maxReceiveCount.

How to eliminate wrong answers

Option B is wrong because increasing the visibility timeout only prevents other consumers from processing a message for a longer period, but does not move failed messages to a separate queue; it merely delays reprocessing. Option C is wrong because converting the queue to a FIFO queue changes the ordering and exactly-once processing guarantees, but does not provide automatic redirection of failed messages to a separate queue; FIFO queues also support DLQs, but the conversion itself is not the solution. Option D is wrong because enabling a redrive allow policy on the queue is not a standard SQS feature; the correct term is 'redrive policy' (or 'redrive permission policy') which is used to allow a source queue to use a specific DLQ, but the question asks for the feature that moves failed messages, which is the DLQ with a redrive policy, not just the allow policy.

Practice this question →

207

MCQmedium

Refer to the exhibit. A SysOps administrator ran the commands shown. What is the state of the EC2 instance?

A.The instance is running and healthy.

B.The instance is terminated.

C.The instance is running but has a system impairment.

D.The instance is stopped.

AnswerC

System status shows impaired, indicating an underlying system issue.

Why this answer

Option C is correct because the system status check indicates impaired reachability, which means the instance is running but not reachable. The instance state shows running, so it is not stopped or terminated. The impairment is due to a system issue, not an instance-specific issue like memory pressure (which would be instance status check).

Practice this question →

208

Drag & Dropmedium

Drag and drop the steps to set up an Amazon S3 bucket policy to grant cross-account access into the correct order.

Drag steps to the numbered slots on the right, or tap a step then tap a slot.

Steps

Order

Why this order

Identify the bucket and account, write the policy with correct principal and actions, save, and test.

Practice this question →

209

Multi-Selectmedium

Which THREE measures can be used to improve the recovery time objective (RTO) for a database running on Amazon RDS? (Choose three.)

Select 3 answers

A.Use a read replica and promote it during a failure.

B.Configure automated backups with point-in-time recovery.

C.Enable deletion protection.

D.Create manual snapshots daily.

E.Enable Multi-AZ deployment.

AnswersA, B, E

Promoting a read replica is faster than restoring from backup.

Why this answer

Options A, B, and D are correct. Multi-AZ provides automatic failover, reducing RTO. Automated backups with point-in-time recovery allow quick restore to a specific time.

Read replicas can be promoted to primary quickly. Option C is incorrect because manual snapshots require manual restore, which is slower. Option E is incorrect because deletion protection is about preventing deletion, not recovery.

Practice this question →

210

MCQmedium

A SysOps administrator is designing a disaster recovery plan for a critical application hosted on AWS. The application runs on EC2 instances with data stored in an RDS MySQL database. The RPO must be less than 15 minutes, and the RTO must be less than 1 hour. Which solution meets these requirements?

A.Take daily snapshots of the RDS instance and copy them to another Region.

B.Use an RDS DB instance with a Multi-AZ standby and a cross-Region Read Replica.

C.Use an RDS Multi-AZ DB Cluster deployment.

D.Use a single-AZ RDS instance with automated backups and point-in-time recovery.

AnswerC

Correct: Multi-AZ DB Cluster provides synchronous replication with automatic failover, meeting RPO <15min and RTO <1 hour.

Why this answer

Option D is correct because Multi-AZ DB Cluster provides automatic failover with low RPO and RTO, and synchronous replication keeps data consistent. Option A is wrong because single-AZ with snapshots has an RPO of up to 5 minutes and RTO of several hours. Option B is wrong because Read Replicas are asynchronous, leading to potential data loss.

Option C is wrong because Daily snapshots do not meet the 15-minute RPO.

Practice this question →

211

MCQmedium

A company is using AWS CloudFormation to manage its infrastructure. The SysOps Administrator needs to ensure that updates to a critical stack do not accidentally replace the database. Which feature should be used?

A.Use drift detection to identify changes.

B.Enable termination protection on the stack.

C.Use a change set to review the updates before executing them.

D.Define a stack policy that denies updates to the database resource.

AnswerD

Stack policies allow you to protect specific resources from updates.

Why this answer

Option C is correct because a stack policy can protect specific resources from being updated or replaced during a stack update. Option A is wrong because change sets allow preview but do not prevent replacement. Option B is wrong because it does not prevent updates.

Option D is wrong because drift detection identifies changes but does not prevent them.

Practice this question →

212

MCQhard

A company runs a critical application on Amazon EC2 instances. The application uses an NFS file system stored on an Amazon EFS file system. The SysOps administrator must ensure that the file system is highly available and can withstand an Availability Zone failure. The file system must be accessible from all Availability Zones in the region. Which configuration is required to meet these requirements?

A.Configure the EFS file system for One Zone storage class and mount it using the file system ID.

B.Configure the EFS file system for Standard storage class and mount it using the regional DNS name.

C.Configure EFS to use provisioned throughput and mount it using a mount target IP address.

D.Enable EFS lifecycle management to move files to Infrequent Access storage class.

AnswerB

Correct. Standard storage class replicates data across multiple AZs, and the regional DNS name resolves to mount targets in all AZs for high availability.

Why this answer

Option B is correct because the EFS Standard storage class replicates data across multiple Availability Zones (AZs) within a region, providing high availability and resilience against an AZ failure. Mounting the file system using the regional DNS name ensures that clients in any AZ can reach the file system via the nearest mount target, as the regional DNS name resolves to the mount target IP addresses in the local AZ. This configuration meets the requirement for the file system to be accessible from all AZs in the region while withstanding an AZ outage.

Exam trap

The trap here is that candidates often confuse storage class (One Zone vs. Standard) with performance settings (provisioned throughput) or cost-saving features (lifecycle management), and overlook that the regional DNS name is essential for multi-AZ access and failover.

How to eliminate wrong answers

Option A is wrong because the One Zone storage class stores data only within a single Availability Zone, which does not provide high availability or withstand an AZ failure. Option C is wrong because provisioned throughput is a performance setting, not a high-availability or multi-AZ configuration; mounting via a mount target IP address would pin the client to a specific AZ, failing the requirement for accessibility from all AZs. Option D is wrong because lifecycle management moves files to the Infrequent Access (IA) storage class to reduce costs, but it does not affect the availability or multi-AZ resilience of the file system.

Practice this question →

213

MCQmedium

An Auto Scaling group launches new EC2 instances when CPU exceeds 70 percent. The instances take 4 minutes to bootstrap (install software, register with a service discovery system, and warm up caches). Without a hook, the load balancer routes traffic to new instances before they are ready, causing 503 errors. What is the correct solution?

A.Add a lifecycle hook on the autoscaling:EC2_INSTANCE_LAUNCHING transition; signal CompleteLifecycleAction(CONTINUE) when bootstrap finishes

B.Increase the load balancer health check grace period to 10 minutes to give instances time to bootstrap

C.Increase the warm-up time in the Auto Scaling group's instance refresh configuration

D.Use a weighted target group with 0 weight for new instances until they are confirmed healthy

AnswerA

The hook holds the instance in Pending:Wait, outside the target group, until the signal arrives. The load balancer never routes traffic to the instance during its Pending:Wait phase. After the CONTINUE signal, the instance enters InService and the load balancer registers it normally. The heartbeat timeout (default 1 hour, configurable) should exceed the bootstrap time.

Why this answer

Option A is correct because lifecycle hooks allow the Auto Scaling group to pause instance launch until a custom action (e.g., bootstrap completion) is finished. By adding a hook on the autoscaling:EC2_INSTANCE_LAUNCHING transition, the instance is held in a 'pending:wait' state. Once the bootstrap script calls CompleteLifecycleAction with the CONTINUE result, the instance transitions to 'InService' and can then be registered with the load balancer, preventing premature traffic and 503 errors.

Exam trap

The trap here is that candidates often confuse the health check grace period (which only delays health checks, not registration) with lifecycle hooks (which actually control when the instance becomes available to the load balancer).

How to eliminate wrong answers

Option B is wrong because increasing the load balancer health check grace period only delays when the load balancer starts checking health; it does not prevent the load balancer from routing traffic to the instance before it is ready. The instance is still added to the target group immediately, and the grace period only affects health check status, not registration. Option C is wrong because the warm-up time in an instance refresh configuration controls how long new instances are given to become healthy during a rolling update, not the initial launch or bootstrap process for a scaling event triggered by CPU.

Option D is wrong because weighted target groups distribute traffic based on weights; setting 0 weight for new instances would prevent all traffic, but the instances would still be registered and could receive traffic if the weight is later changed manually, and this approach does not automatically signal readiness after bootstrap.

Practice this question →

214

MCQmedium

A company runs a production Amazon RDS for PostgreSQL DB instance. The SysOps administrator needs to ensure that in the event of a database failure, there is automatic failover to a standby instance in another Availability Zone with minimal downtime. Which deployment configuration should the administrator enable?

A.Multi-AZ deployment

B.Read Replicas

C.Automated backups with point-in-time recovery

D.Amazon RDS Proxy

AnswerA

Multi-AZ provides a standby instance with synchronous replication and automatic failover, meeting the requirement for minimal downtime.

Why this answer

A Multi-AZ deployment for Amazon RDS automatically provisions and maintains a synchronous standby replica in a different Availability Zone. In the event of a database failure or an Availability Zone outage, Amazon RDS automatically fails over to the standby, typically within 60-120 seconds, without requiring manual intervention. This ensures high availability and minimal downtime for the production PostgreSQL DB instance.

Exam trap

The trap here is that candidates often confuse Read Replicas with Multi-AZ deployments, mistakenly believing that Read Replicas provide automatic failover, when in fact they only support manual promotion and are intended for read scaling, not high availability.

How to eliminate wrong answers

Option B is wrong because Read Replicas are designed for read traffic offloading and do not provide automatic failover; they require manual promotion to become the primary instance, which incurs significant downtime. Option C is wrong because automated backups with point-in-time recovery are for data durability and recovery from corruption or accidental deletion, not for automatic failover to a standby instance with minimal downtime. Option D is wrong because Amazon RDS Proxy is a connection pooling and management service that improves application scalability and resilience to database failures, but it does not replace the need for a standby instance or provide automatic failover itself.

Practice this question →

215

MCQmedium

A company runs a critical web application on EC2 instances behind an Application Load Balancer in a single Availability Zone. To improve reliability, what is the MOST effective design change?

A.Place the RDS database in a Multi-AZ deployment.

B.Add a second Application Load Balancer in the same Availability Zone.

C.Increase the EC2 instance size to handle more traffic.

D.Launch EC2 instances across two or more Availability Zones.

AnswerD

Eliminates single point of failure at AZ level.

Why this answer

Option C is correct because deploying instances across multiple Availability Zones eliminates a single point of failure and is a fundamental high-availability design. Option A is wrong because increasing instance size does not provide fault tolerance. Option B is wrong because a Multi-AZ RDS instance improves database reliability but does not address the web tier's single AZ risk.

Option D is wrong because an additional load balancer in the same AZ does not protect against AZ failure.

Practice this question →

216

MCQhard

A SysOps administrator is designing a disaster recovery plan for a critical application that runs on EC2 instances with data stored on EBS volumes. The application requires an RPO of 15 minutes and an RTO of 2 hours. The current solution uses EBS snapshots taken every 6 hours. The administrator needs to improve the backup strategy to meet the RPO. What is the most cost-effective way to achieve this?

A.Enable EBS Recycle Bin with a retention rule of 15 minutes.

B.Change the EBS volume type to io2 Block Express.

C.Increase the snapshot frequency to every 15 minutes.

D.Use EBS Multi-Attach to attach volumes to multiple instances for redundancy.

AnswerC

More frequent snapshots reduce the potential data loss window to 15 minutes, meeting the 15-minute RPO.

Why this answer

Option D is correct because increased snapshot frequency (e.g., every 15 minutes) directly reduces RPO. Option A is incorrect because Recycle Bin only prevents accidental deletion, it does not create new snapshots. Option B is incorrect because changing volume type does not affect snapshot frequency.

Option C is incorrect because EBS Multi-Attach does not provide snapshot capabilities.

Practice this question →

217

MCQmedium

A company runs a stateful web application on a single Amazon EC2 instance with an Elastic IP address. The SysOps administrator needs to increase availability so that if the instance fails, a new instance can be launched quickly with the same configuration and the same IP address. The administrator also needs to ensure data is not lost. Which solution meets these requirements with the least operational overhead?

A.Use an Application Load Balancer with an Auto Scaling group and a launch configuration that includes the Elastic IP

B.Create an AMI from the instance, store data on an Amazon EFS file system, and use an Auto Scaling group with a lifecycle hook to associate the Elastic IP

C.Create a CloudFormation template that launches a new instance and associates the Elastic IP

D.Place the instance in an Auto Scaling group with a minimum of 1 and a maximum of 1, and set the health check to replace unhealthy instances

AnswerB

The AMI provides a pre-configured launch template. EFS provides durable, shared storage for application data. The Auto Scaling group automatically launches a new instance if the current one fails, and the lifecycle hook script associates the Elastic IP to the new instance, ensuring continuity with the same IP.

Why this answer

Option B is correct because it separates the stateful data (stored on Amazon EFS) from the compute instance, ensuring data persistence even if the instance fails. Creating an AMI from the instance captures the configuration, and an Auto Scaling group with a lifecycle hook can associate the Elastic IP to the new instance automatically, providing a quick failover with minimal operational overhead.

Exam trap

The trap here is that candidates often assume an Auto Scaling group alone can handle Elastic IP association, but without a lifecycle hook or custom script, the new instance will not automatically receive the Elastic IP, leading to IP address changes and potential downtime.

How to eliminate wrong answers

Option A is wrong because an Application Load Balancer (ALB) does not support Elastic IP addresses; ALBs use DNS names and are designed for distributing traffic, not for preserving a static IP for a stateful application. Option C is wrong because a CloudFormation template requires manual or automated invocation to launch a new instance and associate the Elastic IP, which introduces additional operational overhead and does not automatically handle instance failure detection and replacement. Option D is wrong because placing the instance in an Auto Scaling group with a minimum and maximum of 1 does not automatically launch a new instance with the same configuration or data; it only replaces the instance if it becomes unhealthy, but without a lifecycle hook to associate the Elastic IP or a mechanism to preserve stateful data, the solution fails to meet the requirements.

Practice this question →

218

MCQmedium

An application running on Amazon ECS with Fargate launch type is experiencing intermittent failures. The tasks are spread across multiple Availability Zones. The SysOps administrator notices that failures occur only when an entire AZ becomes unavailable. What should the administrator do to improve the reliability of the application?

A.Use multiple subnets in the same AZ.

B.Use a cluster placement group.

C.Attach an Amazon EFS filesystem to all tasks.

D.Increase the desired task count to ensure sufficient capacity across AZs.

AnswerD

More tasks distributed across AZs provide redundancy.

Why this answer

Option A is correct because increasing the number of tasks ensures that even if one AZ fails, tasks in other AZs can handle the load. Option B is incorrect because using more subnets does not necessarily improve availability. Option C is incorrect because placement groups are not applicable to Fargate.

Option D is incorrect because EFS is for shared storage, not compute reliability.

Practice this question →

219

Multi-Selecteasy

A company uses Amazon S3 to store backup data. The SysOps administrator needs to ensure that the data is encrypted at rest and that access is limited to only authorized users. Which TWO actions should be taken? (Choose TWO.)

Select 2 answers

A.Enable default encryption on the S3 bucket using SSE-S3 or AWS KMS.

B.Block all public access to the S3 bucket.

C.Create a bucket policy that allows only specific IAM roles or users.

D.Enable S3 Versioning on the bucket.

E.Enable S3 Transfer Acceleration.

AnswersA, C

Default encryption ensures all objects are encrypted.

Why this answer

Options A and D are correct. Option A: Enabling S3 default encryption ensures all objects are encrypted at rest. Option D: Using bucket policies with conditions like aws:SourceIp or aws:PrincipalArn restricts access.

Option B is wrong because S3 Transfer Acceleration is for faster uploads, not encryption. Option C is wrong because blocking public access is a security best practice but does not guarantee authorized user access. Option E is wrong because versioning is for data protection, not encryption or access control.

Practice this question →

220

MCQhard

A company's S3 bucket contains critical data. The bucket policy allows s3:PutObject and s3:GetObject for all authenticated users in the account. A SysOps administrator needs to ensure that if the bucket is accidentally deleted, the objects can be recovered. Which solution meets this requirement?

A.Set a bucket policy that requires MFA for any s3:DeleteObject action.

B.Add a bucket policy that denies s3:DeleteBucket for all principals.

C.Enable versioning on the bucket and enable MFA delete.

D.Enable cross-region replication on the bucket.

AnswerC

Versioning preserves all object versions, and MFA delete adds an extra layer of protection, requiring a second factor to permanently delete versions.

Why this answer

Option D is correct because S3 versioning combined with MFA delete protection prevents accidental deletion of versions. Option A is wrong because cross-region replication replicates objects but does not protect against bucket deletion; the replicated bucket could also be deleted. Option B is wrong because a bucket policy cannot prevent deletion of the bucket itself; deletion is controlled by IAM.

Option C is wrong because bucket policies do not prevent deletion of the bucket; they only control access to objects.

Practice this question →

221

Multi-Selecthard

A SysOps administrator is troubleshooting a high error rate on an Application Load Balancer (ALB). The ALB is configured with two target groups: one for EC2 instances and one for Lambda functions. The administrator notices that the EC2 target group is unhealthy. Which THREE steps should the administrator take to resolve the issue?

Select 3 answers

A.Review the ALB's DNS resolution for the target instances.

B.Verify that the EC2 instances' security groups allow traffic from the ALB.

C.Increase the size of the Auto Scaling group to distribute load.

D.Check the health check settings on the target group for correct path and interval.

E.Inspect the application logs on the EC2 instances for errors.

AnswersB, D, E

Security group misconfiguration can block health checks.

Why this answer

Options A, B, and D are correct. Checking security groups ensures traffic from ALB is allowed, health check configuration ensures correct path and interval, and instance logs help identify application errors. Option C is wrong because DNS issues are not related to health checks.

Option E is wrong because scaling does not fix health check failures.

Practice this question →

222

MCQhard

A company runs a stateful web application on EC2 instances in an Auto Scaling group behind an Application Load Balancer. Users report that their sessions are frequently lost during scaling events. What is the MOST effective solution to maintain session persistence?

A.Increase the cooldown period for the Auto Scaling group.

B.Modify the application to store session data in an external data store such as ElastiCache or DynamoDB.

C.Enable sticky sessions (session affinity) on the Application Load Balancer.

D.Use larger EC2 instance types to reduce the frequency of scaling.

AnswerC

Sticky sessions route a user's requests to the same instance, preserving sessions during scaling if the instance is not terminated.

Why this answer

Option D is correct because enabling sticky sessions (session affinity) on the ALB ensures that a user's requests are routed to the same instance during a session. Option A is wrong because increasing the cooldown delay does not solve session loss during scale-in. Option B is wrong while helpful for state offloading, it does not address the immediate issue of session persistence; a full rewrite may be needed.

Option C is wrong because a larger instance size does not prevent session loss when instances are terminated.

Practice this question →

223

Multi-Selecthard

A company uses AWS CloudFormation to deploy infrastructure. The SysOps administrator needs to ensure that if a stack update fails, the stack is automatically rolled back to the last known good state. Which THREE steps should the administrator take? (Choose three.)

Select 3 answers

A.Create a manual snapshot of the database before each update.

B.Configure the stack to use a service role with permissions to perform rollback actions.

C.Use rollback triggers to monitor CloudWatch metrics and automatically roll back the stack if a metric breache.

D.Enable termination protection on the stack.

E.Define a stack policy that prevents updates to the database resources.

AnswersB, C, E

A service role ensures CloudFormation has the necessary permissions to roll back resources.

Why this answer

CloudFormation stack policies protect critical resources from accidental updates. Rollback triggers monitor stack metrics and initiate rollback if a threshold is breached. IAM permissions are necessary to allow CloudFormation to perform rollback.

Manual snapshots and SNS notifications do not automate rollback.

Practice this question →

224

MCQmedium

An RDS Multi-AZ DB instance fails over to the standby. The application uses the DB instance endpoint. What should the SysOps administrator usually do in the application after failover?

A.Ensure the application retries/reconnects using the same DB endpoint.

B.Manually change the application to the standby instance IP address.

C.Restore from the latest snapshot before reconnecting.

D.Create a new read replica and promote it immediately.

AnswerA

The endpoint remains the abstraction for the active database instance.

Why this answer

When an RDS Multi-AZ DB instance fails over to the standby, the DNS record for the DB instance endpoint is automatically updated to point to the new primary instance. The application should simply retry or reconnect using the same endpoint; no manual changes are needed because the endpoint remains valid. This is the standard behavior for Multi-AZ deployments, ensuring minimal disruption.

Exam trap

The trap here is that candidates may think they need to manually update the connection string or IP address, but the DNS endpoint automatically resolves to the new primary after failover, so only retry logic is required.

How to eliminate wrong answers

Option B is wrong because the application should use the DNS endpoint, not the IP address; the IP address can change after failover, and relying on it would break connectivity. Option C is wrong because restoring from a snapshot is unnecessary and would cause data loss; Multi-AZ failover preserves data without requiring a restore. Option D is wrong because creating and promoting a read replica is not the correct recovery action for a Multi-AZ failover; the standby is already promoted automatically by RDS.

Practice this question →

225

MCQmedium

A company runs a production Amazon RDS for PostgreSQL DB instance in a single Availability Zone (AZ). The SysOps administrator needs to improve database availability so that in the event of a database failure or AZ outage, a standby instance is automatically promoted with minimal downtime. Which configuration should the administrator enable?

A.Enable automated backups with a retention period of 35 days.

B.Create a read replica in another Availability Zone.

C.Enable Multi-AZ deployment on the DB instance.

D.Schedule manual snapshots to be taken every hour and restore from the latest snapshot when needed.

AnswerC

Multi-AZ deployment creates a standby in a different AZ with synchronous replication. AWS handles automatic failover, ensuring high availability with minimal disruption.

Why this answer

Multi-AZ deployment automatically creates and maintains a synchronous standby replica in a different Availability Zone. In the event of a failure or AZ outage, Amazon RDS automatically fails over to the standby, typically within 60–120 seconds, with no manual intervention required. This meets the requirement for automatic promotion with minimal downtime.

Exam trap

The trap here is that candidates confuse read replicas (which are for read scaling and require manual promotion) with Multi-AZ (which provides automatic failover), or they overestimate the speed and automation of backups and snapshots for disaster recovery.

How to eliminate wrong answers

Option A is wrong because automated backups only provide point-in-time recovery (PITR) to restore the database to a specific time, not automatic failover with minimal downtime; restoration is a manual process that can take hours. Option B is wrong because a read replica is designed for read scaling and asynchronous replication, not automatic failover; promoting a read replica requires manual intervention and can result in data loss due to replication lag. Option D is wrong because manual snapshots require scheduling and manual restoration, which involves significant downtime and does not provide automatic failover or minimal disruption.

Practice this question →

← PreviousPage 3 of 4 · 240 questions totalNext →

Ready to test yourself?

Try a timed practice session using only Soa Reliability Bc questions.

Start 20-question session