Courseiva
Knowledge + Practice
CertificationsVendorsCareer RoadmapsLabs & ToolsStudy GuidesGlossaryPractice Questions
C
Courseiva

Free IT certification practice questions with explained answers for CCNA, CompTIA, AWS, Azure, Google Cloud, and more.

Certification Practice Questions

CCNA practice questionsSecurity+ SY0-701 practice questionsAWS SAA-C03 practice questionsAZ-104 practice questionsAZ-900 practice questionsCLF-C02 practice questionsA+ Core 1 practice questionsGoogle Cloud ACE practice questionsCySA+ CS0-003 practice questionsNetwork+ N10-009 practice questions
View all certifications →

Product

CertificationsCertification PathsExam TopicsPractice TestsExam Dumps vs Practice TestsStudy HubComparisons

Company

AboutContactEditorial PolicyQuestion Writing PolicyTrust Center

Legal

Privacy PolicyTerms of Service

Courseiva is a free IT certification practice platform offering original exam-style practice questions, detailed explanations, topic-based practice, mock exams, readiness tracking, and study analytics for Cisco, CompTIA, Microsoft, AWS, and other technology certifications.

© 2026 Courseiva. Courseiva is operated by JTNetSolutions Ltd. All rights reserved.

Courseiva is an independent certification practice platform and is not affiliated with, endorsed by, or sponsored by Cisco, Microsoft, AWS, CompTIA, Google, ISC2, ISACA, or any other certification vendor. Vendor names and certification marks are used only to identify the exams learners are preparing for.

← Reliability and Business Continuity practice sets

SOA-C02 Reliability and Business Continuity • Complete Question Bank

SOA-C02 Reliability and Business Continuity — All Questions With Answers

Complete SOA-C02 Reliability and Business Continuity question bank — all 0 questions with answers and detailed explanations.

240
Questions
Free
No signup
Certifications/SOA-C02/Practice Test/Reliability and Business Continuity/All Questions
Question 1mediummultiple choice
Read the full Reliability and Business Continuity explanation →

An application uses an Amazon DynamoDB table with on-demand capacity. The SysOps administrator needs to ensure the table remains available during an AWS regional outage. Which strategy should be used?

Question 2mediummultiple choice
Read the full Reliability and Business Continuity explanation →

A SysOps administrator is testing the failover of an Amazon RDS for PostgreSQL Multi-AZ DB instance. The application currently writes to the primary instance in us-east-1a. Which action will manually trigger a failover to the standby instance in us-east-1b?

Question 3easymultiple choice
Read the full Reliability and Business Continuity explanation →

A company runs a web application on Amazon EC2 instances in a single Availability Zone. The SysOps administrator wants to increase the availability of the application so that it can survive an Availability Zone failure. Which action is the most effective?

Question 4mediummultiple choice
Read the full Reliability and Business Continuity explanation →

A company runs a stateful web application on a single Amazon EC2 instance with an Elastic IP address. The SysOps administrator needs to increase availability so that if the instance fails, a new instance can be launched quickly with the same configuration and the same IP address. The administrator also needs to ensure data is not lost. Which solution meets these requirements with the least operational overhead?

Question 5mediummultiple choice
Read the full Reliability and Business Continuity explanation →

A company runs a critical production database on Amazon RDS for MySQL with a Multi-AZ deployment. The database experiences a primary instance failure. The SysOps administrator needs to understand exactly how the failover process worked and why the application experienced a longer-than-expected downtime. Which AWS service or feature should the administrator use to review detailed events and actions during the failover?

Question 6mediummultiple choice
Read the full Reliability and Business Continuity explanation →

A company runs a stateless web application on Amazon EC2 instances in an Auto Scaling group with a minimum of 2 and maximum of 10 instances. The instances are behind an Application Load Balancer (ALB). The SysOps administrator needs to ensure that the application can survive the failure of an entire AWS Availability Zone (AZ) in the region. Which configuration is necessary?

Question 7hardmultiple choice
Read the full NAT/PAT explanation →

A company runs a critical stateful web application on Amazon EC2 instances in a single AWS region. The application stores user session data in an Amazon ElastiCache for Redis cluster. The SysOps administrator must design a disaster recovery (DR) strategy that can survive a complete regional outage with a Recovery Point Objective (RPO) of 15 minutes and a Recovery Time Objective (RTO) of 1 hour. The application must be able to redirect users to the DR region with minimal manual effort. Which combination of actions meets these requirements?

Question 8mediummultiple choice
Read the full Reliability and Business Continuity explanation →

A company runs a critical production database on Amazon RDS for MySQL with Multi-AZ deployment. The SysOps administrator needs to be automatically notified when a failover event occurs, and also capture the exact time and reason for the failover for compliance purposes. Which AWS service or feature should be used to capture the failover event details with the least operational overhead?

Question 9hardmultiple choice
Read the full NAT/PAT explanation →

A company runs a critical web application on Amazon EC2 instances that are part of an Auto Scaling group. The application receives unpredictable traffic spikes. The SysOps administrator needs to ensure that when a scale-out event occurs, new instances are ready to serve traffic quickly to minimize latency spikes. Currently, the instance launch and configuration process (including software installs and cache warming) takes about 5 minutes. The administrator wants to reduce the time it takes for new instances to start serving traffic. Which combination of Auto Scaling features should be used?

Question 10mediummultiple choice
Read the full NAT/PAT explanation →

A company runs a stateful web application on a single Amazon EC2 instance. The SysOps administrator needs to implement a high availability architecture that can tolerate an Availability Zone (AZ) failure. The application stores session state in memory and also writes critical data to an Amazon EBS volume. The administrator wants to use an Auto Scaling group and an Application Load Balancer (ALB). Which combination of steps is required to make the application highly available?

Question 11mediummultiple choice
Read the full Reliability and Business Continuity explanation →

A company runs a critical web application on Amazon EC2 instances in an Auto Scaling group behind an Application Load Balancer (ALB). The application uses session stickiness (sticky sessions) to maintain user sessions. The SysOps administrator notices that when instances are replaced during a scale-in or failure event, users lose their session data. The administrator needs to preserve session data across instance failures without losing stickiness benefits. What should the administrator do?

Question 12mediummultiple choice
Read the full Reliability and Business Continuity explanation →

A company runs a critical web application on Amazon EC2 instances in an Auto Scaling group across three Availability Zones in us-east-1. The application stores data in an Amazon RDS for MySQL DB instance with Multi-AZ deployment. The SysOps administrator needs to design a disaster recovery strategy that can recover from a complete regional outage. The Recovery Time Objective (RTO) is 2 hours and the Recovery Point Objective (RPO) is 1 hour. Which solution should the administrator implement?

Question 13mediummultiple choice
Read the full Reliability and Business Continuity explanation →

A company runs a critical application on Amazon EC2 instances with data stored on Amazon EBS volumes. The SysOps administrator needs to implement a backup strategy that supports point-in-time recovery with a Recovery Point Objective (RPO) of 1 hour and a Recovery Time Objective (RTO) of 4 hours. Which solution meets these requirements with the least operational overhead?

Question 14mediummultiple choice
Read the full NAT/PAT explanation →

A company runs a web application on Amazon EC2 instances in an Auto Scaling group behind an Application Load Balancer (ALB). The application stores session state in memory on each instance. The SysOps administrator wants to make the application highly available across multiple Availability Zones without losing session data when instances are terminated or replaced. The solution must minimize application changes. Which approach should the administrator take?

Question 15hardmultiple choice
Read the full Reliability and Business Continuity explanation →

A company runs a critical database workload on an Amazon RDS for MySQL DB instance with Multi-AZ deployment in the us-east-1 region. The SysOps administrator must design a disaster recovery strategy that can recover from a complete regional outage. The Recovery Time Objective (RTO) is 2 hours and the Recovery Point Objective (RPO) is 1 hour. Which solution meets these requirements at the lowest cost?

Question 16mediummultiple choice
Read the full NAT/PAT explanation →

A company runs a stateful web application on a single Amazon EC2 instance. The application stores session state in memory and writes critical data to an Amazon EBS volume. The SysOps administrator needs to implement a highly available architecture that can tolerate an Availability Zone (AZ) failure. The administrator plans to use an Auto Scaling group and an Application Load Balancer (ALB). Which combination of steps is required to make the application highly available while preserving session and data durability across AZ failures?

Question 17mediummultiple choice
Read the full Reliability and Business Continuity explanation →

A company runs a global e-commerce application that uses Amazon DynamoDB as its primary database. The application requires single-digit millisecond read and write latency from any region and must continue to operate during a regional outage with minimal data loss. Which DynamoDB feature should the SysOps administrator enable to meet these requirements?

Question 18mediummultiple choice
Read the full DNS explanation →

A company runs a web application on two EC2 instances in different Availability Zones, each with an Elastic IP address. The SysOps administrator needs to automatically route traffic to the healthy instance and fail over if one instance becomes unhealthy. The application must be accessible via a single DNS name. Which Route 53 routing policy should be used?

Question 19mediummultiple choice
Read the full Reliability and Business Continuity explanation →

A company runs an Amazon RDS for MySQL DB instance in us-east-1. The SysOps administrator needs to implement a disaster recovery solution that can recover from a regional outage with a Recovery Point Objective (RPO) of less than 1 second and a Recovery Time Objective (RTO) of less than 1 minute. Which solution should the administrator use?

Question 20hardmultiple choice
Read the full Reliability and Business Continuity explanation →

A company has an Amazon RDS for PostgreSQL DB instance with Multi-AZ deployment in us-east-1. The SysOps administrator must design a disaster recovery strategy to recover from a regional outage. The Recovery Time Objective (RTO) is 1 hour and the Recovery Point Objective (RPO) is 5 minutes. Which solution meets these requirements at the lowest cost?

Question 21hardmultiple choice
Read the full Reliability and Business Continuity explanation →

A company runs a critical application on Amazon EC2 instances. The application uses an NFS file system stored on an Amazon EFS file system. The SysOps administrator must ensure that the file system is highly available and can withstand an Availability Zone failure. The file system must be accessible from all Availability Zones in the region. Which configuration is required to meet these requirements?

Question 22mediummultiple choice
Read the full Reliability and Business Continuity explanation →

A company runs a critical database on an Amazon RDS for MySQL DB instance. The SysOps administrator needs to ensure that the database can survive a single Availability Zone failure with minimal downtime. Which configuration should the administrator implement?

Question 23hardmultiple choice
Read the full Reliability and Business Continuity explanation →

A company runs a critical MySQL database on an Amazon RDS DB instance in a single Availability Zone. The SysOps administrator needs to implement a disaster recovery solution with a Recovery Point Objective (RPO) of 5 minutes and a Recovery Time Objective (RTO) of 1 hour, while minimizing costs. Which solution meets these requirements?

Question 24mediummultiple choice
Read the full Reliability and Business Continuity explanation →

A company stores critical data in an Amazon S3 bucket in the us-west-2 Region. The SysOps administrator needs to ensure that all objects are automatically replicated to another AWS Region for disaster recovery. The Recovery Point Objective (RPO) must be less than 15 minutes, and existing objects must also be replicated. Which S3 feature should the administrator use?

Question 25easymultiple choice
Read the full Reliability and Business Continuity explanation →

A company processes orders using an Amazon SQS standard queue. The order processing application occasionally fails to process a message. The SysOps administrator wants to ensure that any message that fails to be successfully processed after three attempts is automatically moved to a separate queue for manual review. Which SQS feature should be configured?

Question 26hardmultiple choice
Read the full Reliability and Business Continuity explanation →

A company runs a critical application on Amazon EC2 instances across multiple Availability Zones. The application stores state data on a shared Amazon EFS file system. The SysOps administrator needs to ensure that the file system remains available if an entire Availability Zone fails. The file system must also provide low-latency access from all instances. Which configuration meets these requirements?

Question 27mediummultiple choice
Read the full Reliability and Business Continuity explanation →

A company has an Amazon DynamoDB table with on-demand capacity mode. The SysOps administrator needs to ensure that the table can survive a regional outage. The table is currently in us-east-1. Which feature should be configured to achieve regional resilience with minimal data loss?

Question 28easymultiple choice
Read the full Reliability and Business Continuity explanation →

A company runs a critical application on Amazon ECS with the Fargate launch type. The application must remain available if a single Availability Zone fails. The SysOps administrator needs to configure the ECS service for high availability. Which configuration should be used?

Question 29mediummultiple choice
Read the full NAT/PAT explanation →

A company hosts a critical web application on Amazon EC2 instances in a single AWS Region (us-east-1). The SysOps administrator needs to implement a Disaster Recovery (DR) solution using a different AWS Region (us-west-2). The DR plan requires a Recovery Time Objective (RTO) of 1 hour and a Recovery Point Objective (RPO) of 15 minutes. The application uses an Amazon Aurora MySQL DB cluster and static assets stored in an Amazon S3 bucket. Which combination of actions should the administrator take to meet these requirements?

Question 30mediummultiple choice
Read the full Reliability and Business Continuity explanation →

A company runs a critical application on a single Amazon EC2 instance. The SysOps administrator needs to ensure that if the instance fails, a new instance is automatically provisioned in a different Availability Zone. Which configuration should the administrator implement?

Question 31mediummultiple choice
Read the full Reliability and Business Continuity explanation →

A company runs a web application on Amazon EC2 instances in an Auto Scaling group that spans two Availability Zones. The application uses an Application Load Balancer (ALB) that is deployed across the same Availability Zones. The SysOps administrator wants to ensure the application remains available if an entire Availability Zone fails. Which configuration is essential for this high availability?

Question 32hardmultiple choice
Read the full NAT/PAT explanation →

A company runs a stateful application on a single Amazon EC2 instance with a 200 GB EBS volume. The application data changes frequently. The SysOps administrator needs a disaster recovery (DR) plan with a Recovery Point Objective (RPO) of 1 hour and a Recovery Time Objective (RTO) of 15 minutes. The DR region is us-west-2. Which combination of steps should the administrator implement?

Question 33mediummultiple choice
Read the full Reliability and Business Continuity explanation →

A company runs a production Amazon RDS for PostgreSQL DB instance. The SysOps administrator needs to ensure that in the event of a database failure, there is automatic failover to a standby instance in another Availability Zone with minimal downtime. Which deployment configuration should the administrator enable?

Question 34mediummulti select
Read the full Reliability and Business Continuity explanation →

A SysOps administrator needs to set up automated backups for a production Amazon RDS for PostgreSQL DB instance and copy the snapshots to another AWS Region for disaster recovery. Order the steps the administrator should take to implement this cross-region snapshot copy. (Drag each step to the correct order from first to last.) (Choose 4.)

Question 35mediummultiple choice
Read the full Reliability and Business Continuity explanation →

A company runs a stateless web application on Amazon EC2 instances in an Auto Scaling group across two Availability Zones. The SysOps administrator needs to ensure that the application can tolerate a failure of an entire Availability Zone. Which configuration is required?

Question 36hardmultiple choice
Read the full Reliability and Business Continuity explanation →

A company runs a stateful application on a single Amazon EC2 instance with an attached EBS volume. The SysOps administrator needs to ensure that in the event of an instance failure, a new instance can be launched quickly with the same data. The Recovery Point Objective (RPO) is 15 minutes and the Recovery Time Objective (RTO) is 30 minutes. Which strategy should the administrator implement?

Question 37mediummultiple choice
Read the full Reliability and Business Continuity explanation →

A company runs a production database on an Amazon RDS for PostgreSQL DB instance in a single Availability Zone. The SysOps administrator needs to improve the database's availability to meet an SLA of 99.99% and ensure automatic failover in case of a database failure. Which configuration change should be made?

Question 38mediummultiple choice
Read the full Reliability and Business Continuity explanation →

A company runs a production Amazon RDS for PostgreSQL DB instance in a single Availability Zone (AZ). The SysOps administrator needs to improve database availability so that in the event of a database failure or AZ outage, a standby instance is automatically promoted with minimal downtime. Which configuration should the administrator enable?

Question 39mediummultiple choice
Read the full Reliability and Business Continuity explanation →

A company runs a production Amazon RDS for MySQL DB instance in a single Availability Zone. The SysOps administrator needs to improve database availability to ensure automatic failover if the primary instance fails. Which configuration should the administrator enable?

Question 40hardmultiple choice
Read the full Reliability and Business Continuity explanation →

A company runs a critical application on a single Amazon EC2 instance with an attached Amazon EBS volume. The SysOps administrator needs to implement a disaster recovery solution that meets a Recovery Point Objective (RPO) of 15 minutes and a Recovery Time Objective (RTO) of 30 minutes. The application runs continuously and data changes frequently. Which solution should the administrator implement?

Question 41easymultiple choice
Read the full Reliability and Business Continuity explanation →

A company runs a critical web application on a single Amazon EC2 instance with a 100 GiB gp2 EBS volume. The SysOps administrator needs to ensure data durability by taking automated snapshots of the root volume every hour. The snapshots should be retained for 7 days. Which AWS service can be used to automate this task with minimal configuration?

Question 42mediummultiple choice
Read the full Reliability and Business Continuity explanation →

A company has a production Amazon RDS for MySQL DB instance in a single Availability Zone. The SysOps administrator needs to improve database availability to ensure automatic failover in the event of a database failure or an Availability Zone outage. Which configuration should the administrator enable?

Question 43mediummultiple choice
Read the full Reliability and Business Continuity explanation →

An RDS Multi-AZ DB instance fails over to the standby. The application uses the DB instance endpoint. What should the SysOps administrator usually do in the application after failover?

Question 44hardmultiple choice
Read the full Reliability and Business Continuity explanation →

A company uses Amazon MQ (RabbitMQ) for messaging between microservices. The SysOps administrator needs to ensure the message broker is highly available with automatic failover and no data loss. Which deployment mode should be used?

Question 45mediummultiple choice
Read the full Reliability and Business Continuity explanation →

A company runs a production application on Amazon EC2 instances in an Auto Scaling group across two Availability Zones. The application uses an Amazon RDS Multi-AZ DB instance. The SysOps administrator wants to test the application's behavior during an Availability Zone failure of the database. Which action should the administrator take to simulate a failure with minimal impact on production?

Question 46easymultiple choice
Read the full Reliability and Business Continuity explanation →

A web application runs on Amazon EC2 instances in an Auto Scaling group behind an Application Load Balancer. To achieve high availability, what is the minimum number of Availability Zones (AZs) that must be configured for the Auto Scaling group?

Question 47easymultiple choice
Read the full Reliability and Business Continuity explanation →

A production RDS MySQL database stores financial records. The team needs the ability to restore the database to any point within the last 7 days in case of accidental data deletion. Automated backups are currently disabled. What must be configured?

Question 48mediummultiple choice
Review the full routing breakdown →

An Auto Scaling group launches new EC2 instances when CPU exceeds 70 percent. The instances take 4 minutes to bootstrap (install software, register with a service discovery system, and warm up caches). Without a hook, the load balancer routes traffic to new instances before they are ready, causing 503 errors. What is the correct solution?

Question 49mediummultiple choice
Read the full Reliability and Business Continuity explanation →

Regulatory requirements mandate that all RDS and EBS backups are replicated to a secondary AWS region within 24 hours of creation. The company has workloads in us-east-1 and must replicate backups to eu-west-1. Restoring from the secondary region must be possible without manual copying steps during a disaster. What service and configuration implements this requirement?

Question 50hardmultiple choice
Read the full NAT/PAT explanation →

An EC2 instance runs a database on a 2 TB EBS gp3 volume. After a corruption event, the team must restore from a snapshot. When they detach the corrupted volume, attach a new volume restored from the snapshot, and start the database, performance is 10 to 20 times lower than normal for the first two hours. What causes this behavior, and what feature eliminates it?

Question 51mediumdrag order
Read the full VPN explanation →

Drag and drop the steps to set up an AWS Site-to-Site VPN connection into the correct order.

Drag steps to the numbered slots on the right, or tap a step then tap a slot.

Steps
Order
1Step 1
2Step 2
3Step 3
4Step 4
5Step 5
Question 52mediumdrag order
Read the full Reliability and Business Continuity explanation →

Drag and drop the steps to set up an Amazon S3 bucket policy to grant cross-account access into the correct order.

Drag steps to the numbered slots on the right, or tap a step then tap a slot.

Steps
Order
1Step 1
2Step 2
3Step 3
4Step 4
5Step 5
Question 53mediummatching
Read the full Reliability and Business Continuity explanation →

Match each AWS storage service to its description.

Drag a concept onto its matching description — or click a concept then click the description.

Concepts
Matches

Object storage for any data

Block storage for EC2 instances

File storage for Linux instances

Managed file system for Windows or Lustre

Low-cost archival storage

Question 54mediummatching
Read the full Reliability and Business Continuity explanation →

Match each AWS backup and disaster recovery service to its feature.

Drag a concept onto its matching description — or click a concept then click the description.

Concepts
Matches

Centralized backup management

Automatic object replication across regions

High availability with standby replica

Read scaling and cross-region disaster recovery

Continuous replication for DR

Question 55mediummultiple choice
Read the full Reliability and Business Continuity explanation →

A company is running a critical web application on EC2 instances behind an Application Load Balancer. The instances are in an Auto Scaling group across two Availability Zones. The company needs to ensure that if an entire Availability Zone fails, the application remains available. Which configuration meets this requirement?

Question 56hardmultiple choice
Read the full DNS explanation →

A company uses a Multi-AZ RDS for MySQL instance for its production database. During a maintenance window, the primary instance fails and a failover occurs. However, the application experiences a 5-minute downtime. The application uses a DNS CNAME record pointing to the RDS endpoint. What is the MOST likely cause of the downtime?

Question 57easymultiple choice
Read the full Reliability and Business Continuity explanation →

A company wants to back up its on-premises file server to AWS. The backup must be encrypted in transit and at rest. Which AWS service should the company use to meet these requirements?

Question 58mediummultiple choice
Read the full Reliability and Business Continuity explanation →

A company has an AWS Lambda function that processes S3 events. The function is critical and must be available even if one Availability Zone fails. How can a SysOps administrator ensure high availability for the Lambda function?

Question 59hardmultiple choice
Read the full Reliability and Business Continuity explanation →

A company runs a stateful web application on EC2 instances in an Auto Scaling group. The application uses a sticky session (session affinity) feature of the Application Load Balancer. During a scale-in event, some users lose their session data. What should the SysOps administrator do to prevent session data loss?

Question 60easymultiple choice
Read the full Reliability and Business Continuity explanation →

A company wants to create a disaster recovery (DR) strategy for its RDS for PostgreSQL database. The primary database is in us-east-1. The company needs a recovery point objective (RPO) of less than 5 minutes and a recovery time objective (RTO) of less than 1 hour. Which solution meets these requirements?

Question 61mediummultiple choice
Read the full Reliability and Business Continuity explanation →

A company has an S3 bucket that stores critical data. The bucket has versioning enabled. A SysOps administrator accidentally deletes a version of an object. What is the quickest way to recover the deleted version?

Question 62hardmultiple choice
Read the full Reliability and Business Continuity explanation →

A company runs a production application on EC2 instances in an Auto Scaling group. The application stores data on an EBS volume. The SysOps administrator wants to ensure that the data is durable and available even if an EC2 instance fails. Which approach should the administrator take?

Question 63easymultiple choice
Read the full NAT/PAT explanation →

A company wants to ensure that its EC2 instances receive patches automatically to maintain security compliance. Which AWS service can be used to automate patch management?

Question 64mediummulti select
Read the full Reliability and Business Continuity explanation →

A company is designing a disaster recovery plan for its critical applications. The plan must minimize data loss and recovery time. Which TWO measures should the SysOps administrator implement?

Question 65hardmulti select
Read the full Reliability and Business Continuity explanation →

A SysOps administrator is troubleshooting a high error rate on an Application Load Balancer (ALB). The ALB is configured with two target groups: one for EC2 instances and one for Lambda functions. The administrator notices that the EC2 target group is unhealthy. Which THREE steps should the administrator take to resolve the issue?

Question 66mediummulti select
Read the full Reliability and Business Continuity explanation →

A company runs a stateless web application on EC2 instances behind an Application Load Balancer. The company wants to improve the application's availability and fault tolerance. Which TWO actions should the SysOps administrator take?

Question 67mediummultiple choice
Read the full Reliability and Business Continuity explanation →

Refer to the exhibit. A SysOps administrator creates an IAM policy to allow an EC2 instance to upload objects to an S3 bucket. However, the instance is unable to upload objects. What is the MOST likely reason?

Exhibit

Refer to the exhibit.

```
{
  "Version": "2012-10-17",
  "Statement": [
    {
      "Effect": "Allow",
      "Action": "s3:PutObject",
      "Resource": "arn:aws:s3:::my-bucket/*"
    }
  ]
}
```
Question 68hardmultiple choice
Read the full Reliability and Business Continuity explanation →

Refer to the exhibit. A SysOps administrator needs to restore the database 'mydb' to the most recent restorable time shown. However, the administrator cannot restore to that time. What is the MOST likely reason?

Network Topology
$ aws rds describe-db-instancesdb-instance-identifier mydbRefer to the exhibit.```"DBInstances": ["DBInstanceIdentifier": "mydb","DBInstanceClass": "db.t3.medium","Engine": "mysql","DBInstanceStatus": "available","MultiAZ": false,"ReadReplicaDBInstanceIdentifiers": [],"BackupRetentionPeriod": 0,"PreferredBackupWindow": "03:00-04:00","LatestRestorableTime": "2024-01-01T04:00:00Z","InstanceCreateTime": "2024-01-01T00:00:00Z"
Question 69easymultiple choice
Read the full Reliability and Business Continuity explanation →

Refer to the exhibit. A SysOps administrator creates a CloudFormation stack with the template shown. After 30 days, what happens to noncurrent versions of objects in the bucket?

Exhibit

Refer to the exhibit.

```
Resources:
  MyBucket:
    Type: AWS::S3::Bucket
    Properties:
      VersioningConfiguration:
        Status: Enabled
      LifecycleConfiguration:
        Rules:
          - Id: DeleteOldVersions
            Status: Enabled
            NoncurrentVersionExpirationInDays: 30
```
Question 70mediummultiple choice
Read the full Reliability and Business Continuity explanation →

A company is running a web application on EC2 instances behind an Application Load Balancer. They want to ensure that if an entire Availability Zone fails, the application remains available. Which configuration should they implement?

Question 71hardmultiple choice
Read the full NAT/PAT explanation →

A company uses S3 to store critical data. They need to ensure that data can be recovered in the event of accidental deletion or overwriting by users. Which combination of actions should they take?

Question 72easymultiple choice
Read the full Reliability and Business Continuity explanation →

A SysOps administrator is setting up a backup plan for an RDS MySQL database. The database is 500 GB in size and is used for a critical application. The company requires a Recovery Point Objective (RPO) of 5 minutes and a Recovery Time Objective (RTO) of 1 hour. Which solution meets these requirements?

Question 73mediummultiple choice
Read the full Reliability and Business Continuity explanation →

An application running on EC2 instances stores session data in an attached EBS volume. The company wants to ensure session data is not lost if an instance fails. Which solution should the administrator implement?

Question 74hardmultiple choice
Read the full Reliability and Business Continuity explanation →

A company runs a critical application on EC2 instances in an Auto Scaling group with a minimum of 2 instances. The instances are in a single Availability Zone. The company wants to achieve 99.99% availability. Which change should they make?

Question 75easymultiple choice
Read the full Reliability and Business Continuity explanation →

A company is designing a disaster recovery plan for its on-premises database. They need to replicate the database to AWS with low latency. Which AWS service should they use?

Question 76mediummultiple choice
Read the full Reliability and Business Continuity explanation →

A SysOps administrator needs to ensure that an S3 bucket can recover from accidental deletions by users. The bucket stores versioned objects. What additional configuration should be enabled to prevent permanent deletion?

Question 77hardmultiple choice
Review the full subnetting walkthrough →

A company is running a stateful web application on a single EC2 instance in a public subnet. The instance stores user sessions locally. The company wants to improve availability without rewriting the application. Which design should they use?

Question 78easymultiple choice
Read the full Reliability and Business Continuity explanation →

A company has an RDS PostgreSQL database with a Multi-AZ deployment. The primary instance fails. What happens to the application connections?

Question 79mediummulti select
Read the full Reliability and Business Continuity explanation →

A company is designing a highly available architecture for a web application. The application uses an Application Load Balancer (ALB) and an Auto Scaling group of EC2 instances. Which TWO steps should the company take to ensure the architecture is resilient to an Availability Zone failure? (Select TWO.)

Question 80hardmulti select
Read the full Reliability and Business Continuity explanation →

A company wants to back up its on-premises file servers to AWS for disaster recovery. The data changes frequently, and the company needs to minimize data loss. Which THREE steps should the company take? (Select THREE.)

Question 81easymulti select
Read the full Reliability and Business Continuity explanation →

A SysOps administrator is planning for disaster recovery of an RDS MySQL database. The database is currently in a single AZ. Which TWO actions will improve recovery time and reduce data loss? (Select TWO.)

Question 82mediummultiple choice
Read the full NAT/PAT explanation →

A company runs a critical web application on EC2 instances behind an Application Load Balancer across three Availability Zones. The application stores session data in an RDS MySQL database. To improve reliability, the company wants to ensure that a single Availability Zone failure does not impact the application's availability. Which combination of actions should the SysOps administrator take?

Question 83easymultiple choice
Read the full Reliability and Business Continuity explanation →

A SysOps administrator is designing a backup strategy for an Amazon EFS file system. The file system stores critical data that must be recoverable within 15 minutes of a failure. Which solution meets these requirements?

Question 84hardmultiple choice
Review the full routing breakdown →

A company has a production AWS account with multiple VPCs connected via a transit gateway. The security team requires that all cross-VPC traffic be inspected by a centralized network firewall appliance. The firewall is deployed in a dedicated inspection VPC. The SysOps administrator must ensure that traffic from VPC A to VPC B is routed through the inspection VPC. Which configuration achieves this?

Question 85mediummultiple choice
Read the full Reliability and Business Continuity explanation →

A SysOps administrator is troubleshooting an issue where an Auto Scaling group is not launching EC2 instances despite having a scaling policy that should trigger when CPU utilization exceeds 80%. The CloudWatch alarm shows that the metric is breaching the threshold, but no instances are launched. What is the most likely cause?

Question 86easymultiple choice
Read the full Reliability and Business Continuity explanation →

A company is using Amazon S3 to store critical data with versioning enabled. The SysOps administrator needs to implement a solution that automatically transitions objects to S3 Glacier Deep Archive after 90 days and permanently deletes them after 7 years. Which S3 feature should be used?

Question 87mediummultiple choice
Read the full Reliability and Business Continuity explanation →

A company runs a stateful web application on EC2 instances behind a Network Load Balancer. The application requires that client requests from a particular session are always sent to the same target instance. Which feature should the SysOps administrator configure on the NLB to meet this requirement?

Question 88hardmultiple choice
Read the full Reliability and Business Continuity explanation →

A SysOps administrator is tasked with designing a disaster recovery strategy for an Amazon RDS for PostgreSQL database. The database is currently in us-east-1. The recovery point objective (RPO) is 5 seconds, and the recovery time objective (RTO) is 1 minute. Which solution meets these requirements?

Question 89easymultiple choice
Read the full NAT/PAT explanation →

A company has an Auto Scaling group that launches EC2 instances in private subnets. The instances need to download software patches from the internet. Which component must be added to the VPC to allow outbound internet traffic while keeping the instances private?

Question 90mediummultiple choice
Read the full Reliability and Business Continuity explanation →

A team of developers is deploying a new microservice that uses Amazon DynamoDB as its data store. The SysOps administrator must ensure that the application can handle a sudden spike in read traffic without throttling. Which DynamoDB feature can be used to automatically handle increases in read capacity?

Question 91mediummulti select
Read the full Reliability and Business Continuity explanation →

A company runs a critical application on Amazon EC2 instances in an Auto Scaling group. The application stores data on an Amazon EBS volume. The SysOps administrator needs to implement a backup strategy that ensures data can be recovered in the event of an AZ failure. Which TWO actions should be taken? (Choose TWO.)

Question 92hardmulti select
Read the full Reliability and Business Continuity explanation →

A SysOps administrator is designing a highly available architecture for a web application using an Application Load Balancer and an Auto Scaling group across three Availability Zones. The application must be able to withstand the loss of an entire AZ. Which THREE components are necessary to meet this requirement? (Choose THREE.)

Question 93easymulti select
Read the full Reliability and Business Continuity explanation →

A company uses Amazon S3 to store backup data. The SysOps administrator needs to ensure that the data is encrypted at rest and that access is limited to only authorized users. Which TWO actions should be taken? (Choose TWO.)

Question 94easymultiple choice
Read the full Reliability and Business Continuity explanation →

A company is designing a highly available web application using an Application Load Balancer (ALB) with EC2 instances in an Auto Scaling group across two Availability Zones. Which configuration ensures that the application remains available if one Availability Zone fails?

Question 95mediummultiple choice
Read the full Reliability and Business Continuity explanation →

A SysOps administrator is designing a disaster recovery plan for a critical RDS MySQL database. The database must be available with a Recovery Point Objective (RPO) of less than 1 hour and a Recovery Time Objective (RTO) of less than 2 hours. The primary region is us-east-1. Which solution meets these requirements?

Question 96hardmultiple choice
Read the full Reliability and Business Continuity explanation →

A company runs a stateful application on EC2 instances behind a Network Load Balancer (NLB). The application uses sticky sessions (session affinity) to maintain client state. During a deployment, the SysOps administrator needs to replace instances without disrupting active sessions. Which approach should be used?

Question 97easymultiple choice
Read the full Reliability and Business Continuity explanation →

An organization needs to back up an Amazon EFS file system daily and retain backups for 30 days. Which AWS service provides a managed backup solution for EFS?

Question 98mediummultiple choice
Read the full Reliability and Business Continuity explanation →

A company is running a critical application on EC2 instances in an Auto Scaling group. The application experiences occasional CPU spikes. The SysOps administrator needs to configure a scaling policy that reacts quickly to increased load but avoids unnecessary scaling actions due to short bursts. Which scaling policy type should be used?

Question 99hardmultiple choice
Read the full Reliability and Business Continuity explanation →

A SysOps administrator is implementing a disaster recovery strategy for an S3 bucket containing critical data. The bucket must be available within 15 minutes if the primary region fails. Data must be replicated with RPO of less than 5 minutes. Which S3 replication configuration should be used?

Question 100easymultiple choice
Read the full DNS explanation →

A company uses Amazon Route 53 for DNS. They want to ensure that if the primary web server fails, traffic is automatically routed to a secondary server in another region. Which routing policy should be used?

Question 101mediummultiple choice
Read the full Reliability and Business Continuity explanation →

A SysOps administrator notices that an RDS instance's storage is nearly full. The instance uses General Purpose SSD (gp2) storage. The administrator needs to increase storage with minimal downtime. Which action should be taken?

Question 102hardmultiple choice
Read the full Reliability and Business Continuity explanation →

A company runs a microservices architecture on Amazon ECS with Fargate. They need to ensure that if a task fails, it is automatically restarted. Which configuration is required?

Question 103mediummulti select
Read the full Reliability and Business Continuity explanation →

A company is designing a backup strategy for its on-premises file servers to AWS. Which TWO services can be used to back up data to AWS? (Choose TWO.)

Question 104mediummulti select
Read the full Reliability and Business Continuity explanation →

A SysOps administrator is designing a highly available architecture for a web application using an Application Load Balancer (ALB) with EC2 instances in an Auto Scaling group. Which TWO configurations are required to ensure high availability? (Choose TWO.)

Question 105hardmulti select
Read the full Reliability and Business Continuity explanation →

A company runs a critical application on EC2 instances with EBS volumes. The SysOps administrator must ensure that EBS snapshots are taken every hour and retained for 7 days. Which THREE steps should be taken to achieve this? (Choose THREE.)

Question 106easymultiple choice
Read the full Reliability and Business Continuity explanation →

A company runs a web application on EC2 instances in an Auto Scaling group behind an Application Load Balancer. The application stores session data in an in-memory cache on the EC2 instances. During an instance refresh, users lose their session data. Which action should be taken to improve reliability without major application changes?

Question 107mediummultiple choice
Read the full Reliability and Business Continuity explanation →

A company runs a production RDS for PostgreSQL instance with Multi-AZ enabled. The database experiences a failover due to an AZ outage. After the failover, the application experiences high latency on write operations. What is the most likely cause?

Question 108hardmultiple choice
Read the full Reliability and Business Continuity explanation →

A SysOps administrator is designing a disaster recovery plan for a critical application that runs on EC2 instances with data stored on EBS volumes. The application requires an RPO of 15 minutes and an RTO of 2 hours. The current solution uses EBS snapshots taken every 6 hours. The administrator needs to improve the backup strategy to meet the RPO. What is the most cost-effective way to achieve this?

Question 109easymultiple choice
Read the full Reliability and Business Continuity explanation →

A company runs a stateless web application on EC2 instances in an Auto Scaling group. The instances are behind an Application Load Balancer. The Auto Scaling group uses a dynamic scaling policy based on average CPU utilization. During a traffic spike, new instances are launched but take 5 minutes to become healthy. Users experience errors during this time. Which solution would reduce the time to serve traffic from new instances?

Question 110mediummultiple choice
Read the full Reliability and Business Continuity explanation →

An organization is using AWS CloudFormation to deploy infrastructure. The SysOps administrator needs to ensure that if a stack update fails, the stack automatically rolls back to the last known good state. Which stack update option should be configured?

Question 111hardmultiple choice
Read the full Reliability and Business Continuity explanation →

A company runs a critical database on an EC2 instance with an EBS volume. The administrator wants to create a disaster recovery plan that can recover the database in a different AWS Region within 4 hours. The database size is 1 TB. What is the MOST efficient approach to meet the RTO?

Question 112easymultiple choice
Read the full Reliability and Business Continuity explanation →

A company uses AWS CloudTrail to log API activity. The SysOps administrator needs to ensure that log files are protected from accidental deletion and are available for compliance audits for at least 7 years. Which service should be used to meet these requirements?

Question 113mediummultiple choice
Read the full Reliability and Business Continuity explanation →

An application running on Amazon ECS with Fargate launch type is experiencing intermittent failures. The tasks are spread across multiple Availability Zones. The SysOps administrator notices that failures occur only when an entire AZ becomes unavailable. What should the administrator do to improve the reliability of the application?

Question 114hardmultiple choice
Read the full DNS explanation →

A company uses Amazon Route 53 for DNS and has a domain registered with Route 53. The company wants to ensure that if the primary web server fails, traffic is automatically routed to a secondary server in a different region. The health check must consider both the server response and the application response time. Which routing policy should be used?

Question 115easymulti select
Read the full Reliability and Business Continuity explanation →

Which TWO actions should a SysOps administrator take to ensure high availability of a web application running on EC2 instances? (Choose two.)

Question 116mediummulti select
Read the full Reliability and Business Continuity explanation →

Which THREE measures can be used to improve the recovery time objective (RTO) for a database running on Amazon RDS? (Choose three.)

Question 117hardmulti select
Read the full Reliability and Business Continuity explanation →

Which TWO steps should a SysOps administrator take to ensure data durability for an Amazon S3 bucket that stores critical documents? (Choose two.)

Question 118mediummultiple choice
Read the full Reliability and Business Continuity explanation →

A company runs a web application on EC2 instances behind an Application Load Balancer. The database is an RDS MySQL instance with Multi-AZ enabled. The application experiences intermittent 5xx errors that correlate with database failover events. What is the MOST likely cause and solution?

Question 119hardmultiple choice
Read the full Reliability and Business Continuity explanation →

A company uses AWS Backup to back up its Amazon EFS file system daily. The backup retention policy is set to 30 days. Recently, a user accidentally deleted a critical directory. The company wants to restore the directory as it existed 2 days ago. What is the MOST cost-effective and quickest way to achieve this?

Question 120easymultiple choice
Read the full Reliability and Business Continuity explanation →

A SysOps administrator needs to ensure that an Amazon S3 bucket can withstand the loss of an entire AWS Availability Zone. What is the SIMPLEST configuration to meet this requirement?

Question 121mediummultiple choice
Read the full Reliability and Business Continuity explanation →

A company runs a stateless web application on EC2 instances in an Auto Scaling group across multiple Availability Zones. The application experiences increased latency during peak hours. The SysOps administrator needs to improve the application's performance and reliability. Which action should be taken?

Question 122hardmultiple choice
Read the full Reliability and Business Continuity explanation →

A company has a production RDS for PostgreSQL instance with Multi-AZ enabled. During a recent failover test, the application experienced a 5-minute downtime. The company requires that failover be completed within 2 minutes. Which action should be taken to meet this requirement?

Question 123easymultiple choice
Read the full DNS explanation →

A company uses Amazon Route 53 for DNS resolution. The company wants to ensure that if a web server becomes unhealthy, traffic is automatically routed to a healthy server in another Availability Zone. Which routing policy should be used?

Question 124mediummultiple choice
Read the full Reliability and Business Continuity explanation →

A SysOps administrator is designing a disaster recovery strategy for a critical application that runs on EC2 instances. The application data is stored on EBS volumes. The recovery point objective (RPO) is 15 minutes, and the recovery time objective (RTO) is 1 hour. Which solution meets these requirements MOST cost-effectively?

Question 125hardmultiple choice
Read the full Reliability and Business Continuity explanation →

A company runs a critical application on EC2 instances in an Auto Scaling group. The application stores state information locally on the instance. The SysOps administrator needs to ensure that if an instance fails, the state is not lost. What should the administrator do?

Question 126easymultiple choice
Read the full Reliability and Business Continuity explanation →

A company has an application that runs on EC2 instances behind an Application Load Balancer. The application uses an RDS Multi-AZ database. The company wants to ensure that the application remains available during a database failover. What should the SysOps administrator do?

Question 127mediummulti select
Read the full Reliability and Business Continuity explanation →

A company is designing a highly available architecture for a web application using AWS services. Which TWO actions should the SysOps administrator take to improve reliability? (Choose TWO.)

Question 128hardmulti select
Read the full Reliability and Business Continuity explanation →

A company wants to implement a disaster recovery solution for its on-premises database using AWS. The solution must have an RPO of less than 1 hour and an RTO of less than 4 hours. Which THREE steps should the SysOps administrator take? (Choose THREE.)

Question 129easymulti select
Read the full Reliability and Business Continuity explanation →

A company wants to protect its data in Amazon S3 from accidental deletion. Which TWO methods should the SysOps administrator use? (Choose TWO.)

Question 130hardmultiple choice
Read the full Reliability and Business Continuity explanation →

A company runs a critical e-commerce application on Amazon ECS with Fargate launch type, fronted by an Application Load Balancer. The application uses an Amazon ElastiCache for Redis cluster for session state and an Amazon RDS for MySQL Multi-AZ database for persistent data. Recently, during a deployment of a new service version, the application became unresponsive for 15 minutes. The SysOps administrator discovered that the deployment updated the task definition with a new environment variable that pointed to an incorrect ElastiCache endpoint. The ECS service was configured with a rolling update, minimum healthy percent of 50%, and maximum percent of 200%. After the deployment, all tasks failed health checks due to a connection timeout to the wrong Redis endpoint. What is the MOST effective way to prevent this issue in future deployments?

Question 131mediummultiple choice
Read the full Reliability and Business Continuity explanation →

A company runs a web application on EC2 instances in an Auto Scaling group across two Availability Zones. The application uses an Application Load Balancer. The SysOps administrator receives an alert that the application is returning 503 errors. The administrator checks the CloudWatch metrics and sees that the ALB's RequestCount is normal, but the HealthyHostCount is zero. The EC2 instances are in running state and pass the EC2 status checks. What is the MOST likely cause and what should the administrator do to resolve the issue?

Question 132mediummultiple choice
Read the full Reliability and Business Continuity explanation →

A company runs a critical web application on EC2 instances behind an Application Load Balancer in a single Availability Zone. To improve reliability, what is the MOST effective design change?

Question 133hardmultiple choice
Read the full Reliability and Business Continuity explanation →

A company's S3 bucket contains critical data. The bucket policy accidentally allowed public write access, and a malicious actor uploaded several objects. The company needs to recover the bucket to a known good state as quickly as possible. What should the SysOps administrator do?

Question 134easymultiple choice
Read the full Reliability and Business Continuity explanation →

A SysOps administrator needs to ensure that an EC2 instance automatically recovers from an underlying hardware failure. Which action should be taken?

Question 135mediummultiple choice
Read the full Reliability and Business Continuity explanation →

A company uses AWS Backup to back up its Amazon EFS file systems. The SysOps administrator needs to ensure that backups are retained for 7 years to meet compliance requirements. What should the administrator do?

Question 136hardmulti select
Read the full Reliability and Business Continuity explanation →

A SysOps administrator is designing a disaster recovery strategy for a production RDS MySQL database. The database must be recoverable within 15 minutes with a Recovery Point Objective (RPO) of less than 5 seconds. Which TWO actions should the administrator take? (Choose two.)

Question 137mediummulti select
Read the full Reliability and Business Continuity explanation →

A company runs a stateless web application on EC2 instances in an Auto Scaling group. To improve reliability during a traffic spike, which THREE actions should the SysOps administrator take? (Choose three.)

Question 138easymulti select
Read the full Reliability and Business Continuity explanation →

A company is designing a backup strategy for its Amazon S3 buckets. Which TWO methods can be used to protect against accidental deletion or overwriting of objects? (Choose two.)

Question 139mediummulti select
Read the full Reliability and Business Continuity explanation →

A SysOps administrator is troubleshooting an issue where an Application Load Balancer (ALB) is returning 503 errors to clients. The target group has healthy EC2 instances. Which THREE possible causes should the administrator investigate? (Choose three.)

Question 140hardmultiple choice
Read the full Reliability and Business Continuity explanation →

A company runs a critical application on a fleet of EC2 instances in an Auto Scaling group behind an Application Load Balancer. The application is deployed in us-east-1 with instances in three Availability Zones. The company's disaster recovery plan requires failover to us-west-2 within 30 minutes with a Recovery Point Objective (RPO) of 15 minutes and a Recovery Time Objective (RTO) of 30 minutes. The application uses an RDS MySQL database and stores session data in ElastiCache Redis. The SysOps administrator has configured an RDS read replica in us-west-2 with cross-Region replication and an ElastiCache cluster in us-west-2. The administrator has also created an AMI of the EC2 instances and copied it to us-west-2. During a disaster simulation, the failover took 45 minutes because the application needed to repopulate the cache. What should the administrator do to meet the RTO?

Question 141mediummultiple choice
Read the full Reliability and Business Continuity explanation →

A company runs a web application on EC2 instances in an Auto Scaling group behind an Application Load Balancer. The application is deployed in a single Availability Zone. The SysOps administrator notices that during peak hours, the application becomes slow and some requests fail. CloudWatch metrics show that CPU utilization on the instances reaches 90%, but the Auto Scaling group does not scale out. The administrator has configured a target tracking scaling policy based on average CPU utilization with a target value of 75%. The Auto Scaling group has a minimum of 2, maximum of 10, and desired capacity of 2. What is the MOST likely reason the Auto Scaling group is not scaling out?

Question 142easymultiple choice
Read the full Reliability and Business Continuity explanation →

A company runs a stateless web application on EC2 instances in an Auto Scaling group. The application is deployed in us-east-1 with three Availability Zones. The SysOps administrator wants to ensure that the application remains available even if an entire Availability Zone becomes unavailable. The Auto Scaling group is configured with a minimum of 3, maximum of 9, and desired capacity of 3. The instances are distributed evenly across the three AZs. What additional configuration is required to ensure the application can survive an AZ failure?

Question 143mediummultiple choice
Read the full Reliability and Business Continuity explanation →

A company uses AWS CloudFormation to deploy its infrastructure. The SysOps administrator needs to ensure that the application stack can be recreated in another AWS Region in the event of a disaster. The stack includes an RDS MySQL database and an EC2 instance running a web server. The administrator wants to automate the backup of the RDS database and the EC2 instance configuration. What is the MOST efficient way to achieve this?

Question 144hardmultiple choice
Read the full Reliability and Business Continuity explanation →

A company runs a production application on EC2 instances in an Auto Scaling group behind an Application Load Balancer. The application uses an RDS for PostgreSQL database. The SysOps administrator has configured a read replica in a different AWS Region for disaster recovery. During a disaster, the primary region becomes unavailable. The administrator promotes the read replica to a standalone instance. After promoting, the application fails to connect to the new database because the endpoint changed. The administrator needs to minimize downtime. What should the administrator do to handle the endpoint change automatically?

Question 145hardmultiple choice
Read the full Reliability and Business Continuity explanation →

A company runs a web application on AWS that uses an Application Load Balancer (ALB) across multiple Availability Zones. The application is deployed on EC2 instances in an Auto Scaling group behind the ALB. The RDS database is Multi-AZ with synchronous replication. Recently, the operations team noticed that during a planned failover test of the primary RDS instance, the application experienced a 30-second timeout and returned 503 errors to users. The RDS failover completed successfully, but the application did not recover until the Auto Scaling group replaced all instances. The application health check endpoint on the EC2 instances checks database connectivity. The ALB health check is configured to check the health check endpoint every 10 seconds with a threshold of 2 consecutive failures. The application uses a connection pool with a timeout of 5 seconds. What is the MOST likely cause of the 503 errors and the need to replace instances?

Question 146mediummultiple choice
Read the full NAT/PAT explanation →

A company runs a critical web application on EC2 instances behind an Application Load Balancer (ALB) across three Availability Zones. The application stores session data in memory on the EC2 instances. During a deployment, a new version of the application is released by terminating and replacing instances. Users report that they are unexpectedly logged out during the deployment. What should a SysOps administrator do to improve the reliability of the application during deployments?

Question 147hardmultiple choice
Read the full Reliability and Business Continuity explanation →

A company uses AWS CloudFormation to deploy a multi-tier application. The stack includes an RDS DB instance with automated backups enabled. The SysOps administrator needs to ensure that the database can be recovered to any point within the last 35 days with minimal data loss. What should the administrator do?

Question 148hardmultiple choice
Read the full Reliability and Business Continuity explanation →

A company runs a production workload on a fleet of EC2 instances in an Auto Scaling group (ASG). The ASG spans three Availability Zones. To avoid regional failure, the company wants to replicate the infrastructure in a second AWS Region and be able to fail over within 30 minutes. The application state is stored in an RDS MySQL database. What is the MOST cost-effective and reliable solution?

Question 149mediummultiple choice
Read the full Reliability and Business Continuity explanation →

A SysOps administrator receives an alert that an EC2 instance in an Auto Scaling group is unhealthy. The instance fails the EC2 status check. What is the BEST course of action to restore availability automatically?

Question 150easymultiple choice
Read the full Reliability and Business Continuity explanation →

A company stores critical data in an S3 bucket. The SysOps administrator needs to ensure that the data is durable and can be recovered if an entire AWS Region becomes unavailable. What is the MOST cost-effective solution?

Question 151easymultiple choice
Read the full Reliability and Business Continuity explanation →

A company runs a web application on EC2 instances in an Auto Scaling group. The application is behind an Application Load Balancer. The company wants to ensure that the application can handle a sudden spike in traffic without downtime. What should the SysOps administrator do?

Question 152mediummultiple choice
Read the full Reliability and Business Continuity explanation →

A company has a production RDS for MySQL database. The SysOps administrator receives an alert that the database instance is running out of storage. The company requires high availability and minimal downtime during any modifications. What should the administrator do?

Question 153easymultiple choice
Read the full Reliability and Business Continuity explanation →

A company runs a stateless web application on EC2 instances in an Auto Scaling group. The application is deployed across multiple Availability Zones. The SysOps administrator wants to ensure that the application remains available even if an entire Availability Zone fails. What is the MOST effective way to achieve this?

Question 154hardmultiple choice
Read the full Reliability and Business Continuity explanation →

A company runs a critical application on AWS Lambda functions. The functions are invoked by an API Gateway endpoint. The SysOps administrator needs to ensure that the application continues to work if an entire AWS Region becomes unavailable. What should the administrator do?

Question 155mediummulti select
Read the full Reliability and Business Continuity explanation →

A company runs a web application on EC2 instances in an Auto Scaling group behind an ALB. The application uses an RDS MySQL database. The SysOps administrator needs to improve the reliability of the database layer. Which TWO actions should the administrator take? (Choose two.)

Question 156hardmulti select
Read the full Reliability and Business Continuity explanation →

A company uses AWS CloudFormation to deploy infrastructure. The SysOps administrator needs to ensure that if a stack update fails, the stack is automatically rolled back to the last known good state. Which THREE steps should the administrator take? (Choose three.)

Question 157mediummulti select
Read the full Reliability and Business Continuity explanation →

A company runs a critical application on EC2 instances in an Auto Scaling group. The group uses a dynamic scaling policy based on CPU utilization. The SysOps administrator wants to ensure that the application remains available during a planned maintenance event that will take down one of the Availability Zones. Which TWO actions should the administrator take? (Choose two.)

Question 158mediummultiple choice
Read the full Reliability and Business Continuity explanation →

A company runs a web application on EC2 instances behind an Application Load Balancer. The application stores session data in a DynamoDB table. To improve reliability, the SysOps administrator needs to ensure that session data is preserved if an EC2 instance fails. Which action should the administrator take?

Question 159easymultiple choice
Read the full Reliability and Business Continuity explanation →

A SysOps administrator is configuring an Amazon RDS for MySQL Multi-AZ deployment. What is the primary benefit of using Multi-AZ?

Question 160hardmultiple choice
Read the full Reliability and Business Continuity explanation →

A company has a critical application running on EC2 instances in an Auto Scaling group behind an ALB. The application uses a custom health check endpoint at /health. The SysOps administrator notices that the ALB occasionally marks instances as unhealthy even though the application is running correctly. The health check settings are: HealthCheckIntervalSeconds=30, HealthyThresholdCount=5, UnhealthyThresholdCount=2. What is the most likely cause of the intermittent health check failures?

Question 161easymultiple choice
Read the full NAT/PAT explanation →

A company uses Amazon S3 to store critical data. The SysOps administrator needs to protect against accidental deletion of objects. Which combination of actions should the administrator take? (Choose the best answer.)

Question 162mediummultiple choice
Read the full Reliability and Business Continuity explanation →

A SysOps administrator is designing a disaster recovery plan for a web application that runs on EC2 instances with data stored in an RDS MySQL database. The application requires a Recovery Point Objective (RPO) of 5 minutes and a Recovery Time Objective (RTO) of 1 hour. Which solution meets these requirements most cost-effectively?

Question 163hardmultiple choice
Read the full NAT/PAT explanation →

A company runs a stateful application on EC2 instances in an Auto Scaling group. The application maintains state in memory. The SysOps administrator wants to ensure that when an instance is terminated, the state is preserved and a new instance can resume operation. Which approach should the administrator use?

Question 164mediummultiple choice
Read the full NAT/PAT explanation →

A SysOps administrator is tuning the health check of an Auto Scaling group. The group uses an ALB. The application takes up to 2 minutes to start. The health check settings are: HealthCheckGracePeriod=300, HealthCheckType=EC2. The administrator notices that instances are often marked unhealthy and terminated shortly after launch. What should the administrator change?

Question 165easymultiple choice
Read the full Reliability and Business Continuity explanation →

A company wants to ensure that its Amazon RDS database can withstand the loss of an entire Availability Zone. Which feature should the SysOps administrator enable?

Question 166mediummultiple choice
Read the full Reliability and Business Continuity explanation →

A SysOps administrator is reviewing the reliability of a production system that uses Amazon DynamoDB as its primary data store. The table has on-demand capacity and a single partition key. The application experiences occasional throttling errors during peak hours. Which action would most effectively improve reliability?

Question 167hardmulti select
Read the full Reliability and Business Continuity explanation →

A company is designing a disaster recovery strategy for its AWS environment. The primary Region is us-east-1, and the secondary Region is us-west-2. The application uses Amazon RDS for MySQL, Amazon S3 for static assets, and EC2 instances in an Auto Scaling group. The RTO is 30 minutes, and the RPO is 15 minutes. Which TWO actions should the SysOps administrator take to meet these requirements? (Choose two.)

Question 168mediummulti select
Read the full Reliability and Business Continuity explanation →

A SysOps administrator is responsible for an Auto Scaling group that runs a critical application. The administrator wants to ensure that the application can recover from an AZ failure. Which THREE steps should the administrator take? (Choose three.)

Question 169easymulti select
Read the full Reliability and Business Continuity explanation →

A SysOps administrator wants to back up an Amazon EBS volume that is attached to an EC2 instance running a production database. The backup must be crash-consistent and should not cause any downtime. Which TWO steps should the administrator take? (Choose two.)

Question 170mediummultiple choice
Read the full Reliability and Business Continuity explanation →

A company runs a web application on EC2 instances behind an Application Load Balancer. The instances are in an Auto Scaling group across three Availability Zones. To improve reliability, the company wants to ensure that if an entire Availability Zone fails, the application remains available. Which configuration should be implemented?

Question 171mediummultiple choice
Read the full Reliability and Business Continuity explanation →

A SysOps administrator is designing a disaster recovery plan for a critical application hosted on AWS. The application runs on EC2 instances with data stored in an RDS MySQL database. The RPO must be less than 15 minutes, and the RTO must be less than 1 hour. Which solution meets these requirements?

Question 172hardmultiple choice
Read the full NAT/PAT explanation →

A company has a production application running on Amazon ECS with Fargate launch type. The application uses an Application Load Balancer. The SysOps administrator notices that during deployments, the application experiences a brief period of downtime. Which combination of actions should the administrator take to achieve zero-downtime deployments?

Question 173easymultiple choice
Read the full Reliability and Business Continuity explanation →

A company stores critical data in an S3 bucket. To ensure data durability and availability, the company wants to automatically replicate objects to a bucket in a different AWS Region. Which S3 feature should be used?

Question 174hardmultiple choice
Read the full Reliability and Business Continuity explanation →

An application runs on EC2 instances in an Auto Scaling group. The instances process messages from an SQS queue. To ensure high availability, the SysOps administrator has configured the Auto Scaling group to span three Availability Zones. However, during a recent failure of one AZ, the application experienced a temporary increase in processing latency. What is the MOST likely cause of this latency?

Question 175easymultiple choice
Read the full Reliability and Business Continuity explanation →

A company wants to ensure that its EC2 instances automatically recover from an instance failure. Which feature should be used?

Question 176mediummultiple choice
Read the full Reliability and Business Continuity explanation →

A company runs a stateful web application on a single EC2 instance. To improve reliability, the company wants to implement a highly available architecture. What should the SysOps administrator do?

Question 177hardmultiple choice
Read the full DNS explanation →

A company uses Amazon Route 53 for DNS and wants to ensure that its web application is highly available across two AWS Regions. The application runs behind an Application Load Balancer in each Region. Which routing policy should be used?

Question 178easymultiple choice
Read the full Reliability and Business Continuity explanation →

A company runs a critical application on an EC2 instance backed by Amazon EBS. To protect against data loss, the company wants to create a backup strategy that allows for point-in-time recovery. Which solution should be used?

Question 179mediummulti select
Read the full Reliability and Business Continuity explanation →

A company is designing a disaster recovery strategy for its production database hosted on Amazon RDS for MySQL. The primary database is in us-east-1. The company requires an RPO of less than 5 minutes and an RTO of less than 1 hour in the event of a Regional failure. Which TWO actions should the company take to meet these requirements?

Question 180hardmulti select
Read the full Reliability and Business Continuity explanation →

A company runs a web application on EC2 instances behind an Application Load Balancer. The instances are in an Auto Scaling group. The SysOps administrator wants to ensure that the application can handle a sudden increase in traffic without downtime. Which THREE actions should be taken?

Question 181mediummulti select
Read the full Reliability and Business Continuity explanation →

A company has a production application running on Amazon ECS with Fargate. The application must be highly available across multiple Availability Zones. Which TWO configurations should be implemented?

Question 182mediummultiple choice
Read the full Reliability and Business Continuity explanation →

A company is running a web application on EC2 instances in an Auto Scaling group behind an Application Load Balancer. The application stores session data on the local instance storage. Users are experiencing session loss during scaling events. What should a SysOps administrator do to maintain session persistence?

Question 183hardmultiple choice
Read the full Reliability and Business Continuity explanation →

A company has a production DynamoDB table with on-demand capacity. They need to ensure business continuity with a Recovery Point Objective (RPO) of 5 minutes and a Recovery Time Objective (RTO) of 1 hour in case of a regional outage. What is the MOST cost-effective solution?

Question 184easymultiple choice
Read the full Reliability and Business Continuity explanation →

A SysOps administrator needs to ensure that an EC2 instance automatically recovers from an underlying hardware failure. Which configuration should be used?

Question 185mediummultiple choice
Read the full Reliability and Business Continuity explanation →

A company uses an RDS for MySQL Multi-AZ DB instance. They want to minimize downtime during a planned maintenance update that requires a database engine version upgrade. What should the SysOps administrator do?

Question 186hardmultiple choice
Read the full Reliability and Business Continuity explanation →

A company's S3 bucket contains critical data. The bucket policy allows s3:PutObject and s3:GetObject for all authenticated users in the account. A SysOps administrator needs to ensure that if the bucket is accidentally deleted, the objects can be recovered. Which solution meets this requirement?

Question 187easymultiple choice
Read the full Reliability and Business Continuity explanation →

A SysOps administrator is designing a disaster recovery plan for a web application. The application runs on EC2 instances in a single Availability Zone. What is the FIRST step to improve availability?

Question 188mediummultiple choice
Read the full Reliability and Business Continuity explanation →

A company's critical application uses an EBS-backed EC2 instance. They want to back up the instance daily with a retention policy of 30 days. What is the MOST efficient way to achieve this?

Question 189hardmultiple choice
Read the full Reliability and Business Continuity explanation →

A company has a production RDS for PostgreSQL instance. They need to recover from a logical corruption that occurred 2 hours ago. Which recovery method will minimize data loss?

Question 190easymultiple choice
Read the full Reliability and Business Continuity explanation →

An application uploads files to an S3 bucket. The SysOps administrator needs to ensure that the files are automatically replicated to another bucket in a different AWS Region for disaster recovery. Which action should be taken?

Question 191mediummulti select
Read the full Reliability and Business Continuity explanation →

Which TWO actions improve the availability of an application running on EC2 instances in a single Auto Scaling group? (Choose 2)

Question 192hardmulti select
Read the full Reliability and Business Continuity explanation →

Which THREE measures help protect an S3 bucket from accidental data loss? (Choose 3)

Question 193mediummulti select
Read the full Reliability and Business Continuity explanation →

Which TWO steps should a SysOps administrator take to ensure that an RDS for MySQL instance can withstand an Availability Zone failure? (Choose 2)

Question 194hardmultiple choice
Read the full Reliability and Business Continuity explanation →

A SysOps administrator runs the above command for an EC2 instance. The instance is running but the system status check is impaired. What does this indicate?

Network Topology
aws ec2 describe-instance-statusinstance-ids i-1234567890abcdef0Refer to the exhibit.```"InstanceStatuses": ["AvailabilityZone": "us-east-1a","InstanceId": "i-1234567890abcdef0","InstanceState": {"Code": 16,"Name": "running"},"InstanceStatus": {"Details": ["Name": "reachability","Status": "passed"],"Status": "ok""SystemStatus": {"Status": "impaired"
Question 195mediummultiple choice
Read the full Reliability and Business Continuity explanation →

A SysOps administrator creates the above IAM policy for a user. The user reports that they cannot delete an object in the bucket 'my-bucket' even though they are using MFA. What is the likely cause?

Exhibit

Refer to the exhibit.

```
{
  "Version": "2012-10-17",
  "Statement": [
    {
      "Effect": "Allow",
      "Action": [
        "s3:GetObject",
        "s3:PutObject",
        "s3:DeleteObject"
      ],
      "Resource": "arn:aws:s3:::my-bucket/*",
      "Condition": {
        "Bool": {
          "aws:MultiFactorAuthPresent": "true"
        }
      }
    }
  ]
}
```
Question 196easymultiple choice
Read the full Reliability and Business Continuity explanation →

A SysOps administrator deploys the above CloudFormation template. The stack creation fails with an error. What is the most likely reason?

Exhibit

Refer to the exhibit.

```
{
  "AWSTemplateFormatVersion": "2010-09-09",
  "Resources": {
    "EC2Instance": {
      "Type": "AWS::EC2::Instance",
      "Properties": {
        "ImageId": "ami-0abcdef1234567890",
        "InstanceType": "t2.micro",
        "AvailabilityZone": "us-east-1a"
      }
    },
    "EBSVolume": {
      "Type": "AWS::EC2::Volume",
      "Properties": {
        "Size": 10,
        "AvailabilityZone": "us-east-1a"
      }
    },
    "VolumeAttachment": {
      "Type": "AWS::EC2::VolumeAttachment",
      "Properties": {
        "InstanceId": { "Ref": "EC2Instance" },
        "VolumeId": { "Ref": "EBSVolume" }
      }
    }
  }
}
```
Question 197easymultiple choice
Read the full Reliability and Business Continuity explanation →

A company is designing a highly available web application on AWS. The application runs on EC2 instances behind an Application Load Balancer. Which configuration ensures that the application remains available if an entire AWS Availability Zone fails?

Question 198mediummultiple choice
Read the full Reliability and Business Continuity explanation →

A SysOps administrator needs to implement a backup strategy for an Amazon RDS for PostgreSQL database. The database is 500 GB and experiences heavy write traffic. Which solution provides the most cost-effective backup with the least impact on database performance?

Question 199hardmultiple choice
Read the full Reliability and Business Continuity explanation →

A company runs a stateful web application on EC2 instances in an Auto Scaling group behind an Application Load Balancer. Users report that their sessions are frequently lost during scaling events. What is the MOST effective solution to maintain session persistence?

Question 200easymultiple choice
Read the full Reliability and Business Continuity explanation →

A company has an S3 bucket that stores critical financial data. The bucket versioning is enabled. A SysOps administrator needs to ensure that data can be recovered after accidental deletion by users. What is the MOST effective way to protect against accidental deletion?

Question 201mediummultiple choice
Read the full Reliability and Business Continuity explanation →

A company runs a production database on Amazon RDS for MySQL with Multi-AZ enabled. During a recent Availability Zone outage, the database experienced a failover. After the failover, the application team notices that the database endpoint in the connection string no longer works. What is the most likely cause?

Question 202hardmultiple choice
Read the full Reliability and Business Continuity explanation →

A company runs a critical application on an EC2 instance that stores data on an EBS volume. The SysOps administrator needs to implement a backup strategy that provides the ability to restore the volume to a specific point in time within the last 24 hours, with a recovery time objective (RTO) of less than 15 minutes. Which solution meets these requirements?

Question 203easymultiple choice
Read the full NAT/PAT explanation →

A company has a fleet of EC2 instances that need to be patched monthly. The SysOps administrator must ensure that the patching process does not affect the availability of the application. Which strategy should the administrator use?

Question 204mediummultiple choice
Read the full Reliability and Business Continuity explanation →

A company runs a web application on EC2 instances in an Auto Scaling group behind an Application Load Balancer. The application stores session data locally on each instance. During a traffic spike, the Auto Scaling group launches new instances, but users report that they are logged out and lose session data. Which solution addresses this issue without modifying the application?

Question 205hardmulti select
Read the full NAT/PAT explanation →

A company uses Amazon S3 to store backup data. The SysOps administrator needs to ensure that the data is protected against accidental deletion by users with administrative privileges. Which combination of actions should the administrator take? (Choose TWO.)

Question 206mediummulti select
Read the full Reliability and Business Continuity explanation →

A company is designing a disaster recovery strategy for a production RDS for MySQL database. The database is currently single-AZ. The recovery point objective (RPO) is 1 hour, and the recovery time objective (RTO) is 15 minutes. Which steps should the SysOps administrator take to meet these requirements? (Choose THREE.)

Question 207hardmulti select
Read the full NAT/PAT explanation →

A company runs a stateless web application on EC2 instances behind an Application Load Balancer. The application is deployed in an Auto Scaling group with a minimum of 2 and maximum of 10 instances. During a traffic spike, the Auto Scaling group launches new instances, but the new instances are immediately marked as unhealthy by the ALB and terminated. What could be the cause? (Choose TWO.)

Question 208mediummultiple choice
Read the full Reliability and Business Continuity explanation →

Refer to the exhibit. A SysOps administrator ran the commands shown. What is the state of the EC2 instance?

Network Topology
Command: aws ec2 describe-instancesinstance-ids i-1234567890abcdef0query 'Reservations[0].Instances[0].State'query 'InstanceStatuses[0].SystemStatus'Refer to the exhibit.Output:"Code": 16,"Name": "running""Status": "impaired","Details": ["Name": "reachability"
Question 209easymultiple choice
Read the full Reliability and Business Continuity explanation →

Refer to the exhibit. An IAM policy is attached to an IAM user. Which action can the user perform?

Exhibit

Refer to the exhibit.

{
    "Version": "2012-10-17",
    "Statement": [
        {
            "Effect": "Allow",
            "Action": [
                "ec2:Describe*",
                "ec2:StartInstances",
                "ec2:StopInstances"
            ],
            "Resource": "*"
        },
        {
            "Effect": "Deny",
            "Action": "ec2:TerminateInstances",
            "Resource": "arn:aws:ec2:us-east-1:123456789012:instance/*"
        }
    ]
}
Question 210hardmultiple choice
Read the full Reliability and Business Continuity explanation →

Refer to the exhibit. A SysOps administrator deployed the CloudFormation template. Which statement is true about data protection?

Exhibit

Refer to the exhibit.

CloudFormation template snippet:

Resources:
  MyBucket:
    Type: AWS::S3::Bucket
    Properties:
      BucketName: my-app-data-123
      VersioningConfiguration:
        Status: Enabled
      LifecycleConfiguration:
        Rules:
          - Id: ExpireOldVersions
            Status: Enabled
            NoncurrentVersionExpirationInDays: 30

  MyBucketPolicy:
    Type: AWS::S3::BucketPolicy
    Properties:
      Bucket: !Ref MyBucket
      PolicyDocument:
        Version: 2012-10-17
        Statement:
          - Effect: Deny
            Action: s3:DeleteBucket
            Resource: !GetAtt MyBucket.Arn
            Principal: "*"
Question 211easymultiple choice
Read the full Reliability and Business Continuity explanation →

A company runs a web application on EC2 instances in an Auto Scaling group behind an Application Load Balancer. The application stores session data on local instance storage. Users report that they are unexpectedly logged out during peak traffic. Which action should the SysOps Administrator take to improve reliability?

Question 212mediummultiple choice
Read the full Reliability and Business Continuity explanation →

A company uses AWS CloudFormation to deploy infrastructure. They want to ensure that if a stack update fails, the stack automatically rolls back to the last known good state. Which CloudFormation stack policy should be used?

Question 213hardmultiple choice
Read the full Reliability and Business Continuity explanation →

A company runs a critical database on an RDS for PostgreSQL instance in a single Availability Zone. The database experiences high write latency. The SysOps Administrator needs to improve the database's reliability and performance without downtime. Which solution meets these requirements?

Question 214easymultiple choice
Read the full Reliability and Business Continuity explanation →

A company uses Amazon S3 to store critical data. They need to protect against accidental deletion of objects. Which feature should the SysOps Administrator enable?

Question 215mediummultiple choice
Read the full Reliability and Business Continuity explanation →

An application running on EC2 instances in an Auto Scaling group uses an SQS queue for decoupling. The application experiences increased latency when the queue has a high number of messages. The SysOps Administrator needs to maintain responsiveness. Which solution is the most cost-effective?

Question 216hardmultiple choice
Read the full Reliability and Business Continuity explanation →

A company runs a stateful web application on a single EC2 instance. The SysOps Administrator wants to improve fault tolerance. Which design should they implement?

Question 217easymultiple choice
Read the full Reliability and Business Continuity explanation →

A company wants to ensure that its S3 bucket is accessible only from a VPC. Which configuration should the SysOps Administrator implement?

Question 218mediummultiple choice
Read the full Reliability and Business Continuity explanation →

A company is using AWS CloudFormation to manage its infrastructure. The SysOps Administrator needs to ensure that updates to a critical stack do not accidentally replace the database. Which feature should be used?

Question 219hardmultiple choice
Review the full subnetting walkthrough →

A company runs a web application on EC2 instances in a private subnet. The application needs to connect to an RDS database in a different VPC. The VPCs are peered. The SysOps Administrator is troubleshooting connectivity issues. The RDS security group allows inbound traffic from the EC2 security group, but connections still fail. What could be the issue?

Question 220mediummulti select
Read the full Reliability and Business Continuity explanation →

A company is designing a disaster recovery strategy for its primary AWS region. The application runs on EC2 instances with an RDS database. The RPO is 15 minutes and RTO is 2 hours. Which TWO actions should the SysOps Administrator take to meet these requirements? (Choose TWO.)

Question 221hardmulti select
Read the full Reliability and Business Continuity explanation →

A company runs a stateless web application on EC2 instances behind an Application Load Balancer. The SysOps Administrator needs to ensure the application can withstand the loss of an entire Availability Zone. Which THREE steps should be taken? (Choose THREE.)

Question 222easymulti select
Read the full Reliability and Business Continuity explanation →

A company wants to ensure its SysOps Administrator can recover an EBS volume from a snapshot. Which TWO steps are required? (Choose TWO.)

Question 223mediummultiple choice
Read the full NAT/PAT explanation →

Refer to the exhibit. An IAM policy is attached to a group. A SysOps Administrator in that group tries to terminate an EC2 instance with the tag 'Environment=production'. The action fails. What is the most likely reason?

Exhibit

Refer to the exhibit.

{
  "Version": "2012-10-17",
  "Statement": [
    {
      "Effect": "Allow",
      "Action": "ec2:TerminateInstances",
      "Resource": "*",
      "Condition": {
        "StringEquals": {
          "ec2:ResourceTag/Environment": "production"
        }
      }
    }
  ]
}
Question 224hardmultiple choice
Read the full Reliability and Business Continuity explanation →

A company runs a critical microservices application on Amazon ECS with Fargate launch type. The application consists of several services that communicate via internal HTTP calls. The SysOps Administrator notices that during periods of increased load, some services become unresponsive and the health checks fail. The ECS service auto scaling is configured based on CPU utilization, but it does not scale quickly enough. The administrator needs to improve the reliability and responsiveness of the application. The services are stateless and can be scaled horizontally. The current architecture uses a single Application Load Balancer for each service. The ALB health checks are set to a 30-second interval with a 5-second timeout and 2 unhealthy thresholds. The administrator has observed that when a service instance becomes unhealthy, it takes too long for the ALB to stop sending traffic to it, causing errors. What should the SysOps Administrator do to improve the reliability and responsiveness of the application?

Question 225mediummultiple choice
Read the full Reliability and Business Continuity explanation →

A company runs a file-sharing application on AWS. Users upload files to an S3 bucket, which triggers a Lambda function to process the files and store metadata in a DynamoDB table. Recently, users have reported that some uploaded files are never processed. The SysOps Administrator checks the CloudWatch logs and finds no errors from the Lambda function. The S3 bucket is configured to send events to the Lambda function. The DynamoDB table has sufficient write capacity. The administrator suspects that the event notifications are being lost. Which action should the SysOps Administrator take to ensure that every file upload triggers a Lambda function and that the function processes the file successfully?

Question 226mediummultiple choice
Read the full NAT/PAT explanation →

A company runs a web application on EC2 instances behind an Application Load Balancer (ALB) in a single Availability Zone. The application stores session data in an RDS MySQL DB instance. To improve reliability, the company wants to deploy the application across multiple Availability Zones. Which combination of actions should the company take to achieve this? (Choose the correct course of action.)

Question 227easymultiple choice
Read the full Reliability and Business Continuity explanation →

A SysOps administrator is tasked with ensuring that an Amazon S3 bucket can withstand the loss of an entire AWS Region. The bucket stores critical data that must be accessible with minimal latency from multiple regions. Which solution meets these requirements?

Question 228hardmultiple choice
Read the full Reliability and Business Continuity explanation →

A company runs a stateful web application on EC2 instances in an Auto Scaling group across two Availability Zones. The application uses an Application Load Balancer for traffic distribution. Users report that their sessions are frequently lost during scale-in events. The SysOps administrator needs to minimize session loss without introducing significant latency. What should the administrator do?

Question 229mediummultiple choice
Read the full Reliability and Business Continuity explanation →

A SysOps administrator is designing a disaster recovery plan for a critical application that runs on EC2 instances in a single region. The RTO is 1 hour, and the RPO is 15 minutes. The application data is stored on an Amazon EBS volume. Which approach meets these requirements at the lowest cost?

Question 230mediummulti select
Read the full Reliability and Business Continuity explanation →

A company is using AWS CloudFormation to manage infrastructure. They want to ensure that if a stack update fails, the stack automatically rolls back to the last known good state. Which two steps should the company take? (Choose TWO.)

Question 231hardmulti select
Read the full Reliability and Business Continuity explanation →

A company runs a web application on EC2 instances in an Auto Scaling group. The application uses an Amazon RDS Multi-AZ DB instance. The SysOps administrator notices that during a recent failover test, the application became unresponsive for several minutes. The administrator wants to improve the application's resilience during failover. Which three actions should the administrator take? (Choose THREE.)

Question 232easymulti select
Read the full Reliability and Business Continuity explanation →

A company wants to ensure that its Amazon S3 data is resilient to an accidental deletion of an entire bucket. Which two actions should the company take? (Choose TWO.)

Question 233hardmultiple choice
Read the full NAT/PAT explanation →

A company runs a critical application on a fleet of EC2 instances in an Auto Scaling group behind an Application Load Balancer (ALB). The application uses an Amazon RDS MySQL Multi-AZ DB instance for persistent storage. The SysOps administrator recently configured a lifecycle hook on the Auto Scaling group to perform a custom action before instance termination. During a recent scale-in event, the administrator noticed that some requests were still being routed to the terminating instance, causing errors. The ALB's deregistration delay is set to 300 seconds. The lifecycle hook has a default timeout of 3600 seconds. The administrator wants to ensure that the instance completes its custom action and that all in-flight requests are drained before the instance is terminated. The custom action typically takes 120 seconds. What should the administrator do to resolve this issue?

Question 234mediummultiple choice
Review the full subnetting walkthrough →

A company has a production AWS account with a single VPC and multiple subnets across two Availability Zones. The company hosts a web application on EC2 instances in an Auto Scaling group. The application uses an Amazon Aurora MySQL database cluster with one writer and two reader instances in the same VPC. The SysOps administrator configured AWS CloudTrail to log API calls and Amazon CloudWatch alarms for operational monitoring. After a recent network partition event in one Availability Zone, the application became unavailable for several minutes. The administrator wants to improve the application's resilience to such events without changing the database cluster configuration. The administrator has budget for additional resources but wants to minimize costs. What should the administrator do?

Question 235easymultiple choice
Read the full Reliability and Business Continuity explanation →

A company uses Amazon S3 to store sensitive financial documents. The company's compliance team requires that all objects be encrypted at rest using server-side encryption with AWS KMS managed keys (SSE-KMS). Additionally, the compliance team requires that if an object is not accessed for 90 days, it should be automatically moved to Amazon S3 Glacier to reduce costs. The SysOps administrator is tasked with implementing these requirements. The administrator creates an S3 bucket and enables default encryption with SSE-KMS. Then, the administrator creates a lifecycle policy with a transition action to Glacier after 90 days. During testing, the administrator notices that objects uploaded to the bucket are not being transitioned to Glacier after 90 days. What is the most likely cause of this issue?

Question 236hardmultiple choice
Read the full Reliability and Business Continuity explanation →

A company runs a containerized application on Amazon ECS with the Fargate launch type. The application is deployed across two Availability Zones and uses an Application Load Balancer (ALB) to distribute traffic. The application stores state in an Amazon ElastiCache Redis cluster. Recently, the development team deployed a new version of the application that introduced a memory leak. The leak causes the ECS tasks to consume more memory over time until they are OOM-killed. The SysOps administrator is asked to ensure that the application remains available and responsive. The administrator has configured an ECS service auto scaling based on average CPU utilization, but the memory leak causes tasks to fail before CPU utilization triggers scaling. Which action should the administrator take to automatically detect and replace failing tasks?

Question 237mediummultiple choice
Read the full Reliability and Business Continuity explanation →

A company has a production AWS account with multiple IAM users. The SysOps administrator needs to ensure that all IAM users have multi-factor authentication (MFA) enabled for console access. If a user does not have MFA enabled, they should be denied access to all AWS services except for the ability to enable MFA on their own account. The administrator has created an IAM policy that denies all actions if MFA is not present, but the policy is not working as expected; users without MFA can still access the S3 console. The administrator attaches the policy to all users. What is the most likely reason the policy is not effective?

Question 238easymultiple choice
Read the full Reliability and Business Continuity explanation →

A company runs a static website on Amazon S3 with a custom domain name (www.example.com). The website is accessed via Amazon CloudFront. The company's marketing team recently updated the website content, but users are reporting that they still see the old content. The SysOps administrator checks the S3 bucket and confirms that the new files are present. The administrator also checks CloudFront and finds that the default TTL for the cache behavior is 24 hours. The marketing team needs the new content to be visible immediately. What should the administrator do to make the new content available to users as quickly as possible?

Question 239mediummultiple choice
Read the full Reliability and Business Continuity explanation →

A company runs a critical application on EC2 instances in an Auto Scaling group. The application processes messages from an Amazon SQS queue. The SysOps administrator notices that during periods of high load, the SQS queue depth increases significantly, and the application takes a long time to recover. The administrator wants to improve the application's ability to handle spikes in traffic without over-provisioning resources. The application is stateless and can scale horizontally. What should the administrator do?

Question 240hardmultiple choice
Read the full Reliability and Business Continuity explanation →

A company runs a production web application on AWS using Auto Scaling groups (ASGs) behind an Application Load Balancer (ALB). The application state is stored in an Amazon RDS for MySQL Multi-AZ DB instance. The application experiences periodic traffic spikes, and the current ASG uses a simple scaling policy based on average CPU utilization. Recently, during a spike, the application became unresponsive for several minutes. The CloudWatch metrics show that the CPU utilization on the RDS instance peaked at 80%, and the DB Connections metric reached the maximum allowed. The read replica lag increased to over 10 seconds during the spike. The web servers are stateless and scale out quickly. The operations team needs to improve the reliability and performance of the application to handle future spikes. Which solution should the team implement?

Practice tests

Scored 10-question sessions with instant feedback and explanations.

SOA-C02 Practice Test 1 — 10 Questions→SOA-C02 Practice Test 2 — 10 Questions→SOA-C02 Practice Test 3 — 10 Questions→SOA-C02 Practice Test 4 — 10 Questions→SOA-C02 Practice Test 5 — 10 Questions→SOA-C02 Practice Exam 1 — 20 Questions→SOA-C02 Practice Exam 2 — 20 Questions→SOA-C02 Practice Exam 3 — 20 Questions→SOA-C02 Practice Exam 4 — 20 Questions→Free SOA-C02 Practice Test 1 — 30 Questions→Free SOA-C02 Practice Test 2 — 30 Questions→Free SOA-C02 Practice Test 3 — 30 Questions→SOA-C02 Practice Questions 1 — 50 Questions→SOA-C02 Practice Questions 2 — 50 Questions→SOA-C02 Exam Simulation 1 — 100 Questions→

Practice by domain

Each domain maps to a weighted exam section. Focus on the domain where you are weakest.

Monitoring, Logging, and RemediationReliability and Business ContinuityDeployment, Provisioning, and AutomationSecurity and ComplianceNetworking and Content DeliveryCost and Performance Optimization

Practice by scenario

Filter questions by type — troubleshooting, exhibit, drag-and-drop, PBQ, ACLs, OSPF, and more.

Browse scenarios→

Continue studying

All Reliability and Business Continuity setsAll Reliability and Business Continuity questionsSOA-C02 Practice Hub