Knowledge + Practice

AWS Certified AI Practitioner AIF-C01 (AIF-C01) — Questions 901–975

1000 questions total · 14pages · All types, answers revealed

Take a mock exam Exam hub

Page 13 of 14

901

MCQmedium

A company has deployed a machine learning model using Amazon SageMaker and wants to monitor the model for bias over time. Which SageMaker feature should they use to detect bias in the model's predictions after deployment?

A.SageMaker Debugger

B.SageMaker Model Monitor

C.SageMaker Clarify

D.SageMaker Role Manager

AnswerC

Clarify provides bias detection and explainability for ML models.

Why this answer

SageMaker Clarify is designed to detect bias in ML models both before and after deployment. It can analyze predictions to identify potential bias against certain groups.

Full explanation →

902

Multi-Selecteasy

A company needs to convert live customer support calls into text for analysis and search. Which TWO AWS services should be used together? (Select TWO.)

Select 2 answers

A.Amazon Rekognition

B.Amazon Comprehend

C.Amazon Transcribe

D.Amazon Polly

E.Amazon Translate

AnswersB, C

Comprehend provides NLP capabilities for text analysis and entity extraction.

Why this answer

Amazon Transcribe converts audio to text in real time or batch. Amazon Comprehend can then perform NLP on the transcribed text for analysis and entity extraction. Together they enable search and insights from call recordings.

Full explanation →

903

MCQeasy

A startup is developing a mobile app that uses facial recognition to verify user identity for account access. The app is intended for a global audience, but the training data predominantly includes images of light-skinned individuals. During beta testing, users with darker skin tones report frequent verification failures, while light-skinned users have a high success rate. The startup wants to release the app soon and needs to address this fairness issue without delaying the launch too much. The team has limited resources. Which approach should they take to most effectively mitigate the bias while meeting the launch timeline?

A.Apply a post-processing rule to increase acceptance rate for users with darker skin tones

B.Lower the similarity threshold for all users to improve acceptance rates

C.Defer verification for users with darker skin tones to manual human review

D.Collect more diverse training data and augment the existing dataset, then retrain the model

AnswerD

Adding diverse data addresses the root cause of bias.

Why this answer

The most effective approach is to collect additional training data representing diverse skin tones and augment the dataset, then retrain the model. This directly addresses the data imbalance. Applying a post-processing rule without retraining may not fix the underlying model bias.

Deferring to humans is a temporary workaround and does not scale. Reducing the threshold for all users could increase false positives and may not be acceptable.

Full explanation →

904

MCQeasy

A company develops a chatbot using Amazon Lex. To ensure transparency, what should the chatbot do when it cannot answer a question?

A.Remain silent and wait for the next input

B.Provide a random answer from a predefined list

C.Clearly state that it cannot answer and offer alternatives

D.Automatically escalate all unanswered questions to a human

AnswerC

Honest communication builds trust.

Why this answer

Option C is correct because responsible AI guidelines, including those from AWS for Amazon Lex, require that when a chatbot cannot answer a question, it should clearly state its inability and offer alternatives (e.g., rephrasing the query or providing related topics). This maintains transparency and user trust, aligning with the 'Explainability' principle under the Guidelines for Responsible AI.

Exam trap

AWS often tests the misconception that a chatbot should always escalate or remain passive when it cannot answer, but the correct approach under responsible AI is to acknowledge the limitation and offer alternatives, not to hide or mislead.

How to eliminate wrong answers

Option A is wrong because remaining silent and waiting for the next input violates transparency and can confuse users, as it provides no feedback or guidance. Option B is wrong because providing a random answer from a predefined list is deceptive and can mislead users, undermining the principle of honesty and accountability in AI. Option D is wrong because automatically escalating all unanswered questions to a human is inefficient and not always necessary; the chatbot should first attempt to offer alternatives or clarify before escalation, as per responsible AI practices.

Full explanation →

905

MCQeasy

A data scientist wants to restrict which IAM roles can invoke a specific Amazon Bedrock base model. Which AWS feature should they use?

A.S3 bucket policy on the model artifacts

B.Bedrock resource-based policy

C.AWS KMS key policy for the encryption key

D.AWS CloudTrail log delivery policy

AnswerB

Bedrock resource policies are applied to the model resource itself and can restrict invocation to specific IAM roles.

Why this answer

Bedrock resource policies allow you to attach fine-grained permissions to a specific model, specifying which principals (IAM roles) can invoke it.

Full explanation →

906

MCQmedium

A company wants to build a customer service chatbot that answers questions about their internal policy documents. The documents are updated monthly, and the team cannot afford to retrain a model each time. Which approach is MOST appropriate?

A.Train a custom model from scratch on the policy documents each month

B.Use Retrieval-Augmented Generation (RAG) with the policy documents indexed in a vector store

C.Use a larger foundation model with a longer context window and paste all documents into each prompt

D.Fine-tune a base LLM on the policy documents monthly

AnswerB

RAG retrieves relevant document chunks at query time, ensuring up-to-date answers without retraining.

Why this answer

RAG (Retrieval-Augmented Generation) allows the LLM to retrieve relevant document sections at inference time, so knowledge stays current without retraining. The other options either require expensive retraining for each update or lack document grounding.

Full explanation →

907

Multi-Selectmedium

A developer is using Amazon Bedrock to generate responses from a foundation model and wants to receive the output as a stream to improve user experience. Which TWO statements about streaming responses are correct? (Select TWO.)

Select 2 answers

A.Streaming reduces perceived latency by displaying partial results as they are generated

B.Streaming is supported by the Converse API

C.Streaming returns the entire response at once

D.Streaming is supported by the InvokeModel API

E.Streaming cannot be used with temperature or max_tokens parameters

AnswersA, B

Users see output sooner, improving responsiveness.

Why this answer

Streaming in Bedrock returns tokens incrementally, reducing perceived latency. The Converse API supports streaming by default (or via specific settings). The InvokeModel API does not support streaming; you must use InvokeModelWithResponseStream.

Temperature and max_tokens parameters can still be used with streaming.

Full explanation →

908

MCQeasy

Which AWS service can be used to create human review workflows for high-risk AI predictions, ensuring a human-in-the-loop?

A.Amazon SageMaker Clarify

B.Amazon Augmented AI (A2I)

C.AWS Glue

D.AWS Step Functions

AnswerB

A2I provides built-in human review workflows for machine learning predictions.

Why this answer

Amazon Augmented AI (A2I) is the service designed to set up human review workflows for AI predictions, enabling human oversight when needed.

Full explanation →

909

Multi-Selecteasy

A company uses Amazon Bedrock to build a generative AI application. They need to secure the application by restricting access to the model and preventing sensitive data from being stored in prompts. Which TWO actions should they take? (Choose two.)

Select 2 answers

A.Enable VPC Flow Logs for Bedrock.

B.Enable AWS CloudTrail to log all Bedrock API calls.

C.Use Amazon Bedrock Guardrails to define content filters and deny topics.

D.Implement IAM policies that allow only specific users to invoke the model.

E.Use AWS KMS to encrypt the model data.

AnswersC, D

Guardrails can filter sensitive data and enforce content policies.

Why this answer

Amazon Bedrock Guardrails (Option C) directly addresses the need to prevent sensitive data from being stored in prompts by allowing you to define content filters, deny topics, and configure sensitive information filters that block or mask such data before it reaches the model. This is the native service feature designed for content safety and data leakage prevention.

Exam trap

AWS often tests the distinction between logging/auditing services (CloudTrail, VPC Flow Logs) and actual security enforcement mechanisms (Guardrails, IAM), leading candidates to confuse observability with prevention.

Full explanation →

910

MCQmedium

Refer to the exhibit. A security officer has attached this IAM policy to a user to allow invocation of a SageMaker endpoint. However, the user is unable to invoke the endpoint from an EC2 instance in the same VPC. What is the most likely cause?

A.The SageMaker endpoint is not configured to accept traffic from the VPC.

B.The IAM policy does not allow the sagemaker:InvokeEndpoint action without conditions.

C.The user's IAM role is missing the s3:GetObject permission for model artifacts.

D.The EC2 instance is not using the specified VPC endpoint (vpce-12345678).

AnswerD

The condition restricts invocation to traffic coming through that specific VPC endpoint.

Why this answer

The policy condition requires that the request originates from a specific VPC endpoint (vpce-12345678). If the EC2 instance is not using that endpoint (e.g., it is using a different endpoint or direct VPC access), the request will be denied. Option B is correct because the instance likely does not use that specific VPC endpoint.

Full explanation →

911

Multi-Selecteasy

Which TWO AWS services can be used together to build a chatbot that leverages a foundation model for natural language understanding?

Select 2 answers

A.Amazon Rekognition

B.Amazon Lex

C.Amazon Polly

D.AWS Glue

E.Amazon Bedrock

AnswersB, E

Lex handles dialog management and intent recognition.

Why this answer

Amazon Lex provides the conversational interface and natural language understanding (NLU) to interpret user intents and manage dialog, while Amazon Bedrock gives access to foundation models (FMs) for advanced natural language generation and understanding. Together, Lex can route utterances to a Bedrock FM via a Lambda function or direct integration, enabling a chatbot that leverages a pre-trained FM for richer responses.

Exam trap

AWS often tests the distinction between services that handle conversational interfaces (Lex) versus those that provide generative AI models (Bedrock), tempting candidates to pick Polly (speech output) or Rekognition (vision) as part of a chatbot, when they are not core to NLU or FM integration.

Full explanation →

912

MCQhard

A company is using a machine learning model to predict employee turnover. The model's predictions are used to identify at-risk employees for retention efforts. The company wants to ensure that the model does not inadvertently discriminate against employees based on age. Which metric should be used to measure fairness across age groups?

A.Equalized odds

B.SHAP feature importance

C.Disparate impact ratio

D.Demographic parity

AnswerA

Equalized odds ensures equal true positive and false positive rates across groups.

Why this answer

Equalized odds requires that the model's true positive rate and false positive rate are equal across groups, which is appropriate for binary outcomes like turnover prediction.

Full explanation →

913

MCQeasy

Which component of the Transformer architecture allows the model to weigh the importance of different words in an input sequence when generating each output token?

A.Feed-forward neural network

B.Positional encoding

C.Self-attention mechanism

D.Layer normalization

AnswerC

Self-attention computes attention weights across all input tokens, capturing dependencies.

Why this answer

Self-attention computes attention scores between all pairs of positions in the input, enabling the model to focus on relevant context. Feed-forward layers process each position independently; positional encoding adds sequence order information; layer normalization stabilizes training.

Full explanation →

914

MCQhard

A team is training a deep learning model using Horovod distributed training on SageMaker. They observe that the loss stops decreasing after a few epochs. Which technique should they implement to reduce overfitting?

A.Increase learning rate

B.Add more layers to the model

C.Reduce the number of epochs

D.Use dropout regularization

AnswerD

Dropout is a regularization technique that reduces overfitting.

Why this answer

Dropout regularization randomly drops a fraction of neurons during training, which prevents the model from relying too heavily on specific features and forces it to learn more robust representations. This directly addresses overfitting, which is the likely cause of the loss plateauing after a few epochs in a Horovod distributed training setup on SageMaker.

Exam trap

The AIF-C01 exam often tests the misconception that early stopping (reducing epochs) is a regularization technique to reduce overfitting, but the trap here is that early stopping only halts training and does not actively prevent the model from memorizing noise during the epochs it does train.

How to eliminate wrong answers

Option A is wrong because increasing the learning rate can cause the optimizer to overshoot minima, leading to divergence or unstable training, not a reduction in overfitting. Option B is wrong because adding more layers increases model capacity, which typically worsens overfitting by making it easier for the model to memorize noise. Option C is wrong because reducing the number of epochs only stops training earlier; it does not address the underlying overfitting issue, and the loss may still plateau due to memorization rather than convergence.

Full explanation →

915

MCQmedium

A startup is building a medical diagnosis assistant. They have a small dataset of doctor-patient conversations. Which approach should they take to minimize cost while ensuring the model understands medical terminology?

Answer options not yet available.

Why this answer

Fine-tuning a pre-trained LLM on the medical dataset is cost-effective (small dataset) and adapts the model to the domain.

Full explanation →

916

Multi-Selectmedium

A company is building a generative AI application using Amazon Bedrock. They need to implement a RAG pipeline that ingests PDF documents, processes them, and stores embeddings for retrieval. Which THREE steps are essential in this pipeline?

Select 3 answers

A.Fine-tuning an LLM on the ingested documents

B.Chunking the documents into smaller pieces

C.Storing the embeddings in a vector store such as Amazon OpenSearch Serverless

D.Generating embeddings for each chunk using an embedding model

E.Augmenting the prompt with retrieved chunks at query time

AnswersB, C, D

Chunking is necessary to break large documents into segments that can be embedded and retrieved accurately.

Why this answer

Document ingestion involves chunking documents into manageable pieces, generating embeddings for each chunk, and storing those embeddings in a vector store for similarity search. Prompt augmentation is part of the retrieval step, not ingestion.

Full explanation →

917

MCQmedium

An AI practitioner is evaluating foundation models on Amazon Bedrock for a text summarization task. The input documents average 6,000 tokens. The model must process the entire document in a single pass without chunking. Which model capability is MOST critical for this requirement?

A.The output modality (text vs. multimodal)

B.The temperature setting

C.The context window size

D.The number of parameters in the model

AnswerC

Context window size determines the maximum input length; a model with a context window >= 6k tokens can process the document in one pass.

Why this answer

The context window defines the maximum number of tokens the model can handle in one pass. To process a 6,000-token document without chunking, the model's context window must be at least 6,000 tokens (plus output).

Full explanation →

918

Multi-Selecthard

A company wants to build a system that automatically routes support tickets to the appropriate department based on the text description. They have labeled historical data. Which THREE AWS services could be used to implement this solution? (Select THREE.)

Select 3 answers

A.Amazon Comprehend

B.Amazon SageMaker

C.Amazon Rekognition

D.Amazon Polly

E.Amazon Lex

AnswersA, B, E

Comprehend's custom classification can be trained on historical tickets to predict department.

Why this answer

Amazon Comprehend can perform custom classification. SageMaker allows building custom ML models. Amazon Lex can process text input and integrate with a bot to route tickets.

Full explanation →

919

MCQeasy

A company uses Amazon Bedrock Agents to build an agent that interacts with users through a chat interface. The agent is configured with a knowledge base containing product documentation. Sometimes the agent fails to answer simple questions like 'What is your return policy?' and instead says it cannot find the answer. The knowledge base does contain the return policy. What is the most likely reason?

A.Increase the agent's maximum timeout for processing

B.Use a more powerful foundation model for reasoning

C.Add more documents to the knowledge base

D.Simplify and clarify the agent's instruction prompt to emphasize knowledge base usage

AnswerD

A clear prompt instructing the agent to consult the knowledge base for all answers can dramatically improve consistency.

Why this answer

The agent's instruction prompt might be too complex or not explicitly directing the agent to use the knowledge base. Simplifying the prompt to clearly instruct the agent to first search the knowledge base can resolve the issue. Increasing timeout or adding more data is unnecessary.

A stronger model may help but is not the root cause.

Full explanation →

920

MCQmedium

A data scientist is training a binary classification model on a highly imbalanced dataset where the positive class represents only 2% of the data. The model currently achieves 98% accuracy but fails to identify any positive instances. Which metric is MOST appropriate to evaluate this model's performance?

A.RMSE

B.Accuracy

C.AUC-ROC

D.MAE

AnswerC

AUC-ROC evaluates the model's ability to distinguish between classes regardless of threshold, making it robust for imbalanced datasets.

Why this answer

When classes are imbalanced, accuracy can be misleading because a model that always predicts the majority class will still have high accuracy. AUC-ROC summarizes the trade-off between true positive rate and false positive rate across thresholds, making it a robust metric for imbalanced classification. F1 score would also be good, but AUC-ROC is more commonly recommended first.

Full explanation →

921

MCQhard

A team is fine-tuning a foundation model using SageMaker. They want to minimize training time while keeping the model's original knowledge. Which technique is BEST suited?

A.Use Parameter Efficient Fine-Tuning (PEFT) such as LoRA

B.Use distributed training across multiple GPUs

C.Use prompt engineering instead of fine-tuning

D.Full fine-tuning on the new dataset

AnswerA

PEFT methods adapt the model with fewer trainable parameters, reducing training time and preserving original knowledge.

Why this answer

Parameter Efficient Fine-Tuning (PEFT) methods like LoRA (Low-Rank Adaptation) are best suited because they freeze the pre-trained model weights and inject trainable low-rank matrices into specific layers, drastically reducing the number of trainable parameters. This minimizes training time and computational cost while preserving the model's original knowledge, as only a small fraction of parameters are updated during fine-tuning.

Exam trap

AWS often tests the distinction between techniques that modify the model (fine-tuning) versus those that only change the input (prompt engineering), and the trap here is that candidates may choose distributed training (Option B) thinking it reduces time, but it does not address parameter efficiency or knowledge preservation as directly as PEFT.

How to eliminate wrong answers

Option B is wrong because distributed training across multiple GPUs accelerates training but does not inherently preserve the model's original knowledge or reduce the number of updated parameters; it still requires full or partial parameter updates and does not address the goal of minimizing training time through parameter efficiency. Option C is wrong because prompt engineering is a zero-shot or few-shot inference technique that does not involve training at all, so it cannot be used to fine-tune the model on a new dataset. Option D is wrong because full fine-tuning updates all model parameters, which is computationally expensive, time-consuming, and risks catastrophic forgetting of the original knowledge, contrary to the goal of minimizing training time while preserving original knowledge.

Full explanation →

922

MCQmedium

A team is deploying a sentiment analysis application using Amazon Bedrock. They need to ensure the model returns only 'positive', 'negative', or 'neutral'. Which prompt engineering technique is BEST suited for this requirement?

A.Use a chain-of-thought prompt to reason about sentiment

B.Provide few-shot examples of the three labels in the prompt

C.Use a Bedrock Guardrail to block any response not matching the three labels

D.Include an instruction like 'Classify the sentiment as one of: positive, negative, neutral. Output only that word.'

AnswerD

Clear instruction with constrained output is the most direct prompt engineering technique.

Why this answer

Constrained generation with a controlled vocabulary is achieved by instructing the model to output only specific tokens. Few-shot examples help but do not guarantee strict adherence; output parsing via Guardrails can enforce format but not specific vocabulary.

Full explanation →

923

MCQmedium

A developer wants to generate an image of a cat in a spacesuit using Amazon Bedrock. Which model provider should they choose?

A.Anthropic

B.Stability AI

C.Meta

D.Cohere

AnswerB

Stability AI provides text-to-image diffusion models capable of generating images from prompts.

Why this answer

Stability AI offers diffusion models (e.g., Stable Diffusion) for image generation from text prompts.

Full explanation →

924

Multi-Selecthard

A company is implementing an AI governance framework for their machine learning models deployed on Amazon SageMaker. Which THREE actions should they include to manage the model lifecycle effectively? (Select THREE.)

Select 3 answers

A.Enable Amazon CloudWatch alarms for model accuracy metrics

B.Automatically delete all previous model versions after deployment

C.Create a decommissioning policy that retires old models

D.Use AWS CloudTrail to track model inference requests

E.Use SageMaker Model Registry to version models

AnswersA, C, E

Monitoring for drift is essential; CloudWatch can alert on accuracy drops.

Why this answer

Model versioning tracks changes, monitoring for drift ensures performance, and a decommissioning process retires outdated models. These are key lifecycle management practices.

Full explanation →

925

MCQhard

A company needs to transcribe and summarize customer support calls in real time. They want to use a large language model (LLM) for summarization but the audio input is streaming. Which approach should they use?

A.Use a multimodal LLM that directly accepts audio input

B.Use an image generation model to create a visual summary of the audio

C.Transcribe the audio using Amazon Transcribe and then pass the text to an LLM for summarization

D.Use an embedding model to convert audio directly into searchable vectors

AnswerC

Transcribe converts audio to text; the LLM then summarizes the transcript.

Why this answer

The pipeline should first transcribe speech to text (e.g., with Amazon Transcribe), then feed the text to an LLM for summarization.

Full explanation →

926

Multi-Selectmedium

A developer is evaluating generative AI models for a code generation task. Which THREE metrics are commonly used for automated evaluation of generated code? (Choose THREE.)

Select 3 answers

A.BERTScore

B.BLEU

C.ROUGE

D.Human evaluation score

E.Perplexity

AnswersA, B, C

BERTScore uses embeddings to compute similarity and is applicable to code.

Why this answer

Common automated metrics for code generation include BLEU (n-gram overlap), ROUGE (recall-oriented), and BERTScore (contextual similarity). Perplexity is for language modeling, and human evaluation is not automated. Note: While ROUGE and BLEU originate from NLP, they are also used for code generation evaluation.

Full explanation →

927

MCQeasy

A startup with limited ML expertise wants to quickly prototype a binary classification model using a small customer dataset. They need a managed environment to run Jupyter notebooks and access pre-built algorithms. Which AWS service should they choose?

A.AWS Lambda

B.Amazon SageMaker

C.Amazon EMR

D.AWS Glue

AnswerB

SageMaker provides managed notebooks and built-in algorithms for quick experimentation.

Why this answer

Amazon SageMaker is the correct choice because it provides a fully managed environment for Jupyter notebooks and includes built-in, pre-built algorithms for binary classification. This allows the startup to quickly prototype without deep ML expertise, as SageMaker handles infrastructure, scaling, and model training.

Exam trap

The AIF-C01 exam often tests the distinction between managed ML platforms (SageMaker) and general-purpose compute or data processing services (Lambda, EMR, Glue), leading candidates to pick a service that can run code but lacks the specific notebook and pre-built algorithm capabilities required.

How to eliminate wrong answers

Option A is wrong because AWS Lambda is a serverless compute service for running code in response to events, not a managed environment for Jupyter notebooks or pre-built ML algorithms. Option C is wrong because Amazon EMR is a big data processing service using frameworks like Apache Spark and Hadoop, not designed for interactive Jupyter notebook-based ML prototyping with pre-built algorithms. Option D is wrong because AWS Glue is a serverless data integration and ETL service, not a platform for running Jupyter notebooks or accessing pre-built ML models.

Full explanation →

928

MCQmedium

A company deploys a machine learning model for resume screening. They want to measure whether the model selects candidates proportionally across different demographic groups. Which fairness metric is most appropriate?

A.Demographic parity

B.Disparate impact

C.Accuracy parity

D.Equalized odds

AnswerB

Disparate impact measures the ratio of selection rates between groups, often using the 'four-fifths rule' (rate of protected group >= 0.8 times rate of majority group). It is a standard metric for fairness in hiring.

Why this answer

Disparate impact measures the ratio of favorable outcomes between groups, commonly used to assess whether a selection process disproportionately affects a protected group.

Full explanation →

929

MCQmedium

A media company runs batch inference jobs to generate captions for thousands of images weekly using a foundation model on Amazon Bedrock. They want to minimize costs while maintaining predictable throughput. Which pricing option should they choose?

A.SageMaker Batch Transform

B.On-demand inference

C.Provisioned Throughput

D.Spot instances (EC2 Spot)

AnswerC

Reserves capacity for a model, providing consistent performance and lower per-token cost for large batches.

Why this answer

Provisioned Throughput reserves capacity for a specific model, offering predictable performance and cost savings for steady workloads. On-demand is pay-per-use but may be costlier for high volume. Batch Transform is for SageMaker, not Bedrock.

Spot instances are not available for Bedrock.

Full explanation →

930

MCQmedium

A company wants to build a customer service chatbot that answers questions about their internal policy documents. The documents are updated monthly, and the team cannot afford to retrain a model each time. Which approach is MOST appropriate?

A.Train a custom model from scratch on the policy documents each month

B.Use Retrieval-Augmented Generation (RAG) with the policy documents indexed in a vector store

C.Fine-tune a base LLM on the policy documents monthly

D.Use a larger foundation model with a longer context window and paste all documents into each prompt

AnswerB

RAG retrieves relevant document chunks at query time, ensuring the chatbot always answers from the latest uploaded documents without any model retraining.

Why this answer

RAG (Retrieval-Augmented Generation) allows the LLM to retrieve relevant document sections at inference time, so knowledge stays current without retraining. The other options either require expensive retraining for each update or lack document grounding.

Full explanation →

931

MCQmedium

A developer wants to use Amazon Bedrock to build a chatbot that can maintain context across multiple turns of conversation. Which API should they use to simplify multi-turn interactions?

A.ListFoundationModels

B.InvokeModelWithResponseStream

C.Converse API

D.InvokeModel

AnswerC

The Converse API handles multi-turn conversation context, system prompts, and messages seamlessly.

Why this answer

The Converse API manages conversation history and context automatically, making it ideal for multi-turn chat.

Full explanation →

932

MCQhard

A company is using Amazon Bedrock to build a chatbot that must comply with data residency regulations. All data must remain in a specific AWS region. Which action is MOST important to meet this requirement?

Answer options not yet available.

Why this answer

Ensuring the model is invoked in the required region and data is not transferred out is critical. Bedrock does not offer customer-managed keys for model inference, and model customization may not be needed.

Full explanation →

933

MCQhard

A company needs to fine-tune a foundation model on a large dataset of proprietary documents. They are concerned about data privacy and want to ensure that no data leaves their AWS account. Which Amazon Bedrock feature should they use?

A.Custom Model Import

B.Guardrails for Amazon Bedrock

C.Provisioned Throughput

D.Knowledge Bases for Amazon Bedrock

AnswerA

Custom Model Import lets you import and fine-tune your own model in your own environment, ensuring data never leaves your control.

Why this answer

Custom Model Import allows you to bring your own model and fine-tune it using your own data within a VPC, keeping data fully contained.

Full explanation →

934

MCQmedium

A company wants to analyze customer feedback to identify common themes and sentiment. The feedback is in multiple languages. Which AWS services should be used together?

A.Amazon Translate and Amazon Comprehend

B.Amazon Transcribe and Amazon Comprehend

C.Amazon Polly and Amazon Translate

D.Amazon Rekognition and Amazon Comprehend

AnswerA

Translate handles multilingual input; Comprehend performs NLP analysis.

Why this answer

Amazon Comprehend provides sentiment analysis and key phrase extraction; Amazon Translate can translate feedback into a common language before analysis.

Full explanation →

935

MCQmedium

A data scientist is building a binary classification model for fraud detection. The dataset is highly imbalanced (99% legitimate, 1% fraud). Which metric is most appropriate to evaluate model performance?

A.Accuracy

B.F1-score

C.Recall

D.Precision

AnswerB

F1-score is the harmonic mean of precision and recall, providing a balanced evaluation for imbalanced datasets.

Why this answer

In highly imbalanced datasets (99% legitimate, 1% fraud), accuracy is misleading because a model that predicts all transactions as legitimate would achieve 99% accuracy but fail to detect any fraud. The F1-score is the harmonic mean of precision and recall, providing a balanced measure that accounts for both false positives and false negatives, making it the most appropriate metric for evaluating fraud detection models.

Exam trap

The AIF-C01 exam often tests the misconception that accuracy is always the best metric, especially when candidates overlook the impact of class imbalance on model evaluation.

How to eliminate wrong answers

Option A is wrong because accuracy is not suitable for imbalanced datasets; it can be artificially high (e.g., 99%) even if the model never identifies any fraud, as it only measures overall correct predictions. Option C is wrong because recall alone (true positive rate) ignores false positives; a model that predicts all transactions as fraud would have perfect recall but terrible precision, leading to many false alarms. Option D is wrong because precision alone ignores false negatives; a model that predicts only one transaction as fraud (and gets it right) could have 100% precision but miss the other 99 fraud cases, failing the detection goal.

Full explanation →

936

Multi-Selecthard

A company is deploying an Amazon Lex chatbot that processes customer credit card information. The company must ensure that the chatbot does not log sensitive data and that the data is encrypted in transit. Which THREE actions should the company take? (Choose THREE.)

Select 3 answers

A.Ensure the Amazon Lex endpoint uses HTTPS for all API calls

B.Enable audio logging in Amazon Lex with content redaction enabled

C.Enable CloudTrail data events for the Lex bot to capture conversation logs

D.Enable S3 server access logs on the bucket storing the Lex bot configuration

E.Configure the Amazon Lex bot to use obfuscation for the credit card slot type

AnswersA, B, E

HTTPS encrypts data in transit.

Why this answer

Option A is correct because Amazon Lex endpoints support HTTPS, which encrypts data in transit using TLS. By ensuring all API calls use HTTPS, the company protects sensitive credit card information from being intercepted during communication between the client and the Lex service.

Exam trap

The trap here is that candidates may confuse CloudTrail data events (which log API activity) with conversation logging (which captures user utterances), leading them to incorrectly select option C as a solution for preventing sensitive data logging.

Full explanation →

937

MCQhard

A data scientist is fine-tuning a foundation model on a custom dataset using Amazon SageMaker. After training, the model shows high accuracy on training data but poor on validation. Which action should be taken?

A.Add dropout layers

B.Reduce training epochs or add regularization

C.Increase learning rate

D.Use a different foundation model

AnswerB

Reducing epochs prevents overfitting; regularization also helps.

Why this answer

The model is overfitting, as indicated by high training accuracy but poor validation performance. Reducing training epochs or adding regularization (e.g., L1/L2 weight decay) directly addresses overfitting by limiting the model's capacity to memorize noise. In Amazon SageMaker, this can be implemented via hyperparameter tuning or by modifying the training script to include regularization terms.

Exam trap

AWS often tests the misconception that overfitting is solved by increasing model complexity or data augmentation, but the correct approach is to reduce capacity or add regularization.

How to eliminate wrong answers

Option A is wrong because adding dropout layers is a regularization technique that could help, but it is not the only or most direct action; the question asks for a single action, and reducing epochs or adding regularization (Option B) is a more fundamental fix for overfitting. Option C is wrong because increasing the learning rate can cause the model to diverge or overshoot minima, worsening generalization and potentially increasing overfitting. Option D is wrong because using a different foundation model does not address the root cause of overfitting; the current model is capable of learning the training data, and the issue is with training dynamics, not model architecture.

Full explanation →

938

MCQmedium

A company needs to ensure that model inference endpoints in SageMaker are only accessible from a private subnet in their VPC, and no traffic goes over the public internet. Which network configuration should they use?

A.Create a public endpoint and restrict access using IAM policies

B.Use AWS PrivateLink for the endpoint, but keep public endpoint enabled

C.Use a VPC interface endpoint for SageMaker and disable public access

D.Deploy the endpoint in a public subnet with a security group blocking all inbound traffic

AnswerC

An interface endpoint connects SageMaker to the VPC privately, and disabling public access ensures no internet traffic.

Why this answer

A VPC-only endpoint (interface endpoint) or private endpoint without public internet access ensures traffic stays within the VPC.

Full explanation →

939

MCQeasy

Which AWS service is used as a vector store in Amazon Bedrock Knowledge Bases for storing and retrieving embeddings?

A.Amazon DynamoDB

B.Amazon Redshift

C.Amazon OpenSearch Serverless

D.Amazon S3

AnswerC

Correct. OpenSearch Serverless provides vector database capabilities for Bedrock Knowledge Bases.

Why this answer

Amazon Bedrock Knowledge Bases support several vector stores, including Amazon OpenSearch Serverless, Aurora pgvector, Pinecone, and MongoDB Atlas. OpenSearch Serverless is the AWS-native managed solution frequently used for this purpose.

Full explanation →

940

MCQeasy

A company uses Amazon SageMaker to train sensitive ML models. Which AWS service should they use to encrypt the training data and model artifacts at rest?

A.AWS Secrets Manager

B.AWS CloudHSM

C.AWS Key Management Service (KMS)

D.AWS Certificate Manager

AnswerC

KMS provides centralized control over encryption keys used to encrypt data at rest in SageMaker.

Why this answer

AWS Key Management Service (KMS) allows customers to create and manage encryption keys used to encrypt data at rest in SageMaker, including training data and model artifacts.

Full explanation →

941

MCQhard

A machine learning team is building a credit risk model and discovers that the training data has a significant imbalance in loan approval rates between two demographic groups. They decide to reweight the training samples using a preprocessing technique. Which SageMaker Clarify feature can help compute the appropriate sample weights to achieve demographic parity?

A.Clarify preprocessing (reweighting)

B.Clarify post-training bias metrics

C.Model Monitor bias drift

D.Clarify explainability (SHAP)

AnswerA

Clarify provides a preprocessing transformation that reweights data to meet fairness constraints.

Why this answer

Option A is correct because SageMaker Clarify's preprocessing transforms include a reweighting method that assigns weights to instances to adjust for fairness. Post-training (B, C) are not preprocessing. Monitoring (D) is post-deployment.

Full explanation →

942

MCQmedium

A developer is using the Amazon Bedrock Converse API to build a multi-turn conversational AI. They need to send a user message along with system instructions and previous conversation history. How should they structure the API request to include both system prompt and message history?

A.Include system prompt as a user message in the 'messages' array

B.Include system prompt as an assistant message in the 'messages' array

C.Include system prompt in the 'system' parameter and user/assistant messages in the 'messages' array

D.Include system prompt in the 'inferenceConfig' parameter

AnswerC

This is the correct structure per the Converse API documentation.

Why this answer

The Bedrock Converse API accepts an optional 'system' list for system prompts and a 'messages' array for conversation history, with each message having a role (user/assistant) and content. The other options misuse roles or place system prompts incorrectly.

Full explanation →

943

MCQeasy

A company uses Amazon Rekognition to analyze images for a user-generated content platform. To comply with data residency requirements, they must ensure that images are not processed outside a specific AWS region. What should the company do?

A.Enable Amazon Rekognition VPC endpoints.

B.Use an AWS Lambda function to copy images to a bucket in the desired region and invoke Rekognition there.

C.Configure an AWS WAF rule to block cross-region processing.

D.Set up AWS Organizations to restrict the Rekognition service to one region.

AnswerB

This ensures data is processed only in the specified region.

Why this answer

Option B is correct because it directly addresses the data residency requirement by using a Lambda function to copy images to an S3 bucket in the desired region and then invoking Rekognition in that same region. This ensures that image data never leaves the specified AWS region, as Rekognition processes images only in the region where the API call is made. The other options either do not prevent cross-region processing or are not applicable to Rekognition's data handling.

Exam trap

The trap here is that candidates often confuse VPC endpoints or WAF with data residency controls, not realizing that Rekognition processes data in the region of the API call, and that network-level or application-layer controls do not enforce regional data processing boundaries.

How to eliminate wrong answers

Option A is wrong because Amazon Rekognition VPC endpoints allow private connectivity to the Rekognition API within a VPC, but they do not restrict the region where processing occurs; the API call still goes to the Region endpoint you specify, and data can be processed in any region you choose. Option C is wrong because AWS WAF is a web application firewall that operates at the application layer (HTTP/HTTPS) and cannot control or block cross-region processing of Rekognition API calls, which are made via the AWS API or SDK, not through HTTP requests filtered by WAF. Option D is wrong because AWS Organizations can manage accounts and service control policies (SCPs) to restrict service usage, but SCPs cannot restrict Rekognition to a single region; they can only deny access to the Rekognition API in specific regions, but the company's requirement is to ensure images are not processed outside a specific region, which is a data residency concern, not an API access control issue.

Full explanation →

944

MCQhard

A healthcare company is deploying a model to predict patient readmission risk using Amazon SageMaker. The model processes protected health information (PHI). The company must adhere to HIPAA regulations, which require that all PHI is encrypted at rest and in transit, and that access to the data is logged and audited. The data is stored in an S3 bucket encrypted with SSE-S3. The SageMaker training job uses a custom Docker container that reads data from the S3 bucket and writes model artifacts back to another S3 bucket. The security team notices that the model artifacts in the output bucket are not encrypted. Also, the training logs in CloudWatch Logs contain PHI because the training script logs patient features. The company must fix these issues without delaying the deployment. What should the company do?

A.Implement client-side encryption for the output bucket, and update the training script to mask PHI before logging.

B.Change the output S3 bucket to use SSE-KMS encryption, and enable server access logs for the bucket.

C.Modify the training container to avoid logging PHI, and configure the output S3 bucket to use SSE-KMS encryption.

D.Configure the training job to use a VPC with private subnets, and enable data encryption in transit using TLS.

AnswerC

This directly addresses both issues: encryption of output artifacts and removal of PHI from logs.

Why this answer

Option C is correct because it directly addresses both issues: modifying the training container to avoid logging PHI ensures CloudWatch Logs no longer contain protected health information, and configuring the output S3 bucket to use SSE-KMS encryption ensures model artifacts are encrypted at rest. SSE-KMS is preferred over SSE-S3 for HIPAA compliance because it provides additional controls like key rotation and audit trails. This approach fixes the problems without delaying deployment, as it requires only code and configuration changes.

Exam trap

The trap here is that candidates may focus on network-level controls (VPC, TLS) or logging mechanisms, overlooking the fact that the core issues are encryption at rest for the output bucket and preventing PHI from being written to CloudWatch Logs, which require application-level changes and bucket configuration.

How to eliminate wrong answers

Option A is wrong because client-side encryption for the output bucket would require the training container to encrypt data before writing, which does not address the PHI in CloudWatch Logs; also, client-side encryption is not a native S3 encryption method and adds complexity. Option B is wrong because enabling server access logs for the bucket only logs access requests, not the content of the logs, and does not prevent PHI from being logged in CloudWatch Logs; it also does not address the encryption of model artifacts. Option D is wrong because configuring a VPC with private subnets and TLS encryption addresses data in transit but does not solve the encryption-at-rest issue for the output bucket or the PHI in CloudWatch Logs.

Full explanation →

945

MCQmedium

A company uses Amazon Bedrock to run a question-answering system over a large internal knowledge base. They currently use a RAG approach with Titan Embeddings to index documents and a separate LLM for generation. The team notices that the retrieval often returns irrelevant chunks, causing the LLM to produce incorrect answers. Which action would MOST directly improve retrieval relevance?

A.Switch from Titan Embeddings to a more powerful embedding model like Cohere Embed on Bedrock

B.Add a system prompt instructing the LLM to only answer when confident

C.Reduce the chunk size of documents to 256 tokens

D.Increase the temperature of the LLM to 0.7

AnswerA

Cohere Embed offers higher dimensionality and better semantic understanding, which can improve retrieval accuracy.

Why this answer

Choosing an embedding model that produces higher-quality vector representations improves the semantic similarity search, directly addressing retrieval relevance. The other options do not focus on retrieval quality.

Full explanation →

946

MCQmedium

A financial services company is deploying a generative AI chatbot to assist customers with account inquiries. The company wants to ensure the chatbot does not generate biased or harmful responses. Which combination of AWS services and practices should the company implement to monitor and mitigate these risks?

A.Configure the chatbot to use a pre-trained model from SageMaker JumpStart and disable logging to avoid storing sensitive customer data.

B.Use Amazon Rekognition to analyze chat logs for biased language and automatically block responses with a confidence score above 90%.

C.Use Amazon SageMaker Clarify to detect bias in model outputs and implement a human-in-the-loop workflow with Amazon A2I to review flagged responses.

D.Deploy Amazon Lex with built-in sentiment analysis to detect negative customer emotions and automatically escalate to a human agent.

AnswerC

SageMaker Clarify provides bias detection, and Amazon A2I enables human review of model outputs to ensure responsible AI.

Why this answer

Option C is correct because Amazon SageMaker Clarify is specifically designed to detect bias in machine learning models and their outputs, while Amazon Augmented AI (A2I) enables a human-in-the-loop workflow to review flagged responses. This combination directly addresses the requirement to monitor and mitigate biased or harmful responses from a generative AI chatbot, ensuring responsible AI practices.

Exam trap

The AIF-C01 exam often tests the distinction between services that detect customer sentiment (like Amazon Comprehend or Lex sentiment analysis) versus services that detect bias in model outputs (like SageMaker Clarify), leading candidates to mistakenly choose sentiment analysis options for bias detection.

How to eliminate wrong answers

Option A is wrong because disabling logging prevents the monitoring and auditing necessary to detect biased or harmful responses, and using a pre-trained model from SageMaker JumpStart without additional safeguards does not mitigate bias. Option B is wrong because Amazon Rekognition is an image and video analysis service, not designed for analyzing text chat logs for biased language; it cannot process text-based conversations. Option D is wrong because Amazon Lex's built-in sentiment analysis detects customer emotions but does not detect bias or harmful content in the chatbot's responses, and escalation to a human agent does not proactively mitigate biased outputs.

Full explanation →

947

Multi-Selectmedium

Which TWO strategies can help reduce inference costs when using Amazon Bedrock? (Select TWO.)

Select 2 answers

A.Use a higher temperature setting to generate fewer tokens

B.Increase the max tokens to allow longer responses

C.Use provisioned throughput for high-volume, predictable workloads

D.Cache frequently used responses in Amazon ElastiCache

E.Select a smaller foundation model variant

AnswersC, E

Provisioned throughput offers a discounted hourly rate compared to on-demand per-request pricing.

Why this answer

Using provisioned throughput for predictable workloads reduces per-request cost. Choosing a smaller model variant requires less compute. Caching responses is not directly supported, and increasing max tokens increases cost.

Full explanation →

948

Multi-Selecthard

Which TWO of the following are best practices for preparing training data for a machine learning model?

Select 2 answers

A.Handle missing values by imputing or removing them.

B.Split the data into training, validation, and test sets.

C.Remove all outliers to improve model robustness.

D.Use the entire dataset for training to maximize data usage.

E.Avoid shuffling the data to preserve original order.

AnswersA, B

Missing values can cause errors or bias; imputation or removal is a standard practice.

Why this answer

Option A is correct because handling missing values is a critical data preprocessing step. Missing data can introduce bias or cause algorithms to fail. Imputation (e.g., using mean, median, or model-based methods) or removal of rows/columns with missing values ensures the dataset is complete and suitable for training, preventing errors during model fitting.

Exam trap

The AIF-C01 exam often tests the misconception that removing all outliers is always beneficial, when in fact domain knowledge is required to distinguish between noise and legitimate extreme values that may be critical for model accuracy.

Full explanation →

949

Multi-Selectmedium

A data science team wants to document and share their model's intended use, performance, and limitations with stakeholders. They also need to track the model's version and deployment history. Which TWO AWS services or features should they use?

Select 2 answers

A.Amazon SageMaker Pipelines

B.Amazon SageMaker Clarify

C.Amazon SageMaker Studio

D.Amazon SageMaker Model Cards

E.Amazon SageMaker Model Registry

AnswersD, E

Model Cards document intended use, performance, and limitations.

Why this answer

SageMaker Model Cards provide standardized documentation for transparency. SageMaker Model Registry tracks model versions, deployment stages, and metadata. SageMaker Pipelines is for ML workflows, not documentation or version tracking.

SageMaker Studio is an IDE. SageMaker Clarify is for bias and explainability.

Full explanation →

950

Multi-Selectmedium

A team is using Amazon SageMaker to build a text classification model. They have raw text data in CSV files stored in Amazon S3. Before training, they need to perform feature engineering. Which THREE actions should they take? (Select THREE.)

Select 3 answers

A.Normalize numerical features to have zero mean and unit variance

B.Tokenize the text into words or subwords

C.Remove punctuation and stop words from the text

D.Convert the text into numerical features using techniques like TF-IDF or word embeddings

E.Encode categorical variables using one-hot encoding

AnswersB, C, D

Tokenization is necessary to convert text into tokens.

Why this answer

Cleaning text (removing punctuation, stop words), tokenization, and converting text to numerical features (e.g., TF-IDF or word embeddings) are standard preprocessing steps for text classification. Normalizing numerical features is not relevant for raw text. Encoding categorical variables may be needed if there are categorical columns, but the primary focus for text data is the textual feature engineering.

Full explanation →

951

MCQeasy

Refer to the exhibit. A developer runs this command but gets an error: 'An error occurred (AccessDeniedException) when calling the ListFoundationModels operation'. What is the most likely cause?

A.The IAM role does not have bedrock:ListFoundationModels permission

B.The AWS CLI version is outdated

C.The foundation model is not available in us-west-2

D.The region us-west-2 does not support Bedrock

AnswerA

AccessDeniedException is due to missing IAM permissions.

Why this answer

The error 'AccessDeniedException' when calling ListFoundationModels indicates that the IAM role or user executing the AWS CLI command lacks the required permission to list foundation models in Amazon Bedrock. The specific permission needed is bedrock:ListFoundationModels, which must be attached to the IAM identity via a policy. Without this permission, the API call is denied regardless of other factors like region or CLI version.

Exam trap

AWS often tests the distinction between service availability errors (e.g., region not supported) and IAM permission errors, where candidates mistakenly attribute an AccessDeniedException to regional or model availability issues rather than missing IAM permissions.

How to eliminate wrong answers

Option B is wrong because an outdated AWS CLI version would typically produce a different error (e.g., 'InvalidClientTokenId' or 'UnrecognizedClientException'), not an AccessDeniedException, and the ListFoundationModels API is available in recent CLI versions. Option C is wrong because the error is an access denial, not a model availability issue; if a model were unavailable, the error would be something like 'ValidationException' or 'ResourceNotFoundException' when trying to use that specific model. Option D is wrong because us-west-2 (Oregon) fully supports Amazon Bedrock and its APIs; the error is explicitly an IAM permissions issue, not a regional unsupported service error.

Full explanation →

952

MCQmedium

A company wants to build a customer service chatbot that answers questions about their internal policy documents. The documents are updated monthly, and the team cannot afford to retrain a model each time. Which approach is MOST appropriate?

A.Fine-tune a base LLM on the policy documents monthly

B.Use a larger foundation model with a longer context window and paste all documents into each prompt

C.Use Retrieval-Augmented Generation (RAG) with the policy documents indexed in a vector store

D.Train a custom model from scratch on the policy documents each month

AnswerC

RAG retrieves relevant document chunks at query time, ensuring the chatbot always answers from the latest uploaded documents without any model retraining.

Why this answer

RAG (Retrieval-Augmented Generation) allows the LLM to retrieve relevant document sections at inference time, so knowledge stays current without retraining. The other options either require expensive retraining for each update or lack document grounding.

Full explanation →

953

MCQeasy

What is the primary advantage of the transformer architecture over previous RNN-based architectures for natural language processing tasks?

A.It processes tokens sequentially to maintain the order of the sequence

B.It requires less training data because it uses convolutional layers

C.It uses a gated recurrent unit to selectively forget information

D.It relies on a self-attention mechanism that allows parallel processing of all tokens

AnswerD

Self-attention enables parallel computation and captures long-range dependencies effectively.

Why this answer

The transformer’s self-attention mechanism processes all tokens in parallel, enabling much faster training and better handling of long-range dependencies compared to sequential RNNs.

Full explanation →

954

MCQhard

A company is using Amazon Bedrock to build a sentiment analysis application for customer reviews. They need to evaluate the model's performance against a labeled test dataset. They want to use a metric that compares the model's predicted sentiment (positive, negative, neutral) to the ground truth labels. Which metric is MOST appropriate?

A.ROUGE-1

B.BERTScore

C.BLEU

D.Accuracy

AnswerD

Accuracy measures the proportion of correct predictions.

Why this answer

For classification tasks, accuracy (or F1 score) is the appropriate metric. ROUGE, BLEU, and BERTScore are for text generation tasks.

Full explanation →

955

Multi-Selecthard

Refer to the exhibit. A data scientist trained an XGBoost model using Amazon SageMaker. Which TWO actions should the data scientist take to improve the model's performance based on the exhibited training job metrics and resource configuration?

Select 2 answers

A.Reduce the VolumeSizeInGB to save costs.

B.Decrease the learning rate and increase the number of training rounds.

C.Increase the maximum number of leaf nodes in the XGBoost algorithm.

D.Use a distributed training strategy by increasing InstanceCount to 4.

E.Switch to a more powerful instance type to reduce training time.

AnswersB, C

A lower learning rate with more rounds typically improves convergence and model performance.

Why this answer

Option A is correct because increasing the maximum number of leaf nodes allows the model to capture more complex patterns, potentially improving AUC. Option C is correct because a lower learning rate with more training rounds often leads to better convergence and performance. Option B (distributed training) primarily reduces training time, not model performance.

Option D (switching instance type) also speeds up training but doesn't directly improve metrics. Option E (reducing volume size) saves cost but does not enhance performance.

Full explanation →

956

MCQmedium

A company uses Amazon Bedrock Guardrails to filter harmful content. They want to ensure that the model does not generate responses containing specific keywords related to their internal project names. Which Guardrails component should they configure?

A.Harmful content filters

B.Topic restrictions

C.Word filters

D.Grounding checks

AnswerC

Word filters allow blocking specific words or phrases.

Why this answer

Word filters in Bedrock Guardrails allow you to specify custom words or phrases that should be blocked in model responses.

Full explanation →

957

Multi-Selecteasy

A company wants to use AWS services to process natural language text. Which TWO AWS services provide natural language processing (NLP) capabilities? (Select TWO.)

Select 2 answers

A.Amazon Translate

B.Amazon Rekognition

C.Amazon Comprehend

D.Amazon Polly

E.Amazon Lex

AnswersC, E

Comprehend provides NLP features like sentiment analysis, entity recognition, and key phrase extraction.

Why this answer

Amazon Comprehend is a natural language processing (NLP) service that uses machine learning to extract insights such as entities, key phrases, sentiment, and syntax from text. Amazon Lex provides NLP capabilities through automatic speech recognition (ASR) and natural language understanding (NLU) to build conversational interfaces like chatbots.

Exam trap

The trap here is that candidates often confuse text-to-speech (Polly) or translation (Translate) with NLP, but these services do not perform language understanding or analysis—they only convert or generate speech/translation without extracting meaning.

Full explanation →

958

MCQhard

A team trained a binary classifier to detect fraudulent transactions. The dataset is highly imbalanced (1% fraud). The model achieves 99% accuracy but only catches 5% of actual fraud cases. Which metric should the team primarily optimize?

A.Recall

B.F1 score

C.Accuracy

D.Precision

AnswerA

Recall measures the fraction of actual fraud cases that are detected, directly addressing the low catch rate.

Why this answer

With high class imbalance, accuracy is misleading. Recall (true positive rate) measures the proportion of actual fraud cases caught, which is the key concern here.

Full explanation →

959

Multi-Selecthard

Which THREE considerations are essential for ensuring responsible AI in a model that predicts employee performance? (Choose 3)

Select 3 answers

A.Minimize the number of features to reduce cost

B.Publish the model's predictions publicly for transparency

C.Incorporate human review before final decisions

D.Ensure employee data privacy and consent

E.Test for bias across demographic groups

AnswersC, D, E

Human-in-the-loop reduces automation bias.

Why this answer

Option C is correct because responsible AI frameworks, such as those outlined by AWS, mandate human-in-the-loop (HITL) oversight for high-stakes decisions like employee performance predictions. This ensures that automated outputs are reviewed for context, fairness, and potential errors before affecting employment outcomes, aligning with the AIF-C01 domain's emphasis on human accountability.

Exam trap

The AIF-C01 exam often tests the misconception that transparency means public disclosure of all model outputs, whereas in responsible AI, transparency refers to explainability and auditability of the model's logic, not exposing sensitive predictions.

Full explanation →

960

MCQmedium

A developer is using the Amazon Bedrock Converse API to build a chat application. They want the model to maintain context across multiple turns. Which parameter should they set to ensure the conversation history is included?

A.inferenceConfig

B.messages

C.additionalModelRequestFields

D.system

AnswerB

The messages parameter should contain an array of previous user and assistant messages for context.

Why this answer

The Converse API expects messages as a list of previous turns. By including the full conversation history in the messages parameter, the model can reference past exchanges.

Full explanation →

961

Multi-Selecthard

A financial institution is using Amazon Bedrock for a customer-facing application. They must ensure compliance with data residency requirements: model inputs and outputs must not leave a specific AWS Region. Which THREE steps should they take? (Choose THREE)

Select 3 answers

A.Set up an S3 bucket for model invocation logs in a different Region for disaster recovery

B.Enable Bedrock model invocation logging to an S3 bucket and CloudWatch Logs in the same Region

C.Use AWS KMS to encrypt all data at rest with a customer managed key in the desired Region

D.Configure an AWS PrivateLink VPC endpoint for Amazon Bedrock in the desired Region

E.Use AWS Global Accelerator to route traffic to Bedrock across regions for better performance

AnswersB, C, D

Logging data must stay in the Region to meet residency requirements.

Why this answer

To ensure data residency, the Bedrock model must be accessed only via a VPC endpoint (so traffic stays within AWS network), and all storage and logging must be in the desired Region. CloudTrail and invocation logs must also be stored in that Region.

Full explanation →

962

Multi-Selecthard

A research team is using Amazon SageMaker to fine-tune a large language model. They want to optimize training cost and time without sacrificing model quality. Which THREE strategies should they implement? (Choose 3)

Select 3 answers

A.Use a larger instance type with more GPUs.

B.Apply parameter-efficient fine-tuning (PEFT) techniques like LoRA.

C.Increase the batch size to the maximum that fits in GPU memory.

D.Use managed spot training with checkpointing.

E.Enable mixed precision training (FP16).

AnswersB, D, E

LoRA fine-tunes a small subset of parameters, reducing compute and memory.

Why this answer

Option B is correct because Parameter-Efficient Fine-Tuning (PEFT) techniques like LoRA (Low-Rank Adaptation) freeze the pre-trained model weights and inject trainable rank decomposition matrices into specific layers. This drastically reduces the number of trainable parameters (often by 10,000x), lowering memory and compute requirements while preserving model quality, making it ideal for cost- and time-sensitive fine-tuning.

Exam trap

The AIF-C01 exam often tests the misconception that simply scaling up hardware (larger instances) or maximizing batch size is the best optimization strategy, when in fact algorithmic efficiency (PEFT, mixed precision) and cost-saving infrastructure (spot instances) are the correct approaches for balancing cost, time, and quality.

Full explanation →

963

Multi-Selecthard

A company is deploying a chatbot using Amazon Bedrock and wants to ensure that the model does not generate offensive or inappropriate content. Which THREE measures can they apply?

Select 3 answers

A.Use a system prompt to define ethical guidelines and constraints

B.Implement a human-in-the-loop review process for flagged responses

C.Increase the temperature to make outputs more creative and less likely to repeat offensive phrases

D.Enable content filtering via Bedrock's guardrails or built-in filters

E.Fine-tune the model on a dataset of safe conversations only

AnswersA, B, D

System prompts steer model behavior toward safe outputs.

Why this answer

System prompts set behavior, content filters block harmful output, and human review catches edge cases.

Full explanation →

964

MCQmedium

A machine learning engineer needs to train a model to predict customer churn. The dataset includes categorical features such as 'Region' (10 possible values) and 'SubscriptionType' (5 possible values). Which feature engineering technique should be used to convert these categorical features into numeric form for a linear regression model?

A.Label encoding

B.Standardization (z-score normalization)

C.One-hot encoding

D.Principal component analysis (PCA)

AnswerC

One-hot encoding creates binary columns for each category, avoiding ordinal assumptions and suitable for linear regression.

Why this answer

One-hot encoding creates binary columns for each category, which is appropriate for linear models because it does not imply ordinal relationships. Label encoding would introduce false ordinal relationships, and feature scaling or PCA are not encoding methods.

Full explanation →

965

MCQmedium

A machine learning engineer is using Amazon SageMaker to train a model and wants to automatically stop the training job if the loss does not improve for 10 consecutive epochs. Which SageMaker feature should be used?

A.SageMaker built-in algorithms with early stopping

B.SageMaker Training Compiler

C.SageMaker Debugger

D.SageMaker Experiments

AnswerA

Built-in algorithms support early stopping parameters.

Why this answer

Amazon SageMaker built-in algorithms support early stopping, which allows you to automatically terminate a training job when a specified metric, such as loss, stops improving for a defined number of consecutive epochs. This feature is configured directly in the algorithm's hyperparameters (e.g., `early_stopping_patience` for the XGBoost algorithm) and helps save compute time and cost by preventing overfitting.

Exam trap

The AIF-C01 exam often tests the distinction between monitoring (Debugger) and automated action (early stopping), leading candidates to mistakenly choose Debugger because it can detect stagnation, but it cannot stop the job without custom code or a separate hook.

How to eliminate wrong answers

Option B is wrong because SageMaker Training Compiler is a feature that accelerates training by optimizing the model graph and memory usage, but it does not provide any mechanism to monitor metrics or stop training based on convergence. Option C is wrong because SageMaker Debugger is used to monitor training in real time, capture tensors, and set rules (e.g., vanishing gradient detection), but it does not automatically stop the training job; it only emits alerts or saves data for analysis. Option D is wrong because SageMaker Experiments is a tool for organizing, tracking, and comparing training runs, but it has no built-in capability to halt a training job based on metric stagnation.

Full explanation →

966

MCQhard

A financial services company is deploying a fraud detection model using Amazon SageMaker. They need to ensure that inference requests are encrypted in transit and that the model endpoint is only accessible from within a VPC. What is the MOST secure way to achieve this?

A.Enable public endpoint with TLS and restrict access using a VPN

B.Use SageMaker Model Monitor to detect anomalies in request data

C.Create a SageMaker endpoint in a private subnet without an internet gateway, and use AWS PrivateLink

D.Deploy the model endpoint in a VPC with an interface VPC endpoint (AWS PrivateLink) and configure the model to require HTTPS

AnswerD

This ensures traffic stays within the VPC and uses TLS encryption in transit.

Why this answer

Option D is correct because it combines two critical security controls: deploying the SageMaker endpoint within a VPC using an interface VPC endpoint (AWS PrivateLink) ensures that all traffic stays within the AWS network and never traverses the public internet, while configuring the model to require HTTPS enforces encryption in transit. This architecture eliminates exposure to the internet and meets the requirement for encrypted inference requests and VPC-only access.

Exam trap

The trap here is that candidates often confuse 'deploying in a private subnet' with 'making the endpoint accessible within the VPC'—a SageMaker endpoint in a private subnet without an internet gateway or VPC endpoint cannot be reached because the SageMaker service itself needs a network path to invoke the endpoint, and without a VPC endpoint, the endpoint is effectively isolated and unreachable.

How to eliminate wrong answers

Option A is wrong because enabling a public endpoint with TLS still exposes the endpoint to the internet, and VPN access does not restrict access to within a VPC—it only adds an encrypted tunnel from a remote network. Option B is wrong because SageMaker Model Monitor is a feature for detecting data drift and model quality issues, not for controlling network access or encrypting inference traffic. Option C is wrong because creating a SageMaker endpoint in a private subnet without an internet gateway prevents the endpoint from being reachable at all—SageMaker endpoints require network connectivity to the SageMaker service for invocation, and without an internet gateway or VPC endpoint, the endpoint cannot receive requests.

Full explanation →

967

MCQmedium

A machine learning team needs to share a SageMaker notebook with a colleague from a different AWS account. The colleague should be able to open and run the notebook but not delete it. Which combination of actions should the team take?

A.Use AWS Lake Formation to share the notebook

B.Attach a resource-based policy to the notebook instance that grants the colleague's IAM role permissions to open and run, and deny delete

C.Create an IAM role in the colleague's account with permissions to create and manage SageMaker notebooks

D.Share the notebook via S3 with a bucket policy that grants read/write access to the colleague's account

AnswerB

Resource-based policies on the notebook instance can grant cross-account access with specific actions allowed.

Why this answer

To share a SageMaker notebook cross-account, you need to share the notebook instance or the notebook file via S3. Using a resource-based policy on the notebook instance allows cross-account access, and IAM policies can restrict actions (e.g., deny DeleteNotebookInstance). Alternatively, sharing via S3 with appropriate bucket policies and IAM roles is also valid, but the simplest approach is to use a resource-based policy on the notebook instance.

Full explanation →

968

Multi-Selectmedium

A company uses Amazon Macie to discover sensitive data in an S3 bucket containing training datasets. The bucket policy currently prohibits access from external accounts. Which TWO steps are necessary to allow a cross-account SageMaker training job to access this bucket while maintaining security?

Select 2 answers

A.Configure Macie to automatically grant access to the SageMaker execution role

B.Create a VPC endpoint for S3 and associate it with the SageMaker VPC

C.Add a bucket policy that grants the SageMaker execution role from the other account s3:GetObject and s3:ListBucket permissions

D.Attach an IAM policy to the SageMaker execution role that allows s3:GetObject and s3:ListBucket on the source bucket

E.Remove the bucket policy that prohibits external access

AnswersC, D

The bucket policy must explicitly allow the cross-account role.

Why this answer

For cross-account access, the bucket policy must grant the required permissions to the SageMaker execution role (from the other account), and the SageMaker execution role must have an IAM policy that allows it to access the S3 bucket. Macie doesn't control access. VPC endpoints are optional.

Full explanation →

969

MCQmedium

A company wants to build a customer service chatbot that answers questions about their internal policy documents. The documents are updated monthly, and the team cannot afford to retrain a model each time. Which approach is MOST appropriate?

A.Fine-tune a base LLM on the policy documents monthly

B.Use a larger foundation model with a longer context window and paste all documents into each prompt

C.Train a custom model from scratch on the policy documents each month

D.Use Retrieval-Augmented Generation (RAG) with the policy documents indexed in a vector store

AnswerD

RAG retrieves relevant document chunks at query time, ensuring the chatbot always answers from the latest uploaded documents without any model retraining.

Why this answer

RAG (Retrieval-Augmented Generation) allows the LLM to retrieve relevant document sections at inference time, so knowledge stays current without retraining. The other options either require expensive retraining for each update or lack document grounding.

Full explanation →

970

Multi-Selectmedium

A developer is building an agent using Amazon Bedrock Agents to handle customer support inquiries. The agent needs to look up order status from a database and escalate complex issues to a human. Which THREE components are essential for this agent?

Select 3 answers

A.A Lambda function that connects to the database and returns results

B.A fine-tuned foundation model specific to the support domain

C.A custom model trained on historical support tickets

D.A knowledge base containing troubleshooting documentation

E.An action group with an OpenAPI schema for the database query

AnswersA, D, E

Lambda provides the execution logic for the action group.

Why this answer

An action group to query the database, a Lambda function to execute the query, and a knowledge base are essential. IAM roles are needed but not listed here. The other options are not required: fine-tuning is not needed, and custom models are optional.

Full explanation →

971

MCQmedium

A data scientist is preparing features for a linear regression model. One feature 'income' has values ranging from $15,000 to $350,000, while another 'age' ranges from 18 to 70. Which feature engineering step is most important?

A.Polynomial feature creation

B.One-hot encoding

C.Feature selection using mutual information

D.Standardization (z-score scaling)

AnswerD

Standardization scales features to have zero mean and unit variance, preventing scale dominance.

Why this answer

Standardization (or normalization) scales features to a similar range, which is critical for linear models to avoid one feature dominating the loss due to its magnitude.

Full explanation →

972

MCQeasy

Refer to the exhibit. A developer wants to choose a model that can generate text (not just embeddings) and has the lowest cost. Based on the exhibit, which model should they select?

A.Titan Embed Text

B.Titan Text Express

C.Titan Text Lite

D.Need more information

AnswerC

Titan Text Lite is a text generation model and is the most cost-effective option among those listed.

Why this answer

Option A, Titan Text Lite, is a text generation model and is the lighter, cheaper option compared to Express. Titan Text Express is more expensive. Titan Embed Text is for embeddings, not text generation.

Therefore, Titan Text Lite is correct.

Full explanation →

973

MCQeasy

Which AWS service provides access to a wide variety of foundation models from different providers through a single API, without managing underlying infrastructure?

A.Amazon SageMaker

B.AWS Lambda

C.Amazon Rekognition

D.Amazon Bedrock

AnswerD

Bedrock provides a single API to access multiple foundation models from various providers.

Why this answer

Amazon Bedrock is a fully managed service that offers foundation models from providers like Anthropic, Meta, Mistral, Amazon, Cohere, and Stability AI through a single API. SageMaker is for custom model training and deployment. Lambda is for serverless functions.

Rekognition is for image/video analysis.

Full explanation →

974

MCQhard

A healthcare startup uses Amazon SageMaker to train a model predicting patient readmission. They need to ensure the model's predictions do not discriminate based on protected attributes like age or race. Which SageMaker feature allows them to monitor and mitigate bias during training?

A.SageMaker Model Monitor

B.SageMaker Autopilot

C.SageMaker Debugger

D.SageMaker Clarify

AnswerD

Provides bias detection and explainability.

Why this answer

Option B is correct: SageMaker Clarify provides bias detection and mitigation. Option A is wrong: Model Monitor is for data drift. Option C is wrong: Debugger is for debugging training issues.

Option D is wrong: Autopilot automates model building.

Full explanation →

975

MCQeasy

A company wants to use a pre-trained foundation model for sentiment analysis without any customization. Which Amazon Machine Learning service provides access to foundation models via API?

A.Amazon Bedrock

B.Amazon Textract

C.Amazon Comprehend

D.Amazon Rekognition

AnswerA

Why this answer

Amazon Bedrock provides a managed API to access foundation models from providers like AI21 Labs, Anthropic, and Amazon. Amazon Rekognition is for images; Textract for document text; Comprehend for natural language processing (but not foundation models per se).

Full explanation →

Page 13 of 14

All pages

1 2 3 4 5 6 7 8 9 10 11 12 13 14

Practice AIF-C01 by domain

Target a specific domain to shore up weak areas.

Applications of Foundation Models Fundamentals of AI and ML Fundamentals of Generative AI Guidelines for Responsible AI Security, Compliance and Governance for AI Solutions

See all domains with question counts →

AWS Certified AI Practitioner AIF-C01 AIF-C01 Questions 901–975 | Page 13/14 | Courseiva