Knowledge + Practice

AWS Certified AI Practitioner AIF-C01 (AIF-C01) — Questions 601–675

1000 questions total · 14pages · All types, answers revealed

Take a mock exam Exam hub

Page 9 of 14

601

MCQeasy

A company wants to use AI to automatically transcribe customer service calls into text. Which AWS service is most suitable?

A.Amazon Transcribe

B.Amazon Comprehend

C.Amazon Polly

D.Amazon Rekognition

AnswerA

Transcribe is designed for speech-to-text conversion.

Why this answer

Amazon Transcribe is the correct choice because it is a fully managed automatic speech recognition (ASR) service designed specifically to convert speech into text. It can handle real-time streaming or batch processing of audio files, making it ideal for transcribing customer service calls into searchable text.

Exam trap

The trap here is that candidates often confuse Amazon Transcribe (speech-to-text) with Amazon Polly (text-to-speech) or assume Amazon Comprehend can process audio directly, when in fact Comprehend only works on text input.

How to eliminate wrong answers

Option B is wrong because Amazon Comprehend is a natural language processing (NLP) service used for extracting insights like sentiment, entities, and key phrases from text, not for transcribing audio. Option C is wrong because Amazon Polly is a text-to-speech (TTS) service that converts text into lifelike speech, the opposite of the required speech-to-text functionality. Option D is wrong because Amazon Rekognition is a computer vision service for analyzing images and videos, such as object detection and facial recognition, and has no capability to process audio or transcribe speech.

Full explanation →

602

MCQeasy

A developer wants to quickly experiment with multiple foundation models using a single API. Which service provides this capability?

A.Amazon Bedrock

B.AWS Lambda

C.Amazon Bedrock

D.Amazon SageMaker Studio

AnswerA

Bedrock provides a single API to invoke multiple foundation models.

Why this answer

Amazon Bedrock is a fully managed service that provides a single API to access and experiment with multiple foundation models from leading AI providers like AI21 Labs, Anthropic, Cohere, Meta, Stability AI, and Amazon itself. This allows developers to quickly test different models without managing underlying infrastructure or learning separate APIs for each provider.

Exam trap

The trap here is that candidates may confuse Amazon Bedrock with Amazon SageMaker, thinking SageMaker also provides a unified API for multiple foundation models, but SageMaker requires you to deploy and manage individual models, whereas Bedrock is purpose-built for serverless access to a curated set of foundation models via a single API.

How to eliminate wrong answers

Option B (AWS Lambda) is wrong because Lambda is a serverless compute service for running code in response to events, not a service for accessing or experimenting with foundation models via a single API. Option C (Amazon Bedrock) is actually the same as the correct answer (A) and is listed as a duplicate; in the exam, such duplicates are typically a distractor, but since both A and C are identical, the correct choice is the one marked as correct (A). Option D (Amazon SageMaker Studio) is wrong because SageMaker Studio is an integrated development environment (IDE) for building, training, and deploying machine learning models, but it does not provide a single unified API for multiple foundation models; it requires you to manage models and endpoints yourself.

Full explanation →

603

MCQmedium

Refer to the exhibit. The training job is failing with an error 'CUDA out of memory'. Which hyperparameter change is MOST likely to resolve the issue?

A.Increase the number of epochs to 10

B.Increase learning_rate to 5e-4

C.Reduce per_device_train_batch_size to 4

D.Increase max_seq_length to 1024

AnswerC

Smaller batch size uses less GPU memory.

Why this answer

The 'CUDA out of memory' error indicates that the GPU memory is exhausted during training. Reducing `per_device_train_batch_size` directly decreases the number of samples processed simultaneously per GPU, which lowers memory consumption for activations, gradients, and optimizer states. This is the most direct and effective hyperparameter change to resolve an out-of-memory condition.

Exam trap

AWS often tests the misconception that increasing epochs or learning rate can fix resource exhaustion errors, when in fact only adjustments that reduce per-step memory usage (like batch size or sequence length) are effective.

How to eliminate wrong answers

Option A is wrong because increasing the number of epochs does not affect per-step memory usage; it only increases the total number of training iterations, which would not resolve an immediate memory allocation failure. Option B is wrong because increasing the learning rate changes the step size for gradient updates but has no impact on GPU memory consumption during forward/backward passes. Option D is wrong because increasing `max_seq_length` increases the sequence length of input tokens, which enlarges the memory footprint for attention matrices and hidden states, making the out-of-memory error worse.

Full explanation →

604

Multi-Selecteasy

A retail company is deploying a machine learning model to analyze customer reviews and predict sentiment. The team wants to follow responsible AI guidelines to ensure fairness, transparency, and accountability. Which TWO actions should the team take? (Choose TWO.)

Select 2 answers

A.Use SageMaker Debugger to optimize training performance.

B.Use SageMaker Clarify to evaluate bias in the training data.

C.Use SageMaker Model Monitor to automatically retrain the model when drift is detected.

D.Use Amazon Rekognition to detect personally identifiable information (PII) in the review text.

E.Use SageMaker Model Cards to document the model's intended use, limitations, and evaluation results.

AnswersB, E

This is a core fairness practice to detect and mitigate bias.

Why this answer

Option A: SageMaker Clarify detects bias in training data, which is a core fairness practice. Option C: SageMaker Model Cards document the model's intended use, limitations, and evaluation results, promoting transparency and accountability. Option B: Model Monitor tracks data drift, not directly a responsible AI practice.

Option D: Rekognition is for image moderation, not relevant for text sentiment. Option E: Debugger optimizes training, not responsible AI.

Full explanation →

605

MCQmedium

A company wants to build a customer service chatbot that answers questions about their internal policy documents. The documents are updated monthly, and the team cannot afford to retrain a model each time. Which approach is MOST appropriate?

A.Fine-tune a base LLM on the policy documents monthly

B.Use Retrieval-Augmented Generation (RAG) with the policy documents indexed in a vector store

C.Use a larger foundation model with a longer context window and paste all documents into each prompt

D.Train a custom model from scratch on the policy documents each month

AnswerB

RAG retrieves relevant document chunks at query time, ensuring the chatbot always answers from the latest uploaded documents without any model retraining.

Why this answer

RAG (Retrieval-Augmented Generation) allows the LLM to retrieve relevant document sections at inference time, so knowledge stays current without retraining. The other options either require expensive retraining for each update or lack document grounding.

Full explanation →

606

MCQhard

A financial services company is using Amazon Bedrock to generate investment summaries. They want to ensure that the model outputs are factually accurate and based on the latest market data. Which combination of services should they use to achieve this? (Select TWO)

A.Amazon SageMaker Ground Truth for data labeling

B.Amazon DynamoDB as the knowledge base store

C.Amazon Kendra for indexing the knowledge base

D.Amazon Aurora with the pgvector extension

E.Amazon Bedrock Knowledge Bases with RAG

AnswerD, E

Aurora with pgvector can store and query embeddings for RAG.

Why this answer

Amazon Aurora with the pgvector extension (Option D) enables storing and querying vector embeddings directly within a PostgreSQL-compatible database, which is essential for Retrieval-Augmented Generation (RAG). When combined with Amazon Bedrock Knowledge Bases (Option E), it allows the company to retrieve the most current market data as vector embeddings, ensuring the generated investment summaries are grounded in factual, up-to-date information rather than relying solely on the model's static training data.

Exam trap

The trap here is that candidates often confuse a general-purpose search service like Amazon Kendra with a vector database purpose-built for RAG, overlooking that Bedrock Knowledge Bases requires a vector store (e.g., Aurora with pgvector or Amazon OpenSearch Serverless) to perform semantic similarity retrieval, not just keyword-based indexing.

How to eliminate wrong answers

Option A is wrong because Amazon SageMaker Ground Truth is a data labeling service for creating training datasets, not for storing or retrieving knowledge bases for RAG; it does not provide real-time market data retrieval. Option B is wrong because Amazon DynamoDB is a NoSQL key-value and document database that lacks native vector search capabilities (e.g., pgvector or OpenSearch vector engine), making it unsuitable for efficient similarity search required in RAG workflows. Option C is wrong because Amazon Kendra is an intelligent search service that can index documents, but it is not a vector database optimized for storing and querying embeddings; it also does not integrate directly with Bedrock Knowledge Bases as a vector store.

Full explanation →

607

MCQhard

A team is fine-tuning a Meta Llama 2 model on Amazon Bedrock for a legal document classification task. After fine-tuning, the model performs well on the training set but poorly on the validation set. Which adjustment is MOST likely to reduce overfitting?

A.Increase the size of the training dataset and apply dropout

B.Increase the learning rate

C.Add more layers to the model

D.Reduce the number of training epochs

AnswerA

More data helps generalization; dropout randomly drops units during training, reducing overfitting.

Why this answer

Increasing the training data size and using regularization techniques like dropout or weight decay are standard ways to combat overfitting. Decreasing learning rate or increasing epochs can worsen overfitting; more layers usually increase overfitting.

Full explanation →

608

MCQmedium

A data science team is using Amazon SageMaker Studio. To meet compliance requirements, they need to ensure that all user activity in the environment is logged and that any unauthorized access attempts are detected. Which approach should they take?

A.Enable SageMaker Model Monitor and configure Amazon S3 server access logs.

B.Enable AWS CloudTrail and Amazon GuardDuty for threat detection.

C.Use AWS Config rules to track changes and Amazon Inspector for vulnerability scanning.

D.Enable SageMaker Studio with VPC only mode and use AWS CloudTrail.

AnswerB

CloudTrail logs API calls and GuardDuty detects unauthorized access patterns.

Why this answer

AWS CloudTrail logs all API activity in SageMaker Studio, including user actions and access attempts, while Amazon GuardDuty provides intelligent threat detection by analyzing CloudTrail logs, VPC flow logs, and DNS logs for unauthorized access patterns. Together, they meet compliance requirements for logging and detecting unauthorized access without additional configuration overhead.

Exam trap

The trap here is that candidates often confuse logging (CloudTrail) with threat detection (GuardDuty) and assume that enabling CloudTrail alone satisfies both requirements, but GuardDuty is specifically needed to analyze logs for unauthorized access attempts.

How to eliminate wrong answers

Option A is wrong because SageMaker Model Monitor is designed for detecting data drift and model quality issues, not for logging user activity or detecting unauthorized access; Amazon S3 server access logs only capture requests to S3 buckets, not SageMaker Studio user actions. Option C is wrong because AWS Config rules track resource configuration changes and compliance, not user activity logging, and Amazon Inspector focuses on vulnerability scanning of EC2 instances and container images, not threat detection for user access. Option D is wrong because VPC only mode restricts network access but does not provide logging of user activity or threat detection; AWS CloudTrail alone logs API calls but lacks the intelligent threat detection capability that GuardDuty provides for identifying unauthorized access attempts.

Full explanation →

609

Multi-Selectmedium

Which TWO actions can help reduce bias in a foundation model’s outputs? (Choose two.)

Select 2 answers

A.Fine-tune the model on a balanced, representative dataset

B.Use careful prompt engineering with neutral wording

C.Restrict model access to a subset of users

D.Increase temperature to add randomness

E.Use a larger foundation model

AnswersA, B

Fine-tuning with balanced data can correct biases.

Why this answer

Options B and D are correct. Prompt engineering with neutral phrasing can reduce biased responses. Fine-tuning with a balanced dataset can mitigate biases.

Option A (increase temperature) increases randomness, not reduce bias. Option C (larger model) may amplify biases. Option E (limit users) does not address bias.

Full explanation →

610

Multi-Selecthard

A company is using Amazon Fraud Detector to detect fraudulent transactions. Which TWO actions can be taken to improve model accuracy? (Select TWO.)

Select 2 answers

A.Increase the volume of event data

B.Deploy the model to multiple endpoints

C.Use a different detector type

D.Use a different model version

E.Select event variables that are more predictive

AnswersA, E

More data can help the model learn better patterns.

Why this answer

Increasing the volume of event data provides Amazon Fraud Detector with more examples of both fraudulent and legitimate transactions, which allows the model to learn more robust patterns and reduce overfitting. More data helps the model generalize better to unseen events, directly improving prediction accuracy.

Exam trap

The AIF-C01 exam often tests the misconception that changing model versions or detector types alone improves accuracy, when in reality accuracy improvements require data or feature enhancements.

Full explanation →

611

Multi-Selectmedium

Which TWO actions are recommended for improving the factual accuracy of a foundation model's responses when using RAG?

Select 2 answers

A.Include relevant context from the knowledge base in the prompt

B.Increase the max_tokens parameter

C.Provide clear instructions in the system prompt

D.Use the largest foundation model available

E.Increase the temperature parameter

AnswersA, C

RAG relies on accurate context to ground responses.

Why this answer

Including relevant context from the knowledge base and providing clear system instructions improve accuracy. Other options do not directly help.

Full explanation →

612

MCQhard

A financial services company is building a predictive model using Amazon SageMaker. The model training data contains personally identifiable information (PII). The company must ensure that the data is encrypted at rest and in transit, and that access to the data is logged. Which combination of AWS services meets these requirements?

A.Use S3 server-side encryption with S3-managed keys (SSE-S3) and enable CloudTrail trail for S3 data events

B.Use S3 server-side encryption with AWS KMS (SSE-KMS), enable SageMaker inter-container traffic encryption, and enable CloudTrail data events for the S3 bucket

C.Use S3 client-side encryption and configure SageMaker to use HTTPS for inter-container traffic

D.Enable S3 default encryption with AES-256 and use AWS CloudTrail for S3 data events

AnswerB

SSE-KMS provides encryption at rest with key control, inter-container traffic encryption provides transit encryption, and CloudTrail data events log access to objects.

Why this answer

Option B is correct because it addresses all three requirements: SSE-KMS encrypts data at rest in S3, SageMaker inter-container traffic encryption ensures data in transit between training containers is encrypted, and CloudTrail data events for the S3 bucket log all access to the PII data. This combination provides a complete, auditable encryption and logging solution.

Exam trap

The trap here is that candidates often assume S3 server-side encryption alone (SSE-S3 or SSE-KMS) satisfies all encryption requirements, forgetting that SageMaker inter-container traffic encryption is needed for data in transit during distributed training.

How to eliminate wrong answers

Option A is wrong because SSE-S3 does not provide customer-controlled key management or audit trails for key usage, and CloudTrail data events alone do not cover encryption of data in transit within SageMaker. Option C is wrong because client-side encryption places the burden on the client and does not guarantee encryption at rest in S3 unless explicitly configured, and HTTPS for inter-container traffic is not the same as SageMaker's built-in inter-container encryption which uses TLS. Option D is wrong because S3 default encryption with AES-256 is equivalent to SSE-S3, which lacks customer-managed key audit capabilities, and CloudTrail data events do not address encryption in transit for SageMaker training jobs.

Full explanation →

613

MCQmedium

A company wants to build a customer support chatbot that answers questions based on a large internal knowledge base. Which AWS service is most suitable for implementing RAG to retrieve relevant documents?

A.Amazon Lex

B.Amazon Polly

C.Amazon Connect

D.Amazon Kendra

AnswerD

Kendra provides intelligent search and retrieval from indexed documents, ideal for RAG workflows.

Why this answer

Amazon Kendra is a highly accurate enterprise search service that can retrieve relevant documents from various sources, which can then be provided to a foundation model for generation. Lex, Connect, and Polly are not designed for document retrieval.

Full explanation →

614

Multi-Selectmedium

A data scientist is using a foundation model to summarize long documents. Which TWO of the following steps are most likely to improve the quality of the summaries?

Select 2 answers

A.Break the input document into chunks and summarize each chunk separately.

B.Use a high temperature parameter to increase creativity.

C.Provide few-shot examples of desired summaries in the prompt.

D.Use a low frequency penalty to reduce repetition.

E.Use a longer context length by increasing the max tokens parameter.

AnswersA, C

Chunking allows handling of long documents that exceed context length.

Why this answer

Option A is correct because foundation models have a fixed maximum context window (e.g., 4,096 tokens for GPT-3.5). By breaking a long document into smaller chunks and summarizing each independently, you avoid truncation and ensure the model can process the entire content without losing information. This chunking strategy is a standard preprocessing technique for handling documents that exceed the model's context length.

Exam trap

AWS often tests the misconception that increasing max tokens extends the model's input capacity, when in reality it only controls the output length, while the input is constrained by the model's inherent context window.

Full explanation →

615

MCQeasy

What is the primary purpose of an embedding model in the context of semantic search?

Answer options not yet available.

Why this answer

Embedding models convert text into dense vector representations that capture semantic meaning, enabling similarity search in vector databases.

Full explanation →

616

MCQeasy

Which metric is commonly used to evaluate the quality of a text summarization model by comparing the generated summary with a reference summary, measuring the overlap of n-grams?

A.Perplexity

B.BLEU

C.ROUGE

D.BERTScore

AnswerC

ROUGE (especially ROUGE-L and ROUGE-N) is the standard metric for summarization, measuring n-gram recall.

Why this answer

ROUGE (Recall-Oriented Understudy for Gisting Evaluation) measures n-gram overlap between generated and reference summaries. BLEU is for translation, BERTScore uses embeddings, and perplexity measures language model fluency.

Full explanation →

617

MCQmedium

A team deployed a text generation model on Amazon Bedrock. They want to monitor for toxic content in model outputs. Which evaluation approach is MOST effective?

A.Enable CloudWatch Logs and set a metric filter for toxic words

B.Use Amazon SageMaker Ground Truth for human annotation

C.Manually review a sample of outputs each week

D.Use Amazon Bedrock Model Evaluation with toxicity metrics

AnswerD

Bedrock Model Evaluation provides automated toxicity assessment.

Why this answer

Amazon Bedrock Model Evaluation with toxicity metrics is the most effective approach because it provides automated, built-in evaluation of model outputs for toxic content using predefined metrics, directly integrated with the Bedrock service. This eliminates the need for manual effort or custom filtering, ensuring consistent and scalable monitoring of harmful content.

Exam trap

The trap here is that candidates may choose CloudWatch metric filters (Option A) because they associate monitoring with logs, but fail to recognize that toxicity detection requires semantic understanding beyond simple keyword matching.

How to eliminate wrong answers

Option A is wrong because CloudWatch Logs with a metric filter for toxic words is a simplistic, keyword-based approach that cannot detect nuanced or context-dependent toxicity, such as sarcasm or implicit hate speech, and requires manual setup of word lists. Option B is wrong because Amazon SageMaker Ground Truth for human annotation is designed for creating labeled datasets, not for real-time or automated monitoring of model outputs, and introduces latency and cost overhead. Option C is wrong because manually reviewing a sample of outputs each week is not scalable, introduces human bias, and fails to provide continuous or real-time monitoring, making it ineffective for production systems.

Full explanation →

618

MCQmedium

A company is developing an AI system that generates news articles. To comply with transparency regulations, they must clearly indicate when content is AI-generated. Which action should they take?

A.Register the model with a government agency

B.Remove any identifiable information about the model from the output

C.Use a watermark that is invisible to users

D.Include a disclaimer in the article metadata and a visible label stating 'This content was generated by AI'

AnswerD

This directly addresses transparency by informing users both in the metadata (for programmatic access) and visibly on the content itself.

Why this answer

Transparency requirements often mandate disclosure that content is AI-generated, which can be done through labels, watermarks, or metadata. Providing a clear disclosure to users is the most direct and compliant approach.

Full explanation →

619

MCQeasy

A company wants to automatically detect anomalies in their AWS CloudTrail logs to identify potential security threats. Which AWS service is specifically designed for this purpose?

A.Amazon Macie

B.AWS Config

C.Amazon GuardDuty

D.Amazon Inspector

AnswerC

GuardDuty uses ML to detect anomalies in CloudTrail logs and other sources.

Why this answer

Amazon GuardDuty is a threat detection service that continuously monitors AWS accounts and workloads using machine learning, anomaly detection, and integrated threat intelligence. It specifically analyzes CloudTrail management and data events, VPC Flow Logs, and DNS logs to identify unauthorized behavior or potential security threats, making it the correct choice for automatically detecting anomalies in CloudTrail logs.

Exam trap

The AIF-C01 exam often tests the distinction between services that detect threats (GuardDuty) versus services that protect data (Macie), assess vulnerabilities (Inspector), or track configuration compliance (Config), leading candidates to confuse their primary use cases.

How to eliminate wrong answers

Option A is wrong because Amazon Macie is a data security and data privacy service that uses machine learning to discover, classify, and protect sensitive data stored in Amazon S3, not to analyze CloudTrail logs for security threats. Option B is wrong because AWS Config is a service that evaluates and records resource configurations and compliance against desired policies, not designed for real-time anomaly detection in log data. Option D is wrong because Amazon Inspector is a vulnerability management service that scans EC2 instances and container images for software vulnerabilities and unintended network exposure, not for analyzing CloudTrail logs.

Full explanation →

620

MCQmedium

A team has built a Retrieval-Augmented Generation (RAG) pipeline using Amazon Bedrock Knowledge Bases. After deployment, they find that the model's answers often contain hallucinated details not present in the retrieved documents. What should they enable in Bedrock Guardrails to reduce this?

A.Topic denial policies

B.Grounding check

C.PII detection

D.Content filtering with increased severity

AnswerB

A grounding check compares the model's response against the retrieved source documents and can block or flag statements that lack support.

Why this answer

Grounding check ensures the model's response is grounded in the retrieved documents, penalizing or filtering content that is not supported. Content filtering obscenity, topic denial, and PII detection do not address hallucination of facts.

Full explanation →

621

Multi-Selecteasy

Which TWO of the following are types of feature scaling?

Select 2 answers

A.One-hot encoding

B.Principal Component Analysis (PCA)

C.Standardization

D.Binning

E.Normalization (Min-Max)

AnswersC, E

Standardization (Z-score) is a common feature scaling method.

Why this answer

Standardization (Z-score scaling) transforms features to have a mean of 0 and a standard deviation of 1, making it a valid type of feature scaling. It is essential when using algorithms that assume normally distributed data, such as linear regression, SVM, or PCA, and it does not bound the data to a fixed range.

Exam trap

AWS often tests the distinction between feature scaling (changing the numeric range of features) and data transformation techniques like encoding or dimensionality reduction, leading candidates to confuse one-hot encoding or PCA with scaling methods.

Full explanation →

622

MCQeasy

A company is using Amazon Comprehend to analyze customer feedback. They need to ensure that the documents are encrypted at rest. What should they do?

A.No action is needed; Amazon Comprehend automatically encrypts data at rest using AES-256

B.Enable encryption using AWS KMS in the Comprehend console

C.Store documents in an encrypted S3 bucket and use a VPC endpoint

D.Use SSL/TLS for all API calls to Comprehend

AnswerA

Comprehend encrypts all data at rest by default.

Why this answer

Amazon Comprehend automatically encrypts all data at rest using AES-256 encryption by default, with no additional configuration required. This encryption covers both the documents processed by the service and any models or artifacts stored internally. Therefore, no action is needed from the customer to enable encryption at rest.

Exam trap

The trap here is that candidates often assume they need to manually enable encryption or use KMS, but Amazon Comprehend enforces encryption at rest automatically with no user action required, making 'No action needed' the correct answer.

How to eliminate wrong answers

Option B is wrong because Amazon Comprehend does not expose a console option to enable or disable encryption via AWS KMS; encryption is always-on and managed by the service. Option C is wrong because while storing documents in an encrypted S3 bucket is a best practice for data in transit to Comprehend, it does not affect how Comprehend encrypts data at rest within its own storage; the service already encrypts at rest regardless of the source bucket's encryption. Option D is wrong because SSL/TLS protects data in transit, not data at rest, and is already enforced by Comprehend for API calls.

Full explanation →

623

MCQeasy

A developer is using Amazon Bedrock to generate code snippets. The model often produces insecure code. Which prompt engineering technique is MOST effective to improve security?

A.Use chain-of-thought prompting to step through the code

B.Provide few-shot examples of secure code

C.Set max_tokens to a low value to limit output

D.Include specific instructions to avoid common security vulnerabilities

AnswerD

Direct instructions in the prompt can effectively guide the model.

Why this answer

Option D is correct because directly instructing the model to avoid specific security vulnerabilities (e.g., SQL injection, buffer overflows) is the most explicit and effective way to constrain the output. Amazon Bedrock models respond well to clear, imperative instructions in the system prompt or user message, making this a direct application of prompt engineering for safety. Chain-of-thought or few-shot examples may improve reasoning or style but do not guarantee the model will avoid insecure patterns unless explicitly told to do so.

Exam trap

The trap here is that candidates often overestimate the effectiveness of few-shot examples or reasoning techniques for security, assuming they implicitly teach safety, when in fact explicit instructions are required to override the model's default training biases toward common (but insecure) coding patterns.

How to eliminate wrong answers

Option A is wrong because chain-of-thought prompting improves reasoning steps but does not inherently enforce security constraints; it may still produce insecure code if the model's reasoning path includes unsafe patterns. Option B is wrong because few-shot examples of secure code can guide style but do not prevent the model from generating insecure code when the prompt does not explicitly forbid it; the model may still default to common insecure patterns from its training data. Option C is wrong because setting max_tokens to a low value limits output length but does not affect the security of the generated code; it may truncate a secure solution or force incomplete code, not improve safety.

Full explanation →

624

MCQhard

A team is using Amazon SageMaker to train a deep learning model. The training job is taking too long. Which action is MOST likely to reduce training time while maintaining model quality?

A.Use incremental training

B.Enable managed spot training

C.Switch to a smaller instance type

D.Increase the number of training instances for distributed training

AnswerD

Distributed training across more instances can dramatically reduce training time if the model is parallelizable.

Why this answer

Using SageMaker's managed spot training can reduce cost but not necessarily time. Incremental training is not a standard SageMaker feature. The most direct way to reduce time is to increase the number of instances for distributed training, provided the model supports data parallelism.

Full explanation →

625

MCQhard

A team is building a real-time anomaly detection system for IoT sensor data. The data is unlabeled, and the team expects the anomalies to be rare but of high importance. Which combination of approach and AWS service should the team use?

A.Use Amazon Rekognition Custom Labels to detect anomalies in sensor images

B.Train a supervised classifier using Amazon SageMaker; use Amazon Kinesis Data Analytics for real-time inference

C.Apply k-means clustering with Amazon SageMaker; deploy the model as a real-time endpoint

D.Use a semi-supervised one-class classifier with Amazon Lookout for Equipment

AnswerD

Amazon Lookout for Equipment is designed for sensor data and can train a model on normal operating data (semi-supervised) to detect anomalies in real time.

Why this answer

For unlabeled data with rare anomalies, a semi-supervised approach using a one-class classifier is appropriate because it learns the 'normal' pattern. Amazon Lookout for Equipment is designed for sensor data anomaly detection and can be used in a semi-supervised manner. The other options require labels or are not suited for real-time IoT data.

Full explanation →

626

MCQeasy

A retail company is deploying a chatbot to handle customer inquiries. During testing, they notice the chatbot occasionally uses offensive language when responding to certain user inputs. Which responsible AI principle is being violated?

A.Privacy

B.Transparency

C.Fairness

D.Accountability

AnswerC

Fairness ensures AI systems treat all users equitably; offensive language is a fairness issue.

Why this answer

The correct answer is fairness because the chatbot's responses are biased and offensive, indicating a lack of fairness in treating all users appropriately. Accountability relates to ownership, privacy to data protection, and transparency to explainability.

Full explanation →

627

MCQeasy

A team trained a deep learning model that achieves 99% accuracy on training data but only 70% on validation data. What is the most likely issue?

A.Underfitting

B.Overfitting

C.Data leakage

D.Feature scaling

AnswerB

Overfitting occurs when the model learns training data too well, including noise, failing to generalize to validation data.

Why this answer

The model performs exceptionally well on training data (99% accuracy) but significantly worse on validation data (70% accuracy). This large gap indicates the model has memorized the training data, including noise and irrelevant patterns, rather than learning generalizable features — a classic symptom of overfitting.

Exam trap

The AIF-C01 exam often tests the distinction between overfitting and underfitting by presenting a scenario where training accuracy is high but validation accuracy is low, tempting candidates to incorrectly choose underfitting if they focus only on the low validation score.

How to eliminate wrong answers

Option A is wrong because underfitting would show poor performance on both training and validation data, not high training accuracy with low validation accuracy. Option C is wrong because data leakage typically causes both training and validation accuracy to be artificially high, not a large gap between them. Option D is wrong because feature scaling issues would generally affect model convergence or performance uniformly across datasets, not create a specific training-validation accuracy disparity.

Full explanation →

628

Multi-Selectmedium

A healthcare company is building a medical diagnosis assistant using Amazon Bedrock. They need to ensure the model’s responses are based on the latest medical research and do not include outdated information. The company also wants to minimize costs. Which TWO actions should they take? (Select TWO)

Select 2 answers

A.Use the Converse API to maintain conversation history

B.Use a smaller model like Amazon Titan Text Lite to reduce inference costs

C.Implement RAG by indexing the latest medical journals in a vector store

D.Fine-tune a large model on the latest medical data monthly

E.Choose a model with the largest context window to include all research in the prompt

AnswersB, C

A smaller model is cheaper per invocation and, when combined with RAG, can still provide accurate answers.

Why this answer

RAG allows the model to retrieve up-to-date information from a curated database without retraining. Using a smaller model like Titan Text Lite reduces cost. Fine-tuning would be expensive and still require updates.

The Converse API does not ensure freshness. A large model with long context is costly.

Full explanation →

629

MCQmedium

A machine learning practitioner is training a model to forecast product demand and observes that the model performs well on training data but poorly on unseen data. Which of the following is the MOST likely cause?

A.High bias-variance tradeoff favoring bias

B.Underfitting due to model being too simple

C.Data leakage causing artificially high training scores

D.Overfitting due to excessive model complexity

AnswerD

Overfitting leads to high training accuracy but low test accuracy because the model captures noise.

Why this answer

The model is overfitting: it has memorized the training data but fails to generalize. Overfitting occurs when a model is too complex relative to the amount of training data. The other options describe underfitting (not learning from training data) or data leakage (causing overly optimistic training but not necessarily poor unseen performance).

Full explanation →

630

Multi-Selectmedium

Which THREE of the following are capabilities of Amazon SageMaker? (Select THREE.)

Select 3 answers

A.Real-time inference endpoints

B.Automatic model tuning (hyperparameter optimization)

C.On-premises training only

D.Built-in algorithms for common tasks

E.Can only deploy models to EC2 instances

AnswersA, B, D

SageMaker offers real-time inference with managed endpoints.

Why this answer

Amazon SageMaker provides real-time inference endpoints that allow you to deploy trained models to a fully managed HTTPS endpoint for low-latency predictions. These endpoints automatically scale based on traffic and support A/B testing, making them suitable for production workloads.

Exam trap

The AIF-C01 exam often tests the misconception that SageMaker is limited to cloud-only or specific deployment targets, but the service actually offers flexible deployment options including on-premises and edge devices.

Full explanation →

631

Multi-Selecthard

A company uses Amazon SageMaker to build and deploy models. They want to enforce compliance that all model endpoints are encrypted in transit and use least privilege access. Which THREE steps should they take? (Choose THREE.)

Select 3 answers

A.Configure the SageMaker endpoint to use a custom SSL certificate via AWS Certificate Manager

B.Use an interface VPC endpoint (AWS PrivateLink) for SageMaker

C.Attach an IAM policy to the execution role that only allows specific actions on the endpoint

D.Enable AWS CloudTrail to log all endpoint invocations

E.Disable root access on the SageMaker notebook instances

AnswersA, B, C

This ensures HTTPS for encryption in transit.

Why this answer

Option A is correct because configuring a SageMaker endpoint to use a custom SSL certificate from AWS Certificate Manager (ACM) ensures that all data transmitted between clients and the endpoint is encrypted in transit using TLS. This enforces the compliance requirement for encryption in transit by replacing the default SageMaker certificate with a customer-managed certificate, which can be validated and rotated as needed.

Exam trap

The trap here is that candidates often confuse logging (CloudTrail) with enforcement of encryption or access control, or they mistakenly think disabling root access on notebooks affects endpoint security, when in fact it only secures the development environment.

Full explanation →

632

MCQmedium

A company deploys a large language model to automatically generate product descriptions. They want to ensure customers are aware that the content is AI-generated, as part of transparency requirements. What should they implement?

A.Include a disclosure statement such as 'This content was generated by AI' in the output

B.Use Amazon Rekognition to add a visible watermark to images only

C.Store metadata in the content database indicating AI generation

D.Embed an invisible watermark in the generated text

AnswerA

A clear disclosure directly informs customers that the content is AI-generated, meeting transparency goals.

Why this answer

Transparency requirements often mandate disclosing AI-generated content. The simplest approach is to include a clear disclosure in the output. Bedrock Guardrails can be configured to prepend or append a disclosure message.

The other options either hide the AI origin or only apply to specific types of content like images.

Full explanation →

633

MCQeasy

A developer is building a customer-facing chatbot using Amazon Bedrock. To ensure the chatbot does not generate offensive or inappropriate content, which AWS feature should they implement?

A.AWS Identity and Access Management (IAM) policies

B.Amazon Bedrock Guardrails

C.Prompt engineering with system prompts

D.Increasing the model temperature parameter

AnswerB

Guardrails enable content filtering, topic control, and safety mechanisms for Bedrock models.

Why this answer

Amazon Bedrock Guardrails provide content filtering, allowing you to define policies to block harmful or inappropriate content. Prompt templates and temperature affect output style but not safety. IAM controls access but not content.

Full explanation →

634

MCQhard

During a security review, it is found that an Amazon SageMaker notebook instance has outbound internet access, which could lead to data exfiltration. The notebook must only access resources within the VPC. Which step should be taken to restrict internet access?

A.Modify the notebook instance's IAM role to deny s3:GetObject

B.Attach a security group that denies all outbound traffic to 0.0.0.0/0

C.Configure the notebook instance in a VPC with no internet gateway or NAT device, and set the notebook's 'Direct Internet Access' option to 'Disabled'

D.Disable the SageMaker notebook instance's root volume encryption

AnswerC

This ensures the notebook can only reach resources within the VPC.

Why this answer

Option C is correct because disabling 'Direct Internet Access' on a SageMaker notebook instance and placing it in a VPC without an internet gateway or NAT device ensures the notebook cannot reach the public internet. This configuration forces all traffic to stay within the VPC, preventing data exfiltration via outbound internet connections while still allowing access to VPC resources.

Exam trap

The trap here is that candidates may confuse network-level controls (security groups, VPC routing) with IAM permissions, thinking that denying S3 access prevents all exfiltration, or they may incorrectly assume that disabling encryption or blocking all outbound traffic is the correct approach.

How to eliminate wrong answers

Option A is wrong because modifying the IAM role to deny s3:GetObject only restricts access to S3 objects, not outbound internet traffic; data exfiltration could still occur via other protocols (e.g., HTTP, DNS tunneling). Option B is wrong because attaching a security group that denies all outbound traffic to 0.0.0.0/0 would block all outbound traffic, including legitimate VPC resources (e.g., other services within the same VPC), which is overly restrictive and not the intended solution. Option D is wrong because disabling root volume encryption does not affect internet access; it only removes encryption at rest, which is a security risk but unrelated to network egress control.

Full explanation →

635

MCQeasy

A developer is building an application using Amazon Bedrock and needs to ensure that the model's responses do not include any toxic or harmful language. Which Bedrock feature should they configure?

A.Bedrock Knowledge Bases

B.Bedrock Playground

C.Bedrock Agents

D.Bedrock Guardrails

AnswerD

Guardrails allow configuring content filters, denied topics, and PII detection to control model output safety.

Why this answer

Bedrock Guardrails provide content filters, topic denial, PII detection, and other safety controls. Knowledge Bases, Agents, and Playground do not directly enforce content safety rules.

Full explanation →

636

MCQmedium

A data scientist needs to select an Amazon Bedrock model for a real-time chat application that requires low latency and high throughput. The responses must be generated as the user types. Which model invocation approach is MOST suitable?

A.Use a smaller context window to limit memory usage

B.Use the InvokeModel API with a synchronous request and wait for the full response

C.Use the InvokeModelWithResponseStream API to stream the response tokens as they are generated

D.Use the Converse API with a high temperature setting

AnswerC

Streaming delivers tokens progressively, allowing the chat interface to show partial responses and improving user experience.

Why this answer

Streaming responses allow the model to send tokens as they are generated, enabling the application to display incremental output to the user, which reduces perceived latency.

Full explanation →

637

MCQeasy

What is the main purpose of a system prompt in a large language model?

A.To increase the temperature for more creative responses

B.To provide an example of the desired output format

C.To list all possible tokens to be used in the response

D.To define the high-level instructions that set the model's behavior and persona

AnswerD

System prompts are used to establish context, style, and rules for the conversation.

Why this answer

A system prompt sets the overall behavior, persona, and constraints for the model's responses, guiding how it should interact.

Full explanation →

638

MCQeasy

Which Amazon Titan model is specifically designed to convert text into numerical vectors for use in semantic search and Retrieval-Augmented Generation (RAG)?

A.Amazon Titan Text Express

B.Amazon Titan Multimodal Embeddings

C.Amazon Titan Image Generator

D.Amazon Titan Embeddings

AnswerD

Titan Embeddings generates vector embeddings for text inputs.

Why this answer

Amazon Titan Embeddings is a text embedding model that produces vector representations. Titan Text models generate text; Titan Image Generator creates images; Titan Multimodal Embeddings handles both text and images but the question asks for text-to-vectors specifically.

Full explanation →

639

MCQhard

A security team needs to detect anomalies in AWS CloudTrail logs to identify potential unauthorized access. They want to use machine learning without manually labeling data or training custom models. Which AWS service should they use?

A.Amazon Macie

B.Amazon SageMaker

C.Amazon GuardDuty

D.Amazon Detective

AnswerC

GuardDuty uses ML to detect anomalous behavior from CloudTrail logs automatically.

Why this answer

Amazon GuardDuty uses machine learning (including anomaly detection) to analyze CloudTrail logs, VPC flow logs, and DNS logs, and it works out-of-the-box without manual labeling. Macie is for sensitive data, Detective for investigation, and SageMaker would require custom model building.

Full explanation →

640

MCQmedium

A developer is building a chatbot using Amazon Bedrock and Claude. They notice that the model sometimes generates harmful or biased responses. Which AWS service can they use to implement guardrails?

A.AWS WAF

B.Amazon GuardDuty

C.AWS Shield

D.Amazon Bedrock Guardrails

AnswerD

Bedrock Guardrails allows you to define content filters and deny topics to moderate model responses.

Why this answer

Option C, Amazon Bedrock Guardrails, is the native service for adding content filters and safety controls to models in Bedrock. Option A (AWS WAF) is a web application firewall, not for model output. Option B (Amazon GuardDuty) is a threat detection service.

Option D (AWS Shield) protects against DDoS attacks.

Full explanation →

641

MCQeasy

A developer wants to invoke a foundation model in Amazon Bedrock with a large number of similar requests for cost savings. Which feature can help reduce inference cost?

A.Model caching

B.Fine-tuning

C.Prompt management

D.Batch inference

AnswerA

Correct. Caching avoids recomputing responses for identical prompts, saving cost.

Why this answer

Model caching in Amazon Bedrock stores recently generated responses for identical prompts and reuses them, reducing the number of inference calls and thus costs. Batch inference processes multiple requests together but still charges per request. Prompt management and fine-tuning do not directly reduce per-request costs.

Full explanation →

642

MCQmedium

A company uses Amazon Bedrock to power a code generation assistant. They notice that the generated code sometimes contains security vulnerabilities. Which approach would BEST address this issue without sacrificing code quality?

A.Switch to a smaller model to reduce the chance of generating complex vulnerable code

B.Increase the temperature parameter to make output more deterministic

C.Fine-tune the model on a curated dataset of secure code and use Bedrock Guardrails to block insecure patterns

D.Use a longer prompt that instructs the model to avoid vulnerabilities

AnswerC

Fine-tuning on secure code teaches the model best practices, while guardrails add a safety layer.

Why this answer

Guardrails can filter harmful content, but for code-specific vulnerabilities, a more targeted approach is to fine-tune on secure code examples or use a specialised model. However, among the options, applying guardrails for code safety is a practical step.

Full explanation →

643

MCQhard

A company is using Amazon SageMaker to train a large language model with hundreds of billions of parameters. The model does not fit into the memory of a single GPU. Which approach should they use to train the model efficiently?

A.Use a larger instance with more GPU memory, such as p4d.24xlarge

B.Use SageMaker's data parallelism strategy

C.Use SageMaker's model parallelism strategy with the SageMaker distributed training library

D.Reduce the model size by pruning layers until it fits into memory

AnswerC

Model parallelism splits the model across GPUs, enabling training of very large models.

Why this answer

Option C is correct because SageMaker's model parallelism strategy with the SageMaker distributed training library is specifically designed for training large models that do not fit into the memory of a single GPU. It partitions the model layers across multiple GPUs, enabling efficient training of models with hundreds of billions of parameters by overlapping computation and communication.

Exam trap

The AIF-C01 exam often tests the distinction between data parallelism and model parallelism, and the trap here is that candidates may confuse data parallelism (which splits data, not the model) as a solution for models that don't fit in memory, when in fact model parallelism is required for such cases.

How to eliminate wrong answers

Option A is wrong because even the largest GPU instances like p4d.24xlarge have limited GPU memory (40 GB per A100 GPU), which is insufficient for a model with hundreds of billions of parameters; scaling vertically is not feasible for such large models. Option B is wrong because SageMaker's data parallelism strategy replicates the entire model on each GPU and splits the data across GPUs, which requires the model to fit into a single GPU's memory; it does not solve the memory constraint issue. Option D is wrong because pruning layers to reduce model size would degrade model quality and is not a practical or efficient approach for training large language models; the goal is to train the full model, not a smaller version.

Full explanation →

644

Multi-Selectmedium

Which TWO actions help ensure fairness in an AI system deployed on AWS? (Select two.)

Select 2 answers

A.Train the model on a representative dataset

B.Enable AWS CloudTrail for audit

C.Use SageMaker Clarify to detect bias

D.Use a single validation set

E.Encrypt data at rest using AWS KMS

AnswersA, C

Representative data reduces bias.

Why this answer

Options A and B are correct: Diverse training data and bias detection with Clarify. Options C, D, E are not direct fairness actions.

Full explanation →

645

MCQmedium

A data scientist uses Amazon Bedrock. The model responses are too long. Which parameter should they adjust to limit the output length?

A.temperature

B.max_tokens

C.stop sequences

D.top_p

AnswerB

Reducing max_tokens directly caps the output length.

Why this answer

The `max_tokens` parameter directly controls the maximum number of tokens (words or subwords) the model can generate in a single response. By reducing this value, the data scientist caps the output length, preventing overly long responses. Temperature and top_p affect randomness and diversity, not length, while stop sequences define when generation halts but do not enforce a hard token limit.

Exam trap

AWS often tests the distinction between parameters that control output length (`max_tokens`) versus those that control output randomness or diversity (`temperature`, `top_p`), leading candidates to confuse 'limiting length' with 'limiting creativity'.

How to eliminate wrong answers

Option A is wrong because temperature controls the randomness of token selection (higher values increase creativity, lower values make output more deterministic), not the length of the response. Option C is wrong because stop sequences are custom strings (e.g., '###' or 'END') that tell the model to cease generation when encountered, but they do not limit the total number of tokens generated before that point. Option D is wrong because top_p (nucleus sampling) limits the cumulative probability of token choices to a threshold (e.g., 0.9), affecting diversity, not the maximum output length.

Full explanation →

646

MCQmedium

A financial services company is evaluating Amazon Bedrock for a compliance application that requires explainable AI decisions. The model's output must be auditable and traceable to specific reasoning. Which Bedrock feature should they use to meet this requirement?

A.Create a knowledge base with financial regulations to guide the model.

B.Fine-tune a custom model on regulatory documents to improve reasoning.

C.Enable model invocation logging in Amazon Bedrock and store logs in Amazon S3.

D.Amazon Bedrock Guardrails to filter sensitive content.

AnswerC

Logging captures full input/output pairs, enabling auditors to review and trace decisions.

Why this answer

Option C is correct because model invocation logging records all requests and responses, enabling traceability. Option A is wrong because guardrails filter content but don't provide reasoning. Option B is wrong because custom models are still black boxes.

Option D is wrong because knowledge bases are for retrieval, not reasoning traceability.

Full explanation →

647

MCQhard

A SageMaker endpoint is configured with automatic scaling. The model's inference time is 50ms, and traffic increases gradually. What scaling metric should be used to add instances before latency increases?

A.Memory utilization

B.Concurrent requests

C.CPU utilization

D.Invocations per instance

AnswerD

Invocations per instance directly measures the load per instance, allowing proactive scaling before latency rises.

Why this answer

D is correct because 'Invocations per instance' is a custom metric that directly measures the number of inference requests each instance is handling. By setting a target value for this metric, the scaling policy can proactively add instances when the per-instance request count approaches a threshold, preventing latency increases before they occur. This is the recommended approach for SageMaker endpoints with gradual traffic increases, as it anticipates demand rather than reacting to latency spikes.

Exam trap

The trap here is that candidates often choose 'Concurrent requests' (Option B) thinking it directly measures load, but AWS SageMaker does not expose that metric for scaling; instead, 'Invocations per instance' is the correct metric that normalizes load per instance and enables proactive scaling.

How to eliminate wrong answers

Option A is wrong because memory utilization is not a reliable indicator of inference latency; SageMaker endpoints typically have sufficient memory, and scaling based on memory would not prevent latency from increasing due to request queuing. Option B is wrong because 'Concurrent requests' is not a supported metric for SageMaker automatic scaling; the correct metric is 'Invocations per instance' which normalizes request load across the number of instances. Option C is wrong because CPU utilization can spike due to other processes and does not directly correlate with inference latency; scaling on CPU may add instances too late or unnecessarily, as inference is often I/O-bound rather than CPU-bound.

Full explanation →

648

Multi-Selecthard

A machine learning engineer is using Amazon SageMaker to deploy a real-time inference endpoint for a classification model. The model must provide low-latency predictions and handle variable traffic. Which THREE actions should the engineer take? (Select THREE.)

Select 3 answers

A.Enable data capture to log input and output payloads

B.Use a larger instance type to maximize throughput

C.Choose an instance type optimized for inference, such as Inf1

D.Deploy the model as a batch transform job instead of a real-time endpoint

E.Configure auto scaling to add or remove instances based on traffic

AnswersA, C, E

Data capture is important for monitoring, auditing, and debugging predictions.

Why this answer

Using a smaller instance type reduces latency, auto scaling handles variable traffic, and enabling data capture allows monitoring and debugging.

Full explanation →

649

MCQmedium

A company is deploying a real-time inference endpoint using Amazon SageMaker. The security team requires that all data sent to the endpoint be encrypted in transit and that the endpoint is only accessible from within the company's VPC. Which configuration should be used?

A.Deploy the endpoint in a private subnet with a VPC endpoint for SageMaker Runtime

B.Deploy the endpoint in a public subnet and use a security group to restrict source IPs

C.Enable network isolation on the endpoint

D.Use an AWS Lambda function to proxy requests and restrict access via IAM

AnswerA

This makes the endpoint accessible only via the VPC endpoint, ensuring no public internet access, and HTTPS is used automatically.

Why this answer

SageMaker endpoints can be deployed in a VPC and configured to use a private VPC endpoint (AWS PrivateLink) so that they are accessible only from within the VPC. HTTPS ensures encryption in transit; SageMaker endpoints automatically use HTTPS.

Full explanation →

650

MCQmedium

A company uses a foundation model for real-time translation in a chat application. The latency is high. Which optimization would reduce latency the most?

A.Increase batch size

B.Use model distillation to create a smaller model

C.Use a larger model

D.Use a CDN for model weights

AnswerB

Distillation reduces model size and inference latency.

Why this answer

Model distillation reduces the size of the foundation model by training a smaller 'student' model to mimic the behavior of a larger 'teacher' model. This directly decreases inference latency because the smaller model requires fewer computational resources (FLOPs) per forward pass, which is critical for real-time translation in a chat application where low latency is paramount.

Exam trap

The AIF-C01 exam often tests the distinction between throughput optimization (batch size) and latency optimization (model size/distillation), leading candidates to mistakenly choose increasing batch size when the question explicitly asks for reducing latency.

How to eliminate wrong answers

Option A is wrong because increasing batch size improves throughput (more requests processed per unit time) but does not reduce per-request latency; in fact, it can increase latency for individual requests as the model must wait for the batch to fill. Option C is wrong because using a larger model increases the number of parameters and computational complexity, which would increase latency, not reduce it. Option D is wrong because a CDN for model weights only accelerates the initial download of the model to edge locations, not the inference latency of each translation request; once the model is loaded, inference speed is determined by the model architecture and hardware, not network delivery.

Full explanation →

651

Multi-Selectmedium

A company wants to use Amazon SageMaker Ground Truth to build a labeled dataset for a custom object detection model. Which TWO labeling strategies are available? (Choose two.)

Select 2 answers

A.Private workforce labeling (company employees)

B.Crowd-based labeling using Amazon Mechanical Turk

C.Automated labeling using pre-trained models

D.Active learning with manual verification

E.Fully automated labeling via AWS Lambda

AnswersA, B

Private workforce uses the company's own employees for labeling.

Why this answer

Amazon SageMaker Ground Truth supports private workforce labeling where company employees (e.g., via a corporate directory or invited users) perform manual annotation. This is ideal for sensitive data or domain-specific tasks like custom object detection, where internal expertise ensures high label accuracy.

Exam trap

The AIF-C01 exam often tests the distinction between labeling strategies (workforce types) and labeling features (like automated labeling or active learning), causing candidates to confuse automated data labeling as a workforce option when it is actually a post-labeling automation feature.

Full explanation →

652

MCQeasy

A data scientist is preparing data for a machine learning model. What is the purpose of splitting the data into training, validation, and test sets?

A.To tune hyperparameters

B.To balance class distributions

C.To prevent overfitting during training

D.To evaluate model generalization

AnswerD

The test set provides an unbiased estimate of performance on new data.

Why this answer

The test set is used to evaluate the final model's generalization performance on unseen data. The validation set is for hyperparameter tuning during development. Training set is for fitting the model.

Full explanation →

653

MCQmedium

A financial services company uses Amazon Bedrock to power a customer-facing chatbot that provides investment advice. The company must ensure that the chatbot's responses comply with regulatory standards, meaning that the model should not generate advice that is speculative or promises returns. The company has implemented Bedrock Guardrails with content filters. However, during testing, the chatbot still generates responses that violate the guidelines. A review of the guardrail configuration shows that the content filters are set to the lowest sensitivity. The company wants to enforce stricter filtering without completely blocking legitimate responses. What should the company do?

A.Increase the sensitivity of the content filters in the Bedrock Guardrails configuration.

B.Use a different foundational model that has built-in compliance filters.

C.Configure the chatbot to route all responses to a human reviewer before delivering to the customer.

D.Add a deny topic for investment advice to completely block that topic.

AnswerA

Higher sensitivity blocks more content that violates guidelines, while still allowing compliant responses.

Why this answer

Option A is correct because increasing the sensitivity of the content filters in Bedrock Guardrails directly addresses the issue: the current filters are set to the lowest sensitivity, allowing speculative or promise-based responses to pass through. By raising the sensitivity, the guardrails will block more non-compliant content while still permitting legitimate investment advice, striking the required balance between regulatory compliance and functionality.

Exam trap

The trap here is that candidates may think adding a deny topic (Option D) is the simplest way to enforce compliance, but they overlook that it completely blocks all investment advice, which violates the requirement to allow legitimate responses; the exam tests understanding of granular guardrail tuning versus blunt blocking.

How to eliminate wrong answers

Option B is wrong because switching to a different foundational model does not guarantee built-in compliance filters that meet the specific regulatory standards; models themselves do not enforce content policies—guardrails do. Option C is wrong because routing all responses to a human reviewer introduces latency and scalability issues, and does not solve the underlying guardrail configuration problem; it is a workaround, not a fix. Option D is wrong because adding a deny topic for investment advice would completely block all investment-related queries, which is overly restrictive and prevents the chatbot from providing any legitimate advice, violating the requirement to avoid completely blocking legitimate responses.

Full explanation →

654

MCQhard

Refer to the exhibit. A developer receives an error when trying to invoke the Claude Instant model from an application. The application uses the IAM role 'MyAppRole'. Which IAM policy statement should be added to the role to resolve the error?

A.{"Effect":"Allow","Action":"bedrock:GetFoundationModel","Resource":"*"}

B.{"Effect":"Allow","Action":"bedrock:InvokeModel","Resource":"arn:aws:bedrock:us-east-1::foundation-model/*"}

C.{"Effect":"Allow","Action":"bedrock:InvokeModel","Resource":"arn:aws:bedrock:us-east-1::foundation-model/anthropic.claude-instant-v1"}

D.{"Effect":"Allow","Action":"bedrock:*","Resource":"*"}

AnswerC

This grants the minimal required permission for the specific model.

Why this answer

The error indicates missing permission to invoke the specific model. The correct action is 'bedrock:InvokeModel' on the specific model ARN.

Full explanation →

655

MCQmedium

A developer is using Bedrock Studio to prototype a summarization application. They want to quickly test different foundation models and prompts without writing code. What should they use?

A.Bedrock Knowledge Bases

B.Bedrock Agents

C.Bedrock Guardrails

D.Bedrock Playground

AnswerD

The Playground provides an interactive console to try different models and prompts without coding.

Why this answer

Bedrock Playground is a no-code interface within Bedrock Studio for experimenting with models and prompts. Knowledge Bases, Agents, and Guardrails are not designed for rapid prototyping of prompts and models.

Full explanation →

656

MCQhard

A company is building a text classification system using embeddings. They need to choose between Amazon Titan Text Embeddings and Cohere Embed. The documents are in multiple languages, and the team requires strong cross-lingual performance without additional training. Which model is optimized for multilingual use cases?

A.Amazon Titan Text Embeddings

B.Neither model supports multilingual text

C.Both are equally multilingual

D.Cohere Embed

AnswerD

Cohere Embed is designed for multilingual text and provides cross-lingual embeddings out of the box.

Why this answer

Cohere Embed supports multiple languages natively, while Titan Text Embeddings is primarily English-focused.

Full explanation →

657

MCQmedium

An AI team uses the IAM policy shown in the exhibit to control endpoint creation. Why does this policy support responsible AI?

A.It requires human approval before deploying any model

B.It prevents the use of GPU instances to reduce cost

C.It ensures data capture is enabled for model monitoring

D.It restricts endpoints to only use models built in SageMaker

AnswerC

Data capture allows bias detection and explainability.

Why this answer

Option C is correct because the IAM policy includes a condition that enforces the `DataCaptureConfig.EnableCapture` parameter to be set to `true` when creating a SageMaker endpoint. This ensures that model monitoring data is automatically collected, which is a key practice for responsible AI as it allows continuous monitoring of model performance, bias detection, and drift analysis. Without data capture, teams cannot audit or validate model behavior in production, undermining accountability and transparency.

Exam trap

The AIF-C01 exam often tests the misconception that IAM policies for responsible AI focus on restricting model sources or instance types, when in fact the key mechanism is enforcing observability through data capture for ongoing monitoring.

How to eliminate wrong answers

Option A is wrong because the IAM policy does not include any condition requiring human approval (e.g., using `sts:AssumeRole` with MFA or a separate approval workflow); it only enforces data capture settings. Option B is wrong because the policy does not restrict instance types (e.g., GPU instances like `ml.p3.2xlarge`); it focuses solely on data capture configuration. Option D is wrong because the policy does not restrict endpoints to models built in SageMaker; it allows any model to be deployed as long as data capture is enabled, and there is no condition referencing model origin.

Full explanation →

658

MCQhard

A financial services company must comply with regulatory requirements that mandate explainability of credit scoring models. They have deployed a model using SageMaker and need to generate reports showing feature importance for each prediction. Which combination of services should they use to automate this?

A.SageMaker Model Monitor + Amazon QuickSight

B.SageMaker Ground Truth + AWS Lambda

C.SageMaker Clarify + SageMaker Pipelines

D.SageMaker Data Wrangler + SageMaker Studio

AnswerC

Clarify computes SHAP values; Pipelines automates the workflow to generate and store reports.

Why this answer

SageMaker Clarify can compute SHAP values for per-prediction explanations. SageMaker Pipelines can orchestrate the generation of reports on a schedule. Data Wrangler is for data preparation, Model Monitor for drift, Ground Truth for labeling – none generate per-prediction explanation reports automatically.

Full explanation →

659

MCQeasy

Refer to the exhibit. A data scientist is training a model in SageMaker using a KMS-encrypted dataset. The training job fails with the error shown. Which action should be taken to resolve this issue?

A.Add the SageMaker execution role to the KMS key policy with the kms:Decrypt permission.

B.Create a new KMS key and update the bucket policy to use the new key.

C.Attach an IAM policy to the SageMaker execution role that allows kms:Decrypt on the key.

D.Disable server-side encryption on the S3 bucket and use client-side encryption.

AnswerA

The key policy must explicitly grant the execution role the kms:Decrypt permission.

Why this answer

The error indicates the SageMaker execution role lacks permission to decrypt the KMS key. The most direct solution is to add the role to the KMS key policy with the kms:Decrypt permission. Option B is unnecessary; Option C changes encryption approach unnecessarily; Option D is also valid but less direct because the key policy must allow the role anyway, and modifying the key policy is the standard practice for SageMaker.

Full explanation →

660

MCQmedium

A developer is using Amazon Bedrock to generate text responses. They want to reduce the randomness of the output and make the model more deterministic. Which parameter should the developer decrease?

A.stop_sequences

B.Temperature

C.max_tokens

D.top_p

AnswerB

Lowering temperature (e.g., 0.1) makes the model choose the most likely tokens, reducing randomness.

Why this answer

Temperature controls randomness; lower values make the model more deterministic by sharpening the probability distribution.

Full explanation →

661

MCQeasy

A data scientist wants to quickly build a supervised learning model for binary classification on a tabular dataset with 10,000 rows and 200 features. The dataset has some missing values and requires minimal code. Which AWS service should the data scientist use?

A.Amazon SageMaker Studio Lab

B.Amazon SageMaker Clarify

C.Amazon SageMaker Autopilot

D.Amazon SageMaker JumpStart

AnswerC

Autopilot automates model building for tabular data.

Why this answer

Amazon SageMaker Autopilot is the correct choice because it automatically performs data preprocessing (including handling missing values), feature engineering, model selection, and hyperparameter tuning for supervised learning tasks like binary classification. It requires minimal code—users can simply point to a tabular dataset in Amazon S3 and specify the target column, and Autopilot will automatically train and evaluate multiple candidate models, making it ideal for quickly building a binary classifier on a 10,000-row, 200-feature dataset with missing values.

Exam trap

The AIF-C01 exam often tests the distinction between automated ML services (Autopilot) and model hosting or development environments (Studio Lab, JumpStart), so the trap here is that candidates may confuse SageMaker Autopilot with SageMaker JumpStart, thinking JumpStart also automates model building, when in fact JumpStart only provides pre-built models and requires manual configuration.

How to eliminate wrong answers

Option A is wrong because Amazon SageMaker Studio Lab is a free, no-code ML development environment that provides JupyterLab notebooks and limited compute resources, but it does not automate model building or handle missing values—it requires the user to write all code manually. Option B is wrong because Amazon SageMaker Clarify is designed for bias detection, model explainability, and fairness analysis, not for building or training supervised learning models; it cannot handle missing values or perform automated model selection. Option D is wrong because Amazon SageMaker JumpStart provides pre-built models and solutions for transfer learning and fine-tuning, but it does not automatically preprocess missing values or perform automated model selection for tabular binary classification—it requires the user to select and configure a model manually.

Full explanation →

662

MCQmedium

A company wants to automatically detect anomalies in server metrics. Which algorithm is most appropriate?

A.XGBoost

B.One-class SVM

C.Linear SVM

D.K-Means

AnswerB

One-class SVM is commonly used for anomaly detection by learning a boundary around normal data.

Why this answer

One-class SVM is specifically designed for anomaly detection, as it learns a boundary around the normal data points in the feature space and identifies any point falling outside this boundary as an anomaly. This makes it ideal for detecting unusual patterns in server metrics without requiring labeled anomaly examples.

Exam trap

The AIF-C01 exam often tests the distinction between supervised and unsupervised learning, and the trap here is that candidates may choose XGBoost or Linear SVM because they are familiar with them for classification, forgetting that anomaly detection typically requires a one-class approach when only normal data is available.

How to eliminate wrong answers

Option A is wrong because XGBoost is a supervised ensemble learning algorithm used for classification and regression, not for unsupervised anomaly detection; it requires labeled training data and is not designed to identify outliers without prior examples. Option C is wrong because Linear SVM is a supervised binary classifier that separates data into two classes using a hyperplane, and it cannot perform one-class anomaly detection without negative samples. Option D is wrong because K-Means is an unsupervised clustering algorithm that partitions data into clusters based on distance, but it does not inherently detect anomalies; while outliers can be inferred from cluster distances, it is not a dedicated anomaly detection method and lacks the statistical boundary learning of one-class SVM.

Full explanation →

663

Multi-Selecteasy

Which TWO actions are essential for ensuring accountability in AI systems according to AWS responsible AI guidelines?

Select 2 answers

A.Automate all decisions to ensure consistency

B.Establish clear human oversight and decision-making authority

C.Maintain detailed documentation and version control for models

D.Remove all human review processes to eliminate bias

E.Share raw training data publicly for transparency

AnswersB, C

Human oversight ensures accountability for AI outcomes.

Why this answer

Accountability requires clear ownership and documentation. Establishing human oversight and maintaining version control of models are key. Removing humans reduces accountability.

Using only automated decisions removes oversight. Sharing raw data broadly may violate privacy.

Full explanation →

664

MCQhard

Refer to the exhibit. A developer is optimizing latency for a generative AI model deployed on SageMaker. Based on the exhibit, which change would most likely reduce per-token latency?

A.Use a CPU instance

B.Reduce model size through quantization

C.Switch to a larger instance type

D.Increase batch size to 10

AnswerB

Quantization reduces the precision of model weights, decreasing compute per token and thus latency.

Why this answer

Option C, reduce model size through quantization, directly reduces computation per token, lowering latency. Option A (larger instance) may help but is less targeted. Option B (increase batch size) improves throughput but not per-token latency.

Option D (CPU instance) would increase latency.

Full explanation →

665

MCQeasy

Which vector store is a fully managed AWS service that can be used with Amazon Bedrock Knowledge Bases for semantic search?

A.Amazon DynamoDB

B.Amazon RDS for MySQL

C.Amazon S3

D.Amazon OpenSearch Serverless

AnswerD

OpenSearch Serverless supports vector engine for semantic search.

Why this answer

Amazon OpenSearch Serverless is a fully managed AWS vector database. The others are either not AWS-managed or not vector stores.

Full explanation →

666

MCQhard

An enterprise wants to ensure that generative AI applications built on AWS comply with data privacy regulations. They need to prevent the model from using customer data in future training. Which feature of Amazon Bedrock should they enable?

A.Policy-based data governance

B.Opt-out of model improvement

C.Data encryption at rest

D.Model customization with customer data

AnswerB

Opting out ensures customer data is not used for AWS model training or service improvement.

Why this answer

Option D, opt-out of model improvement, prevents AWS from using customer data for service improvement and training. Option A (encryption) protects data at rest but does not prevent use in training. Option B (model customization) may actually use customer data.

Option C (policy-based data governance) is not a specific Bedrock feature for this purpose.

Full explanation →

667

MCQmedium

A data scientist is using Amazon SageMaker to train a model and wants to understand the contribution of each feature to individual predictions. Which technique should they use to generate local explanations?

A.Permutation feature importance

B.Global feature importance

C.SHAP values

D.Partial dependence plots

AnswerC

SHAP values decompose predictions into additive feature contributions for each instance.

Why this answer

SHAP values provide per-instance (local) explanations of feature contributions. Permutation feature importance, partial dependence plots, and global feature importance all give global explanations, not local.

Full explanation →

668

Multi-Selecthard

A company is using Amazon Bedrock to generate creative marketing copy. They want to reduce the randomness of the output while maintaining diversity. Which TWO parameters should they adjust?

Select 2 answers

A.Increase the temperature

B.Increase the max token count

C.Increase the top_k value

D.Decrease the top_p value

E.Decrease the temperature

AnswersD, E

Lower top_p reduces the set of possible tokens, making output less random.

Why this answer

Decreasing the temperature (Option E) reduces randomness by lowering the probability of sampling lower-ranked tokens, making the model more deterministic. Decreasing top_p (Option D) narrows the cumulative probability threshold for token selection, which also reduces randomness while still allowing some diversity within the narrowed set. Together, these parameters control the trade-off between creativity and determinism in Amazon Bedrock's text generation.

Exam trap

The AIF-C01 exam often tests the misconception that increasing top_k or top_p reduces randomness, when in fact increasing either expands the token pool and can increase randomness, while decreasing them is what reduces randomness.

Full explanation →

669

MCQmedium

A company is using Amazon Rekognition to detect objects in images. They find that the service sometimes mislabels objects. What is the best way to improve accuracy for their specific use case?

A.Use a larger image size

B.Contact AWS support

C.Increase the confidence threshold

D.Use Amazon SageMaker to build a custom model

AnswerD

A custom model trained on domain-specific data can significantly improve accuracy.

Why this answer

Amazon Rekognition is a pre-trained service that may not perform optimally for specialized or domain-specific use cases. By using Amazon SageMaker to build a custom model, you can train a model on your own labeled dataset, which directly addresses the mislabeling issue by tailoring the model to your specific images and objects.

Exam trap

The trap here is that candidates often assume increasing the confidence threshold is a universal fix for accuracy issues, but the AIF-C01 exam tests the understanding that pre-trained services have limitations and that custom training (via SageMaker) is required for domain-specific improvements.

How to eliminate wrong answers

Option A is wrong because using a larger image size does not inherently improve Rekognition's detection accuracy; the service already resizes images to a standard input size, and larger images may only increase processing time without correcting mislabeling. Option B is wrong because contacting AWS support will not modify the underlying pre-trained model or improve its accuracy for your specific use case; support can only assist with service configuration or bugs, not model retraining. Option C is wrong because increasing the confidence threshold reduces false positives but does not fix systematic mislabeling; it may cause the service to return fewer results, potentially missing correct detections, without addressing the root cause of incorrect object identification.

Full explanation →

670

Multi-Selecthard

Which THREE considerations are essential when deploying a generative AI application in a regulated industry such as healthcare?

Select 3 answers

A.Lowest possible inference latency for real-time responses.

B.Full audit trail of model inputs and outputs for accountability.

C.Robust content filtering to block harmful or inaccurate outputs.

D.Maximum creative freedom for the model to generate diverse responses.

E.Data privacy and compliance with regulations like HIPAA.

AnswersB, C, E

Required for compliance and investigation.

Why this answer

Options A, B, and D are correct. Data privacy and compliance (e.g., HIPAA) are mandatory. Robust filtering for harmful output is required to prevent harm.

Full auditability of model responses is needed for regulatory compliance. Option C is wrong because creative freedom is often restricted in regulated contexts. Option E is wrong because faster inference is a performance concern, not a regulatory essential.

Full explanation →

671

MCQmedium

A company is using Amazon Bedrock Knowledge Bases to power a legal document Q&A application. They need to ensure that the model only answers based on the retrieved documents and does not generate information not present in the documents. Which feature should they enable?

A.Content filtering

B.Topic denial

C.Grounding check

D.PII detection

AnswerC

Grounding check verifies that the response is supported by the retrieved source documents, reducing hallucinations.

Why this answer

Bedrock Guardrails' grounding check ensures the model's response is grounded in the retrieved source documents and can flag or block ungrounded content.

Full explanation →

672

MCQhard

A company is building a resume screening model and discovers that the training data contains only resumes from one gender, leading to biased predictions. Which type of bias does this represent, and what is the most effective mitigation strategy?

A.Aggregation bias; mitigate by using a single model for all groups

B.Representation bias; mitigate by collecting more diverse training data

C.Measurement bias; mitigate by using more precise measurement tools

D.Historical bias; mitigate by removing sensitive attributes from the model

AnswerB

Representation bias stems from underrepresentation of groups in training data. Collecting diverse data is the most direct mitigation.

Why this answer

Representation bias occurs when certain groups are underrepresented in the training data. Mitigation includes collecting more diverse data or using techniques like re-weighting or synthetic data generation.

Full explanation →

673

MCQmedium

A data science team is using Amazon SageMaker to train multiple models with different hyperparameters. They want to track metrics, compare runs, and reproduce the best result. Which SageMaker feature should they use?

A.SageMaker Model Registry

B.SageMaker Debugger

C.SageMaker Autopilot

D.SageMaker Experiments

AnswerD

Experiments provides a framework for tracking and comparing multiple training runs.

Why this answer

SageMaker Experiments is the correct feature because it is specifically designed to track, organize, and compare machine learning training runs (trials) with different hyperparameters and metrics. It allows data scientists to log parameters, metrics, and artifacts for each run, compare results across runs, and retrieve the exact configuration needed to reproduce the best-performing model.

Exam trap

The trap here is that candidates often confuse SageMaker Experiments with SageMaker Model Registry, mistakenly thinking that model versioning and run tracking are the same feature, when in fact Experiments focuses on the iterative training process and Registry focuses on the final model lifecycle.

How to eliminate wrong answers

Option A is wrong because SageMaker Model Registry is a catalog for managing and versioning trained models, not for tracking and comparing individual training runs or hyperparameter experiments. Option B is wrong because SageMaker Debugger monitors training jobs in real time for issues like vanishing gradients or overfitting, but it does not provide a structured way to log, compare, or reproduce runs with different hyperparameters. Option C is wrong because SageMaker Autopilot automatically explores different algorithms and hyperparameters to find the best model, but it does not give the team the ability to manually track, compare, and reproduce their own custom runs with specific hyperparameters.

Full explanation →

674

MCQmedium

A company wants to build a customer service chatbot that answers questions about their internal policy documents. The documents are updated monthly, and the team cannot afford to retrain a model each time. Which approach is MOST appropriate?

A.Fine-tune a base LLM on the policy documents monthly

B.Use a larger foundation model with a longer context window and paste all documents into each prompt

C.Train a custom model from scratch on the policy documents each month

D.Use Retrieval-Augmented Generation (RAG) with the policy documents indexed in a vector store

AnswerD

RAG retrieves relevant document chunks at query time, ensuring the chatbot always answers from the latest uploaded documents without any model retraining.

Why this answer

RAG (Retrieval-Augmented Generation) allows the LLM to retrieve relevant document sections at inference time, so knowledge stays current without retraining. The other options either require expensive retraining for each update or lack document grounding.

Full explanation →

675

MCQhard

A team is developing a real-time code completion feature using an LLM deployed on Amazon SageMaker. They observe high latency under load. Which optimization technique should they prioritize?

A.Increase batch size

B.Switch to a larger instance type

C.Increase instance count with Auto Scaling

D.Use model quantization

AnswerD

Quantization reduces model precision and size, leading to faster inference with minimal accuracy loss.

Why this answer

Option B, model quantization, reduces the model size and speeds up inference directly, lowering latency. Option A (Auto Scaling) improves throughput but not per-request latency. Option C (increase batch size) improves throughput but may increase per-token latency.

Option D (larger instance) may improve but not as effectively as quantization.

Full explanation →

Page 9 of 14

All pages

1 2 3 4 5 6 7 8 9 10 11 12 13 14

Practice AIF-C01 by domain

Target a specific domain to shore up weak areas.

Applications of Foundation Models Fundamentals of AI and ML Fundamentals of Generative AI Guidelines for Responsible AI Security, Compliance and Governance for AI Solutions

See all domains with question counts →