CCNA Guidelines for Responsible AI Questions

75 of 77 questions · Page 1/2 · Guidelines for Responsible AI · Answers revealed

1
MCQhard

A healthcare company must train a model on sensitive patient data while complying with privacy regulations. They want to add noise to the training process to prevent re-identification. Which technique should they implement?

A.Differential privacy
B.k-anonymity
C.Federated learning
D.Homomorphic encryption
AnswerA

Differential privacy adds calibrated noise to training to protect individual data points.

Why this answer

Option B is correct because differential privacy injects controlled noise into the training algorithm to protect individual records. k-anonymity (A) focuses on generalization, not noise injection. Homomorphic encryption (C) allows computation on ciphertext but is not typically used during training. Federated learning (D) trains on decentralized data but does not inherently provide differential privacy guarantees.

2
MCQmedium

A healthcare company is training a model on sensitive patient data using Amazon SageMaker. They need to ensure that individual patient data cannot be reverse-engineered from the model. Which technique should they implement during training?

A.Data encryption at rest
B.AWS Identity and Access Management (IAM) policies
C.Differential privacy
D.SageMaker Model Monitor
AnswerC

Differential privacy provides mathematical guarantees that the model does not memorize individual data points.

Why this answer

Differential privacy adds noise to the training process to protect individual records. Data encryption and IAM control access but do not prevent inference from model parameters; Model Monitor is for post-deployment monitoring.

3
MCQmedium

Refer to the exhibit. An AWS customer runs SageMaker Clarify to evaluate bias in their training data. The report shows multiple metrics with status 'violated'. What should the customer do next?

A.Use data augmentation to balance the dataset
B.Reduce the number of features
C.Retrain the model with more data
D.Ignore the metrics because thresholds are too strict
AnswerA

Data augmentation can balance representation.

Why this answer

Option B is correct: Data augmentation or resampling can address class imbalance and demographic parity issues. Option A is wrong: Simply retraining with more data may not fix imbalance. Option C is wrong: Ignoring violations is irresponsible.

Option D is wrong: Reducing features may not help.

4
MCQeasy

Refer to the exhibit. An ML team finds that their training data is stored in two subfolders under s3://my-bucket/train/. They need to ensure that the dataset is balanced for training a classification model. What should they do?

A.Use AWS Glue to create a balanced dataset
B.Use Amazon Rekognition custom labels
C.Count the number of files in each subfolder and resample
D.Enable versioning on the bucket
AnswerC

Assessing distribution and resampling ensures balance.

Why this answer

Option B is correct: They need to check the number of files in each subfolder and resample if necessary. Option A is wrong: AWS Glue can be used but not specifically for balancing. Option C is wrong: Rekognition custom labels are for image labeling.

Option D is wrong: Versioning does not affect balance.

5
MCQmedium

An e-commerce company uses an Amazon Lex chatbot to handle customer inquiries. They want to implement human oversight for sensitive interactions, such as when the chatbot cannot provide a confident response. Which AWS service should they integrate?

A.Amazon Rekognition
B.Amazon Comprehend
C.Amazon Augmented AI (A2I)
D.Amazon SageMaker Ground Truth
AnswerC

A2I provides workflows to route predictions to humans for review when confidence is low.

Why this answer

Amazon Augmented AI (A2I) enables human review loops for low-confidence predictions or sensitive cases. Other services are not designed for human-in-the-loop in conversational AI.

6
MCQmedium

A team is deploying a regression model for loan approval. To ensure transparency for regulators, they need to explain individual predictions. Which interpretability method can provide local explanations by approximating the model with a simpler surrogate?

A.SHAP values
B.Partial dependence plots
C.LIME
D.Permutation feature importance
AnswerC

LIME creates local surrogate models to explain individual predictions.

Why this answer

Option C is correct because LIME (Local Interpretable Model-agnostic Explanations) generates local explanations by fitting a simpler model around each prediction. SHAP (option A) is also local but uses Shapley values; partial dependence (option B) is global; permutation importance (option D) is global feature importance.

7
MCQeasy

Refer to the exhibit. An AWS administrator sets up a SageMaker Model Monitor schedule for bias detection. What is the primary purpose of this configuration?

A.To retrain the model weekly
B.To replace the model with a new version
C.To monitor data drift in training data
D.To generate weekly bias reports for the deployed endpoint
AnswerD

The schedule runs monitoring jobs weekly, comparing current predictions to baseline and outputting results to S3.

Why this answer

The configuration creates a weekly monitoring job (cron expression for Monday) that compares endpoint predictions against a baseline and outputs bias reports to S3. It does not retrain, replace the model, or monitor training data drift.

8
MCQhard

A data scientist runs the SageMaker Clarify job shown in the exhibit for a credit risk model. After reviewing the results, they find a high bias metric for the gender facet. Which action is most consistent with responsible AI?

A.Proceed with deployment because the model is already in production
B.Remove the gender attribute from the training data and retrain
C.Investigate the root cause and retrain with balanced data
D.Increase the acceptance threshold for the model
AnswerC

Root cause analysis and retraining address bias.

Why this answer

Option C is correct because responsible AI requires understanding and mitigating bias at its source, not just masking it. Investigating the root cause (e.g., data collection bias, labeling bias, or proxy features) and retraining with balanced data directly addresses the high bias metric detected by SageMaker Clarify, aligning with AWS's principle of fairness. Simply removing the gender attribute may not eliminate bias if other features act as proxies, and increasing the threshold does not fix the underlying model bias.

Exam trap

Cisco often tests the misconception that simply removing a sensitive attribute (like gender) is sufficient to eliminate bias, but the trap here is that proxy features can still encode the same bias, making root-cause investigation and balanced retraining the only responsible action.

How to eliminate wrong answers

Option A is wrong because deploying a model with a known high bias metric violates responsible AI principles and could lead to unfair outcomes, even if the model is already in production; SageMaker Clarify is designed to detect such issues before or during deployment. Option B is wrong because removing the gender attribute alone does not guarantee bias removal—other features like zip code or income can act as proxies for gender, and the model may still learn biased correlations. Option D is wrong because increasing the acceptance threshold (e.g., for a binary classifier) only changes the decision boundary, not the underlying biased patterns learned by the model; it does not reduce the bias metric reported by Clarify.

9
MCQeasy

A social media company uses Amazon Comprehend to moderate user comments. They want to avoid censoring legitimate speech while catching hate speech. Which approach aligns with responsible AI governance?

A.Implement a human-in-the-loop review for borderline cases
B.Use multiple models and average their scores
C.Use a single model with high confidence threshold
D.Rely solely on automated filtering
AnswerA

Human review adds nuance.

Why this answer

Option B is correct: Human-in-the-loop review for borderline cases reduces false positives. Option A is wrong: High threshold may miss hate speech. Option C is wrong: Full automation can censor legitimate speech.

Option D is wrong: Averaging models may not help.

10
Multi-Selectmedium

Which THREE practices are recommended for promoting robustness and security in AI systems?

Select 3 answers
A.Deploy the model immediately after training without validation
B.Implement strong access controls and encryption for model artifacts
C.Regularly test the model against adversarial examples
D.Monitor model performance for data drift and concept drift
E.Remove logging and monitoring to improve performance
AnswersB, C, D

Security controls protect models from unauthorized access and tampering.

Why this answer

Robustness and security involve testing for adversarial inputs, monitoring data drift, and implementing secure access controls. Using unvalidated models is risky. Removing logging reduces traceability.

11
Multi-Selectmedium

Which TWO actions should a data scientist take to evaluate fairness of a binary classification model using Amazon SageMaker Clarify? (Choose two.)

Select 2 answers
A.Use post-training bias metrics like Difference in Positive Proportions
B.Ensure the training dataset is balanced by resampling
C.Generate SHAP values for feature importance
D.Use pre-training bias metrics such as Class Imbalance
E.Run a data quality monitoring job on unlabeled data
AnswersA, D

Post-training metrics compare predictions across groups.

Why this answer

Options A and D are correct. SageMaker Clarify computes pre-training bias metrics (like Class Imbalance) and post-training metrics (like Difference in Positive Proportions) to assess fairness. Option B is for explainability, not bias.

Option C is for unlabeled data, not model evaluation. Option E is a general practice but not specific to Clarify fairness evaluation.

12
MCQhard

A security team is concerned about adversarial attacks on their image classification model deployed on Amazon SageMaker. They want to test robustness against carefully crafted inputs that cause misclassification. What approach should they use?

A.Data augmentation on the training set
B.A/B testing between two similar models
C.SageMaker Model Monitor with adversarial drift
D.Generating adversarial examples using SageMaker Clarify
AnswerD

Clarify includes adversarial validation capabilities to test robustness.

Why this answer

Option D is correct because adversarial validation involves generating adversarial examples using attacks like FGSM to test model robustness. Model Monitor (A) does not test adversarial robustness. Data augmentation (B) improves generalization but is not a test.

A/B testing (C) compares models, not robustness.

13
MCQmedium

Refer to the exhibit. A data scientist runs an Amazon SageMaker Clarify bias analysis on a binary classifier. The pre-training ClassImbalance is 1.5 and the post-training DPPL is 0.15. What should the data scientist conclude?

A.The data is highly imbalanced and the model is unbiased.
B.The data has a mild class imbalance, but the model shows a noticeable bias in predictions.
C.The pre-training metric indicates a fairness issue, but the post-training metric is acceptable.
D.The data is perfectly balanced and the model is fair.
AnswerB

ClassImbalance of 1.5 is moderate; DPPL of 0.15 indicates a 15% difference, which is concerning.

Why this answer

Option B is correct. A ClassImbalance of 1.5 indicates the majority class is 1.5x the minority, mild imbalance. A DPPL of 0.15 indicates a 15% difference in positive prediction rates between groups, which is a significant fairness concern.

Option A misinterprets both; C is wrong because bias is present; D confuses the metrics.

14
Multi-Selecthard

Which THREE practices support transparency in AI systems? (Select three.)

Select 3 answers
A.Implement explainability methods like SHAP
B.Use open-source models exclusively
C.Provide documentation of model limitations
D.Publish model performance metrics
E.Allow users to opt out of data collection
AnswersC, D, E

Documentation informs users about model constraints.

Why this answer

Options A, B, and D are correct: documentation, user opt-out, and published performance metrics. Option C is not necessary, option E is explainability, which is related but not transparency per se.

15
Multi-Selectmedium

A company is deploying an AI-based diagnostic system in healthcare. Which THREE practices align with AWS responsible AI guidelines? (Choose THREE.)

Select 3 answers
A.Deploy the model in production immediately after training without manual review.
B.Continuously monitor model performance for drift using SageMaker Model Monitor.
C.Use only automated decision-making without any human oversight.
D.Document the model's intended use and limitations with model cards.
E.Implement a human-in-the-loop process for high-risk predictions using Amazon A2I.
AnswersB, D, E

Monitoring ensures ongoing reliability and safety.

Why this answer

Continuous monitoring, model cards documentation, and human-in-the-loop review are all recommended. Deploying without manual review and fully automated decisions without oversight violate responsible AI principles.

16
MCQeasy

A company uses Amazon SageMaker to build a binary classification model for loan approvals. After training, the data science team wants to evaluate the model for potential bias against a protected group. Which AWS service should they use to compute bias metrics?

A.Amazon SageMaker Model Monitor
B.Amazon SageMaker Debugger
C.Amazon SageMaker Clarify
D.Amazon SageMaker Experiments
AnswerC

SageMaker Clarify provides built-in bias detection metrics and explainability.

Why this answer

Option A is correct because SageMaker Clarify is designed specifically for bias detection and explainability. Options B, C, and D serve other purposes (monitoring, debugging, experiment tracking) and do not compute bias metrics directly.

17
MCQmedium

A startup uses Amazon Lex to build a chatbot for mental health support. They must ensure user conversations are private and not used for model improvement. Which AWS service can help anonymize text data before storage?

A.Amazon Textract
B.AWS Key Management Service (KMS)
C.Amazon Comprehend
D.Amazon Macie
AnswerC

Comprehend's PII detection can be used to redact entities.

Why this answer

Option C is correct: Amazon Comprehend can detect and redact personally identifiable information (PII). Option A is wrong: KMS encrypts but doesn't redact. Option B is wrong: Macie discovers sensitive data but doesn't anonymize.

Option D is wrong: Textract extracts text from documents.

18
Multi-Selecteasy

Which TWO practices help ensure transparency in AI systems? (Choose 2)

Select 2 answers
A.Combine multiple models to obscure decision logic
B.Use model-agnostic explainability tools like SHAP
C.Remove all features except the most predictive ones
D.Provide documentation on model limitations and data sources
E.Use black-box models to protect proprietary algorithms
AnswersB, D

Explainability tools clarify decisions.

Why this answer

Option B is correct because model-agnostic explainability tools like SHAP (SHapley Additive exPlanations) provide post-hoc explanations for any machine learning model by computing feature contributions based on cooperative game theory. This allows stakeholders to understand how each input feature influences a prediction, directly supporting transparency without requiring access to the model's internal structure.

Exam trap

Cisco often tests the misconception that transparency means simplifying the model (e.g., removing features) or hiding logic (e.g., using ensembles or black-box models), when in fact transparency is achieved through explainability tools and thorough documentation of limitations and data sources.

19
Multi-Selecteasy

Which TWO actions can help mitigate bias in a face recognition model trained on AWS? (Select two.)

Select 2 answers
A.Ensure the training dataset is balanced across demographics
B.Regularly evaluate model performance across subgroups
C.Deploy the model in multiple regions
D.Use a larger neural network
E.Use Amazon Rekognition's content moderation
AnswersA, B

Balanced data reduces demographic bias.

Why this answer

Options A and C are correct: balanced dataset and subgroup evaluation. Option B is for content moderation, D is network size irrelevant, E is deployment not mitigation.

20
MCQhard

A healthcare organization is developing a clinical decision support system using Amazon Bedrock with a large language model (LLM) to analyze patient symptoms and suggest potential diagnoses. The system must comply with HIPAA and internal responsible AI guidelines. During testing, the model occasionally generates diagnoses that are inconsistent with established medical guidelines and shows a tendency to recommend more aggressive treatments for patients from certain demographic groups. The team has already implemented data encryption, access controls, and basic content filtering. They need to further reduce biased and unsafe outputs without delaying the deployment timeline. What should the team do next?

A.Increase the logging of all model inputs and outputs to Amazon CloudWatch and set up alarms for any mentions of protected attributes.
B.Replace the current LLM with a different pre-trained model that has been benchmarked for lower bias on medical datasets.
C.Fine-tune the model using a curated dataset of anonymized patient records that is balanced across demographic groups and aligned with clinical guidelines.
D.Apply stronger content filtering rules using Amazon Comprehend Medical to block any diagnosis that contains demographic-related terms.
AnswerC

Fine-tuning on a balanced, guideline-aligned dataset reduces both bias and inaccuracy by teaching the model correct patterns.

Why this answer

Option C is correct because fine-tuning the model with a balanced, curated dataset directly addresses both the bias and clinical accuracy issues at the model level, which is the most effective approach for reducing biased and unsafe outputs without delaying deployment. This method adjusts the model's internal weights to align with established medical guidelines and demographic fairness, rather than relying on post-processing filters or logging that do not fix the root cause. Since the team has already implemented basic content filtering, fine-tuning provides a targeted, efficient solution that can be completed within a reasonable timeline.

Exam trap

The trap here is that candidates may confuse monitoring and logging (Option A) with actual bias mitigation, or assume that a different pre-trained model (Option B) will inherently solve domain-specific bias without requiring additional fine-tuning or validation.

How to eliminate wrong answers

Option A is wrong because increasing logging and setting alarms for protected attributes only monitors for bias after it occurs, but does not prevent or reduce biased or unsafe outputs; it adds operational overhead without addressing the model's behavior. Option B is wrong because replacing the current LLM with a different pre-trained model introduces significant risk of deployment delays due to re-evaluation, integration, and compliance validation, and does not guarantee lower bias on the specific medical domain without further customization. Option D is wrong because applying stronger content filtering with Amazon Comprehend Medical to block diagnoses containing demographic terms is a blunt, post-processing approach that can suppress legitimate clinical information and still allow biased patterns that do not explicitly mention protected attributes, failing to address the underlying model bias.

21
MCQhard

A company uses an AI system to automate loan approvals. The model uses demographic features and achieves high accuracy, but the company wants to ensure compliance with responsible AI guidelines. Which practice best balances performance and fairness?

A.Use demographic features but with minimal monitoring
B.Use a complex black-box model and rely on post-hoc explanations
C.Remove sensitive attributes and monitor for proxy bias
D.Optimize the model solely for accuracy on historical data
AnswerC

Removing attributes reduces direct bias, monitoring detects proxies.

Why this answer

Option C is correct because removing sensitive attributes (e.g., race, gender) from the training data directly addresses fairness by preventing the model from explicitly using these features. However, simply removing them is insufficient; monitoring for proxy bias (e.g., zip code or income correlating with race) is critical to ensure the model does not inadvertently learn discriminatory patterns through correlated features. This approach balances performance by retaining predictive power from non-sensitive features while actively auditing for fairness violations.

Exam trap

Cisco often tests the misconception that simply removing sensitive attributes from the dataset guarantees fairness, without considering proxy bias or the need for ongoing monitoring.

How to eliminate wrong answers

Option A is wrong because using demographic features with minimal monitoring violates responsible AI guidelines; it risks encoding historical biases and does not mitigate fairness concerns, as even high-accuracy models can be discriminatory. Option B is wrong because relying on a complex black-box model with post-hoc explanations (e.g., SHAP or LIME) does not inherently ensure fairness; post-hoc explanations can be unreliable and do not prevent the model from learning biased correlations from sensitive attributes. Option D is wrong because optimizing solely for accuracy on historical data ignores fairness; historical data often contains systemic biases, and maximizing accuracy can amplify those biases, leading to unfair outcomes for protected groups.

22
MCQhard

A government agency uses Amazon Rekognition for identity verification. They want to ensure the model is robust against adversarial attacks. What should they do?

A.Use SageMaker Model Monitor to detect adversarial inputs
B.Implement input validation and use Amazon SageMaker Shadow Testing
C.Train the model with adversarial examples
D.Apply AWS Shield Advanced
AnswerC

Adversarial training hardens the model.

Why this answer

Option A is correct: Training with adversarial examples improves robustness. Option B is wrong: Model Monitor detects drift, not adversarial inputs. Option C is wrong: Input validation helps but not the primary defense.

Option D is wrong: AWS Shield protects against DDoS.

23
MCQeasy

A hospital uses an AI system to prioritize patients for organ transplant based on predicted survival rates. The system was trained on historical data that includes socioeconomic factors. A review reveals that the system systematically assigns lower priority to patients from lower-income neighborhoods, even when medical urgency is similar. The hospital's ethics board demands an immediate remedy. The data science team is small and must act quickly. What should the hospital do to address this fairness issue most effectively?

A.Discontinue the AI system and have all prioritization done by a human committee
B.Retrain the model with only medically relevant features, after removing socioeconomic factors and correlated proxies
C.Apply a re-weighting penalty to boost priority for low-income patients
D.Use a different model type, such as a random forest instead of gradient boosting, on the same data
AnswerB

Removing biased features addresses the root cause.

Why this answer

The best course is to retrain the model using only medically relevant features, removing socioeconomic factors and correlated proxies. This directly addresses the source of bias. Adding a penalty for low-income patients is artificial and may not reflect medical reality.

Relying solely on human review delays the issue and introduces potential inconsistency. Using a different model without data changes may not eliminate bias.

24
MCQmedium

A healthcare startup deploys a model to predict patient readmission risk using Amazon SageMaker. After deployment, the model shows higher false-positive rates for a specific age group. What is the most responsible first step?

A.Increase the prediction threshold for the affected group
B.Use Amazon SageMaker Clarify to detect bias in predictions
C.Retrain the model with more data from the affected group
D.Immediately retire the model to prevent harm
AnswerB

Clarify provides bias metrics to inform next steps.

Why this answer

Amazon SageMaker Clarify is purpose-built for detecting bias in ML models and data. It provides bias metrics (e.g., Difference in Positive Proportions in Predicted Labels, Disparate Impact) that can quantify whether the model's predictions are systematically skewed against a specific age group. This is the most responsible first step because it objectively measures the bias before any corrective action is taken.

Exam trap

AWS often tests the misconception that the first step to address bias is to immediately retrain or adjust thresholds, rather than using a dedicated bias detection tool like SageMaker Clarify to first diagnose the nature and extent of the bias.

How to eliminate wrong answers

Option A is wrong because increasing the prediction threshold for the affected group is a post-hoc adjustment that does not address the root cause of bias and can introduce new fairness issues or degrade overall model performance. Option C is wrong because retraining with more data from the affected group assumes the bias stems from data imbalance, but without first using SageMaker Clarify to confirm the bias source, this could be ineffective or even harmful (e.g., if bias is due to feature encoding or labeling). Option D is wrong because immediately retiring the model is an overreaction that ignores the possibility of mitigation; responsible AI practices require diagnosis before drastic action.

25
MCQmedium

A healthcare organization uses an AI model to predict patient readmission risks. The model's predictions are used by doctors to allocate follow-up care. The organization wants to ensure compliance with responsible AI guidelines. Which practice best supports explainability?

A.Ensuring the model's overall accuracy exceeds 95%
B.Using a black-box ensemble model that achieves highest accuracy
C.Automating decisions without human review to reduce bias
D.Providing feature importance scores for each prediction
AnswerD

Feature importance helps stakeholders understand model reasoning, supporting explainability.

Why this answer

Providing feature importance scores helps doctors understand why the model made a specific prediction, aligning with explainability. Removing humans oversimplifies accountability. Using a black-box model reduces explainability.

Relying solely on accuracy ignores interpretability.

26
MCQeasy

A data scientist wants to detect potential bias in a binary classification model before deployment. Which AWS service can analyze the model's predictions across different demographic groups?

A.Amazon SageMaker Ground Truth
B.Amazon CloudWatch Logs Insights
C.Amazon SageMaker Clarify
D.Amazon SageMaker Model Monitor
AnswerC

SageMaker Clarify is specifically designed for bias detection and explainability.

Why this answer

Option A is correct because Amazon SageMaker Clarify provides built-in bias metrics and explainability analysis for machine learning models. Options B, C, D are incorrect: SageMaker Model Monitor is for post-deployment monitoring; SageMaker Ground Truth is for labeling; CloudWatch Logs is for logging.

27
MCQhard

Refer to the exhibit. A data scientist runs SageMaker Clarify on a training dataset and receives the above JSON output. Which bias metric exceeds its threshold?

A.DPL
B.Label Imbalance
C.Class Imbalance
D.All metrics exceed thresholds
AnswerA

DPL (Difference in Positive Proportions in Predicted Labels) exceeds threshold.

Why this answer

The DPL value of 0.15 is greater than the threshold of 0.1, indicating significant bias. Class imbalance (0.3) and label imbalance (0.4) are below their respective thresholds.

28
MCQhard

Refer to the exhibit. A team is configuring a SageMaker Model Bias job. The baseline job has been completed. However, the bias job fails with a resource not found error. What is the most likely cause?

A.The StoppingCondition is too short
B.The BaseliningJobName is incorrect
C.The instance type ml.m5.large is not supported
D.The IAM role lacks permissions to DescribeBaselineJob
AnswerB

Typo or mismatch in the baseline job name.

Why this answer

Option C is correct: The BaseliningJobName must exactly match the name of the completed baseline job. Option A is wrong: IAM permissions would give a different error. Option B is wrong: ml.m5.large is supported.

Option D is wrong: Runtime is sufficient.

29
MCQeasy

A financial institution uses a machine learning model to approve loan applications. The model is trained on historical data that includes biased lending practices. What is the most effective first step to address potential bias?

A.Immediately deploy the model and monitor for biased outcomes
B.Retrain the model with synthetic data generated from the original dataset
C.Remove all demographic features from the model
D.Audit the training data for bias and review feature selection
AnswerD

Auditing data and features is the foundational step to identify and mitigate bias.

Why this answer

Before deploying, it is critical to evaluate the training data for bias. Auditing the data for representation and fairness helps identify and mitigate bias early. Post-deployment monitoring is secondary, and simply retraining without review may perpetuate bias.

Excluding demographic data might ignore important fairness dimensions.

30
MCQhard

A company deploys a deep learning model for image classification using Amazon SageMaker. They are concerned about adversarial attacks that could misclassify images with small perturbations. Which of the following is the most effective approach to improve model robustness?

A.Reduce training data size
B.Use early stopping during training
C.Apply adversarial training
D.Increase model complexity
AnswerC

Adversarial training includes perturbed examples in the training set, teaching the model to resist small changes.

Why this answer

Adversarial training incorporates adversarial examples during training, which is the most proven method to improve robustness against such attacks. Increasing model complexity, early stopping, or reducing data are not effective or may harm performance.

31
Multi-Selecteasy

Which TWO techniques provide interpretability for machine learning models at a local (per-prediction) level? (Choose two.)

Select 2 answers
A.SHAP values
B.Partial dependence plots
C.Confusion matrix
D.LIME
E.Permutation feature importance
AnswersA, D

SHAP provides local explanations based on Shapley values.

Why this answer

Options A and C are correct. LIME and SHAP are local interpretability methods. Option B (Permutation importance) is global.

Option D (Partial dependence) is global. Option E (Confusion matrix) is a performance metric, not interpretability.

32
MCQhard

An insurance company uses a machine learning model to adjust premiums. During a review, the model is found to be penalizing customers based on zip codes correlated with racial demographics, leading to potential discrimination. Which combination of actions best addresses this fairness issue while maintaining business value?

A.Remove the zip code feature from the model and retrain
B.Re-engineer features to avoid proxies for protected attributes and rebalance training data
C.Continue using the model but add a disclaimer about potential bias
D.Replace the model with a simpler linear model
AnswerB

This addresses both the features and the data representation, mitigating bias comprehensively.

Why this answer

The most effective approach is to re-engineer features to remove proxy variables for protected attributes (like race) AND to rebalance the training data to ensure fair representation. Simply removing zip code may not eliminate proxies, and ignoring the issue is not acceptable. Using a different model without data changes may not solve the problem.

33
MCQhard

A research lab uses Amazon SageMaker to train a deep learning model for medical diagnosis. They need to ensure the model's decisions are interpretable to clinicians. Which SageMaker feature provides local and global feature importance?

A.SageMaker Model Monitor
B.SageMaker Experiments
C.SageMaker Clarify
D.SageMaker Debugger
AnswerC

Clarify provides explainability metrics.

Why this answer

Option D is correct: SageMaker Clarify provides SHAP-based explanations for feature importance. Option A is wrong: Debugger is for debugging training. Option B is wrong: Model Monitor is for monitoring.

Option C is wrong: Experiments is for managing trials.

34
MCQeasy

A team is developing an AI system and wants to document key information such as intended use, performance benchmarks, and limitations. According to AWS best practices for responsible AI, what should they create?

A.A whitepaper
B.A business requirement document
C.A technical blog
D.Model cards
AnswerD

Model cards provide a structured summary of model characteristics, intended use, fairness, and limitations.

Why this answer

Model cards are a standardized format for documenting model details, promoting transparency. Other options are not specific to responsible AI documentation.

35
MCQhard

A media company uses a generative AI model to automatically create image captions for user-uploaded photos. During quality assurance, testers discover that the model sometimes generates captions that include stereotypes based on gender and race, even when the photos do not contain people. For example, a photo of a kitchen produces captions like 'woman cooking,' and a photo of a sports car generates 'man driving.' The company wants to launch the feature soon but recognizes the reputational risk. They have a limited budget and need to implement a solution that reduces harmful stereotypes without overly restricting the captions' creativity. The team has access to the model's training data, which is a large public dataset of image-caption pairs. Which approach should the team prioritize?

A.Replace the generative model with a simpler classification model that only describes objects
B.Use a different pre-trained generative model that is larger and more accurate
C.Filter the training data to remove or downweight pairs with stereotypes, then fine-tune the model
D.Add a post-processing filter that checks captions for known stereotype patterns and blocks them
AnswerC

Cleaning training data and fine-tuning directly reduces learned stereotypes.

Why this answer

To reduce stereotypical associations, the most effective approach is to filter and rebalance the training data by removing or downweighting caption pairs that contain stereotypes, and then fine-tune the model. This directly addresses the data source of bias. Adding a post-hoc filter may catch some stereotypes but could also remove valid captions.

Using a different model without data changes may not help. Limiting to object-only descriptions reduces creativity and may not meet product goals.

36
Multi-Selecthard

Which THREE considerations are important when implementing responsible AI for a production NLP system? (Choose three.)

Select 3 answers
A.Obtain FDA approval for the model
B.Continuously monitor model outputs for bias and drift
C.Apply encryption at rest for all training code
D.Publish model cards detailing intended use, performance, and limitations
E.Include bias detection in the CI/CD pipeline for every model update
AnswersB, D, E

Production models require ongoing monitoring to ensure fairness over time.

Why this answer

Options A, B, and D are correct. In production, continuous monitoring is vital (A), bias evaluation should be part of the MLOps pipeline (B), and model cards document model details (D). Option C is about code security, not responsible AI.

Option E is specific to medical devices, not generally required.

37
MCQeasy

After deploying a model, a company notices that the distribution of the input features has shifted compared to the training data. Which feature of Amazon SageMaker Model Monitor can alert them to this change?

A.Model quality monitoring
B.Bias drift monitoring
C.Feature importance drift
D.Data quality monitoring
AnswerD

Data quality monitors for drift in input features (baseline vs. live).

Why this answer

Option C is correct because SageMaker Model Monitor's Data Quality monitoring tracks distributions of input features and can alert on drift. Bias drift (A) is specific to demographic groups. Model quality (B) tracks prediction accuracy.

Feature importance drift (D) is not a standard monitoring type.

38
Multi-Selectmedium

Which TWO actions are most aligned with responsible AI practices when deploying a model that makes decisions affecting individuals? (Choose 2)

Select 2 answers
A.Collect as much data as possible without quality checks
B.Continuously monitor the model for fairness metrics
C.Ensure the development team is homogeneous to avoid conflicts
D.Use the most complex model available for maximum accuracy
E.Provide meaningful explanations for model decisions
AnswersB, E

Monitoring ensures ongoing fairness.

Why this answer

Options A and C are correct. Option A: Providing explanations supports transparency. Option C: Regular monitoring detects bias drift.

Option B is wrong because teams should include diverse perspectives. Option D is wrong because using the most complex model may harm explainability. Option E is wrong because more data alone does not ensure fairness.

39
MCQmedium

An e-commerce company uses a recommendation system built with Amazon Personalize. They want to explain to customers why certain products are recommended. Which AWS service can provide model explanations?

A.AWS Config
B.AWS CloudTrail
C.Amazon Detective
D.Amazon SageMaker Clarify
AnswerD

Clarify provides explainability for ML models.

Why this answer

Option D is correct: SageMaker Clarify can provide feature attribution for recommendations. Option A is wrong: AWS Config tracks resources. Option B is wrong: CloudTrail audits API calls.

Option C is wrong: Amazon Detective investigates security findings.

40
MCQmedium

A company wants to ensure accountability for its machine learning models by tracking all changes to the model and its training data. Which AWS feature should they use?

A.Amazon CloudWatch
B.Amazon SageMaker Experiments
C.AWS CloudTrail
D.AWS CodeCommit
AnswerC

CloudTrail logs all API calls for governance and accountability.

Why this answer

Option B is correct because AWS CloudTrail records API calls for auditing and accountability. SageMaker experiments (A) track training parameters but not all data changes. CodeCommit (C) is for source code, not model changes.

CloudWatch (D) monitors metrics but does not track changes comprehensively.

41
MCQhard

A large enterprise has multiple teams deploying ML models on AWS. To ensure governance and accountability, they need to enforce that all models pass a fairness review before production deployment. Which SageMaker feature should they use to implement this approval workflow?

A.SageMaker Studio
B.SageMaker Experiments
C.SageMaker Model Monitor
D.SageMaker Model Registry
AnswerD

Model Registry supports versioning, metadata, and approval statuses for governance.

Why this answer

SageMaker Model Registry manages model versions and supports approval statuses (e.g., Approved, Rejected). This integrates with CI/CD pipelines to enforce review gates. Experiments, Model Monitor, and Studio do not provide such workflow capabilities.

42
MCQeasy

A media company uses Amazon Transcribe for automatic speech recognition. They discover the model has higher error rates for non-native English speakers. Which Responsible AI principle are they failing to uphold?

A.Fairness
B.Explainability
C.Robustness
D.Privacy
AnswerA

Fairness addresses disparate impact.

Why this answer

Option C is correct: Fairness requires equitable performance across groups. Option A is wrong: Explainability is about transparency. Option B is wrong: Privacy is about data protection.

Option D is wrong: Robustness is about reliability under varied conditions.

43
Multi-Selecthard

A team is using Amazon Comprehend to analyze customer feedback for sentiment. They want to detect and mitigate potential bias against certain demographic groups. Which TWO approaches should they consider? (Choose TWO.)

Select 2 answers
A.Use AWS WAF to filter out biased comments.
B.Use AWS CloudTrail to audit API calls.
C.Use Amazon Rekognition to verify images.
D.Use SageMaker Clarify to compute bias metrics on the training data.
E.Use Comprehend custom classification with balanced training data across groups.
AnswersD, E

Clarify supports NLP bias detection and can analyze text datasets.

Why this answer

SageMaker Clarify can compute bias metrics on text data, and training with balanced data reduces bias. WAF is for web security, Rekognition is for image/video, CloudTrail is for auditing API calls – none are relevant to bias in NLP models.

44
MCQmedium

A government agency is deploying an AI system to detect fraudulent benefit claims. The system uses a neural network trained on historical claims data, which includes a disproportionate number of false positives from a particular ethnic group due to historical over-policing. The agency must ensure the system does not perpetuate discrimination. They have a rigorous testing procedure but limited budget. The project lead wants to balance fairness with detection performance. Which combination of steps should they prioritize?

A.Remove the race feature from the model and rely on performance metrics alone
B.Replace the neural network with a logistic regression model retrained on the same data
C.Rebalance the training data to have equal representation across groups and evaluate using a fairness metric like equal opportunity
D.Use a post-hoc explainability tool to identify biased predictions and manually override them
AnswerC

Rebalancing data and using fairness metrics directly mitigate bias and measure progress.

Why this answer

The most effective approach is to rebalance the training data to be more representative and to use a fairness metric, such as equal opportunity, during evaluation. This directly addresses the data bias and quantifies fairness. Excluding race features may still leave proxies.

Using a simpler model may not eliminate bias if data is biased. Post-hoc explanations help understand bias but do not fix it.

45
MCQhard

A company is deploying a generative AI model that produces text summaries of legal documents. To comply with responsible AI guidelines, which of the following is the most critical to ensure transparency?

A.Informing users that the summaries are generated by AI
B.Ensuring the model does not reflect biases from training data
C.Achieving high performance on summary quality metrics
D.Guaranteeing the summaries are factually accurate
AnswerA

Disclosing AI-generated content is a key transparency requirement.

Why this answer

Transparency requires that users know when they are interacting with AI. Disclosing that the summaries are AI-generated is fundamental. Accuracy, bias mitigation, and performance are important but do not directly address transparency about AI use.

46
MCQeasy

Which of the following is a key principle of responsible AI according to AWS?

A.Complexity
B.Speed
C.Profitability
D.Transparency
AnswerD

Transparency is one of the six responsible AI principles from AWS.

Why this answer

Option B is correct because AWS emphasizes transparency as a key principle of responsible AI, alongside fairness, accountability, privacy, and robustness. Profitability (A), speed (C), and complexity (D) are not principles.

47
MCQmedium

A company uses Amazon SageMaker Ground Truth to label a dataset for a binary classifier. To reduce labeling bias, which workforce configuration is most appropriate?

A.Automatic labeling with Active Learning
B.Public workforce with no qualification
C.Private workforce of domain experts
D.Vendor managed workforce
AnswerC

Domain experts provide high-quality, consistent labels, reducing bias.

Why this answer

Option D is correct because using an AWS-managed workforce (Mechanical Turk) often involves multiple workers per item, but for sensitive tasks, a private workforce of domain experts ensures consistency and reduces bias. Public (A) may introduce uncontrolled bias. Vendor (B) is not a built-in type.

Automatic (C) is not a workforce type.

48
MCQmedium

A company uses Amazon Comprehend to analyze customer sentiment. They discover the model performs poorly on text with slang from underrepresented groups. What is the most responsible action?

A.Restrict model use to only standard English
B.Remove slang from input before inference
C.Adjust the confidence threshold only for those groups
D.Collect more representative training data including slang
AnswerD

Representative data reduces bias.

Why this answer

Option D is correct because the core principle of responsible AI requires that models be trained on data that is representative of the populations they serve. Amazon Comprehend's sentiment analysis is a supervised machine learning model; its poor performance on slang from underrepresented groups indicates a training data bias. Collecting more representative training data, including that slang, directly addresses the root cause by enabling the model to learn the linguistic patterns of those groups, improving fairness and accuracy without restricting access or masking the problem.

Exam trap

The trap here is that candidates may choose a quick-fix technical workaround (like removing slang or adjusting thresholds) instead of recognizing that the responsible AI approach requires addressing the root cause of bias through data representativeness, which is a core ethical and technical principle tested in the AIF-C01 exam.

How to eliminate wrong answers

Option A is wrong because restricting model use to only standard English is a discriminatory practice that excludes underrepresented groups, violating responsible AI principles of fairness and inclusivity; it does not fix the model's bias but rather avoids it. Option B is wrong because removing slang from input before inference is a data preprocessing workaround that does not address the underlying model bias; it discards valuable linguistic data and can alter the true sentiment of the text, leading to inaccurate results. Option C is wrong because adjusting the confidence threshold only for those groups is a post-hoc tuning that does not correct the model's learned bias; it may reduce false positives but does not improve the model's understanding of slang, and it introduces inconsistent decision boundaries that can be seen as unfair.

49
MCQmedium

Refer to the exhibit. A data scientist used SageMaker Clarify to evaluate bias in a binary classification model predicting loan approval. The exhibit shows bias metrics for the female facet. What does the analysis indicate about the model's impact on the female group?

A.The metrics are within acceptable thresholds, so no action is needed.
B.The model shows a high positive bias toward the female group.
C.The model has a post-training accuracy difference indicating a negative bias against the female group.
D.The model exhibits a pre-training class imbalance but no post-training bias.
AnswerC

Negative AD means lower accuracy for the female group.

Why this answer

The post-training Accuracy Difference (AD) of -0.22 indicates that the model's accuracy for the female group is 22% lower than for the male group, representing a negative bias. Pre-training metrics show some imbalance (CI=0.2) and DPL=-0.15 indicating underrepresentation and lower proportion of positive labels, but the post-training metric directly shows performance disparity. Option A is incorrect because AD is present.

Option C is incorrect because AD is negative. Option D is incorrect because the threshold typically is 0.1 for bias detection, and -0.22 exceeds it.

50
Multi-Selecteasy

A retail company is deploying a machine learning model to analyze customer reviews and predict sentiment. The team wants to follow responsible AI guidelines to ensure fairness, transparency, and accountability. Which TWO actions should the team take? (Choose TWO.)

Select 2 answers
A.Use SageMaker Debugger to optimize training performance.
B.Use SageMaker Clarify to evaluate bias in the training data.
C.Use SageMaker Model Monitor to automatically retrain the model when drift is detected.
D.Use Amazon Rekognition to detect personally identifiable information (PII) in the review text.
E.Use SageMaker Model Cards to document the model's intended use, limitations, and evaluation results.
AnswersB, E

This is a core fairness practice to detect and mitigate bias.

Why this answer

Option A: SageMaker Clarify detects bias in training data, which is a core fairness practice. Option C: SageMaker Model Cards document the model's intended use, limitations, and evaluation results, promoting transparency and accountability. Option B: Model Monitor tracks data drift, not directly a responsible AI practice.

Option D: Rekognition is for image moderation, not relevant for text sentiment. Option E: Debugger optimizes training, not responsible AI.

51
MCQeasy

A retail company is deploying a chatbot to handle customer inquiries. During testing, they notice the chatbot occasionally uses offensive language when responding to certain user inputs. Which responsible AI principle is being violated?

A.Privacy
B.Transparency
C.Fairness
D.Accountability
AnswerC

Fairness ensures AI systems treat all users equitably; offensive language is a fairness issue.

Why this answer

The correct answer is fairness because the chatbot's responses are biased and offensive, indicating a lack of fairness in treating all users appropriately. Accountability relates to ownership, privacy to data protection, and transparency to explainability.

52
Multi-Selectmedium

Which TWO actions help ensure fairness in an AI system deployed on AWS? (Select two.)

Select 2 answers
A.Train the model on a representative dataset
B.Enable AWS CloudTrail for audit
C.Use SageMaker Clarify to detect bias
D.Use a single validation set
E.Encrypt data at rest using AWS KMS
AnswersA, C

Representative data reduces bias.

Why this answer

Options A and B are correct: Diverse training data and bias detection with Clarify. Options C, D, E are not direct fairness actions.

53
MCQmedium

An AI team uses the IAM policy shown in the exhibit to control endpoint creation. Why does this policy support responsible AI?

A.It requires human approval before deploying any model
B.It prevents the use of GPU instances to reduce cost
C.It ensures data capture is enabled for model monitoring
D.It restricts endpoints to only use models built in SageMaker
AnswerC

Data capture allows bias detection and explainability.

Why this answer

Option C is correct because the IAM policy includes a condition that enforces the `DataCaptureConfig.EnableCapture` parameter to be set to `true` when creating a SageMaker endpoint. This ensures that model monitoring data is automatically collected, which is a key practice for responsible AI as it allows continuous monitoring of model performance, bias detection, and drift analysis. Without data capture, teams cannot audit or validate model behavior in production, undermining accountability and transparency.

Exam trap

Cisco often tests the misconception that IAM policies for responsible AI focus on restricting model sources or instance types, when in fact the key mechanism is enforcing observability through data capture for ongoing monitoring.

How to eliminate wrong answers

Option A is wrong because the IAM policy does not include any condition requiring human approval (e.g., using `sts:AssumeRole` with MFA or a separate approval workflow); it only enforces data capture settings. Option B is wrong because the policy does not restrict instance types (e.g., GPU instances like `ml.p3.2xlarge`); it focuses solely on data capture configuration. Option D is wrong because the policy does not restrict endpoints to models built in SageMaker; it allows any model to be deployed as long as data capture is enabled, and there is no condition referencing model origin.

54
MCQhard

A financial services company must comply with regulatory requirements that mandate explainability of credit scoring models. They have deployed a model using SageMaker and need to generate reports showing feature importance for each prediction. Which combination of services should they use to automate this?

A.SageMaker Model Monitor + Amazon QuickSight
B.SageMaker Ground Truth + AWS Lambda
C.SageMaker Clarify + SageMaker Pipelines
D.SageMaker Data Wrangler + SageMaker Studio
AnswerC

Clarify computes SHAP values; Pipelines automates the workflow to generate and store reports.

Why this answer

SageMaker Clarify can compute SHAP values for per-prediction explanations. SageMaker Pipelines can orchestrate the generation of reports on a schedule. Data Wrangler is for data preparation, Model Monitor for drift, Ground Truth for labeling – none generate per-prediction explanation reports automatically.

55
Multi-Selecteasy

Which TWO actions are essential for ensuring accountability in AI systems according to AWS responsible AI guidelines?

Select 2 answers
A.Automate all decisions to ensure consistency
B.Establish clear human oversight and decision-making authority
C.Maintain detailed documentation and version control for models
D.Remove all human review processes to eliminate bias
E.Share raw training data publicly for transparency
AnswersB, C

Human oversight ensures accountability for AI outcomes.

Why this answer

Accountability requires clear ownership and documentation. Establishing human oversight and maintaining version control of models are key. Removing humans reduces accountability.

Using only automated decisions removes oversight. Sharing raw data broadly may violate privacy.

56
MCQmedium

A data scientist is using Amazon SageMaker to train a model and wants to understand the contribution of each feature to individual predictions. Which technique should they use to generate local explanations?

A.Permutation feature importance
B.Global feature importance
C.SHAP values
D.Partial dependence plots
AnswerC

SHAP values decompose predictions into additive feature contributions for each instance.

Why this answer

SHAP values provide per-instance (local) explanations of feature contributions. Permutation feature importance, partial dependence plots, and global feature importance all give global explanations, not local.

57
MCQmedium

A large e-commerce company uses a recommendation system to suggest products to customers. Recently, a data scientist noticed that the model's recommendations for high-value luxury items are predominantly shown to users in affluent zip codes, while users in less affluent areas rarely see these items, even if they have searched for them. The company is concerned about fairness and wants to ensure all customers have equal access to recommendations regardless of location. The current model uses collaborative filtering on historical purchase data. The team needs to modify the system without sacrificing overall recommendation accuracy. Which action best addresses the fairness concern while maintaining performance?

A.Randomly show luxury recommendations to a subset of users regardless of their behavior
B.Remove zip code and any income-correlated features from the training data
C.Add more training data from less affluent areas to balance the dataset
D.Implement a separate recommendation pipeline for luxury items based only on search history
AnswerB

Removing biased features eliminates the source of unfairness in recommendations.

Why this answer

The most effective approach is to ensure the model does not use zip code or any feature correlated with income as a direct or indirect input. This removes the proxy for socioeconomic status. Simply equalizing recommendation frequency artificially may hurt relevance.

Personalizing based on search history is already being done but zip code bias remains. Adding more training data may not help if the bias is in the features.

58
MCQmedium

A financial services company deploys a generative AI chatbot for customer support. They want to prevent the chatbot from generating harmful or misleading information. Which AWS service can help monitor and filter responses?

A.Amazon GuardDuty
B.Amazon Augmented AI (A2I) with human review
C.AWS WAF
D.Amazon Comprehend
AnswerB

A2I can route responses for human review to ensure safety.

Why this answer

Option A is correct: Amazon Augmented AI (A2I) with human review can monitor and filter responses for sensitive content. Option B is wrong: Amazon Comprehend is for understanding, not filtering. Option C is wrong: AWS WAF is for web traffic.

Option D is wrong: Amazon GuardDuty is for security threats.

59
MCQeasy

A retail company uses a recommendation system that occasionally suggests inappropriate products to minors. Which responsible AI practice should be applied?

A.Implement human review of flagged recommendations
B.Rely solely on user feedback to improve
C.Disable the recommendation system entirely
D.Increase the volume of training data
AnswerA

Human-in-the-loop ensures responsible oversight.

Why this answer

The correct practice is to implement human review of flagged recommendations. This aligns with the responsible AI principle of accountability, where automated systems must have oversight mechanisms to catch and correct inappropriate outputs, especially when minors are involved. Human-in-the-loop (HITL) validation ensures that edge cases or subtle context (e.g., age-inappropriate product suggestions) are caught before they reach end users, rather than relying solely on automated filters or feedback loops.

Exam trap

AWS often tests the misconception that more data or automation alone can solve fairness and safety issues, when in fact responsible AI requires explicit governance mechanisms like human oversight for high-stakes or vulnerable-user scenarios.

How to eliminate wrong answers

Option B is wrong because relying solely on user feedback to improve is reactive and can expose minors to harm before any corrective action is taken; feedback loops are slow and may not capture subtle or rare inappropriate recommendations. Option C is wrong because disabling the recommendation system entirely is an extreme, non-scalable response that eliminates business value and does not teach the system to behave responsibly; responsible AI aims to mitigate harm, not abandon functionality. Option D is wrong because increasing the volume of training data does not inherently address the problem of inappropriate recommendations; if the training data itself contains biased or unlabeled age-sensitive content, more data can amplify the issue rather than fix it.

60
MCQhard

Refer to the exhibit. A team is creating an IAM policy for a SageMaker notebook user. The user needs to access training data in an S3 bucket and create models. Which responsible AI concern is most relevant to this policy?

A.The policy does not enforce encryption for the notebook.
B.The policy does not restrict which S3 buckets the user can read.
C.The policy does not include a condition for model explainability.
D.The policy grants overly broad permissions, violating the principle of least privilege.
AnswerD

Allowing CreateModel and CreateNotebookInstance on all resources can lead to misuse.

Why this answer

Option C is correct. The policy grants broad access (sagemaker:CreateModel and sagemaker:CreateNotebookInstance on all resources) without restrictions. This could allow a user to create models using any data or expose the notebook.

The least privilege principle is violated, leading to potential unintended model creation or data exposure. Options A and B are less directly related; D is about explainability.

61
MCQmedium

A company uses an AI system to screen job applications. The system was trained on resumes from previous hires, which predominantly came from a specific demographic. As a result, the system may unfairly filter out qualified candidates from other backgrounds. Which responsible AI practice should the company implement?

A.Implement bias detection metrics and monitor outcomes by demographic groups
B.Focus solely on improving the model's precision and recall
C.Defer all screening decisions to a human recruiter
D.Increase the size of the training dataset without regard to demographic composition
AnswerA

Bias detection and monitoring help identify and correct unfair outcomes.

Why this answer

To mitigate bias, the company should measure and monitor the system's impact across demographic groups. This aligns with fairness metrics. Using more data without addressing bias may not help.

Relying on human review is good but does not guarantee systematic fairness. Focusing only on performance ignores fairness.

62
MCQeasy

A financial services company uses Amazon Rekognition to verify customer identities. To ensure responsible AI practices, which measure should the company prioritize?

A.Use only black-box models to protect intellectual property
B.Increase model complexity to improve accuracy
C.Minimize the amount of training data collected
D.Regularly audit the model for demographic bias
AnswerD

Bias audits are essential for fairness.

Why this answer

Option D is correct because regularly auditing the model for demographic bias is a core responsible AI practice, especially for identity verification systems where biased outcomes could lead to unfair treatment of certain customer groups. Amazon Rekognition's facial analysis and comparison features must be tested across diverse demographics to ensure equitable performance, as bias can arise from imbalanced training data or algorithmic artifacts.

Exam trap

The trap here is that candidates may confuse 'responsible AI' with generic model optimization (like increasing accuracy or reducing data), but the exam specifically tests the principle of fairness through bias auditing and transparency.

How to eliminate wrong answers

Option A is wrong because using only black-box models contradicts responsible AI principles; explainability and transparency are critical for auditing bias and ensuring fairness, and black-box models obscure how decisions are made, making it harder to detect issues. Option B is wrong because increasing model complexity does not inherently improve accuracy and can amplify bias or reduce interpretability; responsible AI prioritizes balanced performance and fairness over raw accuracy. Option C is wrong because minimizing training data can exacerbate bias by underrepresenting certain demographic groups, leading to poor generalization and unfair outcomes; responsible AI requires diverse, representative datasets.

63
Multi-Selecthard

Which TWO of the following are key components of a responsible AI governance framework?

Select 2 answers
A.Develop and enforce AI ethics policies and standards
B.Focus solely on compliance with legal regulations
C.Minimize human involvement in AI lifecycle decisions
D.Conduct regular bias and fairness impact assessments
E.Deploy AI models as black boxes to avoid scrutiny
AnswersA, D

Policies provide the foundation for governance.

Why this answer

Option A is correct because a responsible AI governance framework must include the development and enforcement of AI ethics policies and standards to ensure alignment with societal values, fairness, and accountability. These policies guide the design, deployment, and monitoring of AI systems, embedding ethical principles such as transparency, privacy, and non-discrimination into the AI lifecycle. Without such policies, organizations risk deploying AI that violates ethical norms or regulatory expectations.

Exam trap

Cisco often tests the distinction between mere legal compliance and comprehensive ethical governance, trapping candidates who think that meeting regulatory requirements alone constitutes responsible AI, while ignoring proactive fairness and transparency measures.

64
MCQeasy

Refer to the exhibit. A developer is reviewing CloudWatch Logs for a deployed model and notices the same input appears multiple times with slightly different probabilities. What responsible AI concern does this pattern suggest?

A.The model is overfitting to the training data.
B.The model is not robust; it produces inconsistent predictions for the same input.
C.The model is exhibiting bias against a demographic group.
D.The input data is drifting from the training distribution.
AnswerB

Identical inputs should yield identical outputs; variation indicates instability.

Why this answer

Option B is correct. Repeated identical inputs with different predictions indicate model instability (lack of robustness). This could be due to randomness in the model or adversarial conditions.

Option A is irrelevant; C is possible but not the primary concern; D is about data drift, but input is same.

65
MCQhard

A healthcare company uses Amazon SageMaker to train a model that predicts patient readmission risk based on electronic health records (EHRs) stored in Amazon HealthLake. The training dataset contains 2 million records from the past three years, with a significant gender imbalance: 70% male and 30% female. The model achieved high overall accuracy, but further analysis using SageMaker Clarify revealed that the precision for female patients is 0.65 while for male patients it is 0.88. Additionally, the model's false positive rate for female patients is significantly higher. The company must comply with healthcare regulations that require fairness and non-discrimination. The data science team has already used SageMaker Data Wrangler for initial preprocessing and SageMaker Clarify for bias detection. They need to take immediate action to mitigate the bias before deploying to production. Which course of action should the team take?

A.Use SageMaker Clarify's bias mitigation feature to apply reweighing techniques and retrain the model with adjusted sample weights.
B.Use SageMaker Clarify to generate SHAP values and adjust the model's feature importance by removing biased features.
C.Use SMOTE (Synthetic Minority Oversampling Technique) to balance the training dataset before retraining.
D.Use SageMaker Model Monitor to detect feature drift and automatically retrain the model with updated data.
AnswerA

This directly mitigates bias by reweighting training samples to reduce disparity.

Why this answer

The correct answer is to use SageMaker Clarify's built-in bias mitigation technique (reweighing) as it directly addresses the disparity by adjusting sample weights during training. Option A: Model Monitor is for monitoring drift, not mitigation. Option B: SHAP values explain predictions but do not change model behavior.

Option C: SMOTE addresses class imbalance but not fairness in terms of group accuracy disparity; it may even worsen bias. Therefore, D is the best choice.

66
MCQhard

A bank uses an AI system to detect fraudulent transactions. The model has high precision but low recall for small transactions, potentially missing fraud. Which approach aligns with responsible AI?

A.Send all flagged transactions to customers for confirmation
B.Focus only on precision to minimize false positives
C.Tune the model to achieve an acceptable balance between recall and precision
D.Increase the detection threshold to reduce false positives
AnswerC

Balancing metrics is a responsible approach.

Why this answer

Option C is correct because responsible AI requires balancing competing objectives like precision and recall to align with ethical principles and business needs. In fraud detection, high precision with low recall means many fraudulent transactions are missed, which can lead to significant financial losses and erode customer trust. Tuning the model to achieve an acceptable trade-off ensures that the system is both effective and fair, minimizing harm while maintaining operational viability.

Exam trap

Cisco often tests the misconception that increasing the detection threshold improves model performance overall, when in fact it only reduces false positives at the cost of lowering recall, which can be detrimental in high-stakes applications like fraud detection.

How to eliminate wrong answers

Option A is wrong because sending all flagged transactions to customers for confirmation shifts the burden to users, degrades user experience, and may not be scalable or timely for real-time fraud detection, nor does it address the underlying model imbalance. Option B is wrong because focusing only on precision ignores the critical need to catch actual fraud (recall), which can result in substantial financial losses and violates the responsible AI principle of beneficence. Option D is wrong because increasing the detection threshold reduces false positives but further lowers recall, worsening the problem of missed fraud and contradicting the goal of responsible AI.

67
MCQmedium

Refer to the exhibit. An AWS CloudTrail log shows the creation of an IAM policy for a SageMaker execution role. Which responsible AI concern does this configuration raise?

A.Insufficient training data
B.Lack of least privilege access control
C.Violation of data residency requirements
D.Absence of model monitoring
AnswerB

The wildcard resource exposes all endpoints to potential misuse.

Why this answer

The policy allows sagemaker:InvokeEndpoint on all resources (*), violating the principle of least privilege. This could allow the role to invoke any SageMaker endpoint, potentially leading to unauthorized inferences. Model monitoring, training data, and data residency are not addressed by this log entry.

68
Multi-Selecteasy

A data science team is building a resume screening model and wants to ensure it does not exhibit gender bias. Which TWO actions are most effective for mitigating bias? (Choose TWO.)

Select 2 answers
A.Apply adversarial debiasing techniques during training.
B.Use a more complex deep learning model.
C.Remove the gender attribute and all correlated features from the dataset.
D.Regularly audit model predictions for disparate impact across genders.
E.Ensure the training dataset has equal numbers of male and female candidates.
AnswersA, D

Adversarial debiasing reduces sensitivity to protected attributes.

Why this answer

Regularly auditing predictions for disparate impact and applying adversarial debiasing are proven techniques. Simply removing attributes may not eliminate bias due to correlated proxies. Balancing datasets is helpful but not sufficient alone.

Complex models do not guarantee fairness.

69
MCQhard

A financial services company uses a machine learning model to automatically reject credit card transactions suspected of fraud. The model was trained on transaction data from the past two years. Over the last three months, the model's false positive rate has increased significantly, causing legitimate transactions to be declined and leading to customer complaints. The company needs to restore the model's accuracy quickly. Initial analysis shows that the distribution of transaction amounts and locations has shifted compared to the training period. The data science team is under pressure to deploy an update within a week. Which approach should they take to most effectively address the issue while adhering to responsible AI guidelines?

A.Deploy a rule-based system with fixed rules for fraud detection
B.Adjust the decision threshold to reduce false positives without retraining
C.Retrain the model using only the most recent three months of transaction data and evaluate on current distribution
D.Build an ensemble model that combines predictions from the old model and a new model trained on recent data
AnswerC

Retraining on recent data adapts to drift and is straightforward.

Why this answer

The most effective approach is to retrain the model using recent data (last three months) to adapt to the distribution shift, and carefully evaluate for any new biases that may emerge. This directly addresses the drift. Simply adjusting the threshold may not capture new fraud patterns.

Using an ensemble of old and recent models could be complex and may not fully adapt. Deploying a simple rule-based system would be a step backward in capability.

70
MCQeasy

A company uses Amazon Rekognition for facial analysis. They want to ensure the model doesn't exhibit bias based on skin tone. What should they do?

A.Ensure the training dataset includes diverse skin tones
B.Apply data augmentation to increase dataset size
C.Use a larger neural network
D.Use a pre-trained model from AWS Marketplace
AnswerA

Balanced representation mitigates bias.

Why this answer

Option D is correct: Training on diverse data reduces bias. Option A is wrong: Network size does not address bias. Option B is wrong: Data augmentation does not guarantee diversity.

Option C is wrong: Pre-trained models may have inherent bias.

71
MCQeasy

A startup is developing a mobile app that uses facial recognition to verify user identity for account access. The app is intended for a global audience, but the training data predominantly includes images of light-skinned individuals. During beta testing, users with darker skin tones report frequent verification failures, while light-skinned users have a high success rate. The startup wants to release the app soon and needs to address this fairness issue without delaying the launch too much. The team has limited resources. Which approach should they take to most effectively mitigate the bias while meeting the launch timeline?

A.Apply a post-processing rule to increase acceptance rate for users with darker skin tones
B.Lower the similarity threshold for all users to improve acceptance rates
C.Defer verification for users with darker skin tones to manual human review
D.Collect more diverse training data and augment the existing dataset, then retrain the model
AnswerD

Adding diverse data addresses the root cause of bias.

Why this answer

The most effective approach is to collect additional training data representing diverse skin tones and augment the dataset, then retrain the model. This directly addresses the data imbalance. Applying a post-processing rule without retraining may not fix the underlying model bias.

Deferring to humans is a temporary workaround and does not scale. Reducing the threshold for all users could increase false positives and may not be acceptable.

72
MCQeasy

A company develops a chatbot using Amazon Lex. To ensure transparency, what should the chatbot do when it cannot answer a question?

A.Remain silent and wait for the next input
B.Provide a random answer from a predefined list
C.Clearly state that it cannot answer and offer alternatives
D.Automatically escalate all unanswered questions to a human
AnswerC

Honest communication builds trust.

Why this answer

Option C is correct because responsible AI guidelines, including those from AWS for Amazon Lex, require that when a chatbot cannot answer a question, it should clearly state its inability and offer alternatives (e.g., rephrasing the query or providing related topics). This maintains transparency and user trust, aligning with the 'Explainability' principle under the Guidelines for Responsible AI.

Exam trap

AWS often tests the misconception that a chatbot should always escalate or remain passive when it cannot answer, but the correct approach under responsible AI is to acknowledge the limitation and offer alternatives, not to hide or mislead.

How to eliminate wrong answers

Option A is wrong because remaining silent and waiting for the next input violates transparency and can confuse users, as it provides no feedback or guidance. Option B is wrong because providing a random answer from a predefined list is deceptive and can mislead users, undermining the principle of honesty and accountability in AI. Option D is wrong because automatically escalating all unanswered questions to a human is inefficient and not always necessary; the chatbot should first attempt to offer alternatives or clarify before escalation, as per responsible AI practices.

73
MCQhard

A machine learning team is building a credit risk model and discovers that the training data has a significant imbalance in loan approval rates between two demographic groups. They decide to reweight the training samples using a preprocessing technique. Which SageMaker Clarify feature can help compute the appropriate sample weights to achieve demographic parity?

A.Clarify preprocessing (reweighting)
B.Clarify post-training bias metrics
C.Model Monitor bias drift
D.Clarify explainability (SHAP)
AnswerA

Clarify provides a preprocessing transformation that reweights data to meet fairness constraints.

Why this answer

Option A is correct because SageMaker Clarify's preprocessing transforms include a reweighting method that assigns weights to instances to adjust for fairness. Post-training (B, C) are not preprocessing. Monitoring (D) is post-deployment.

74
MCQmedium

A financial services company is deploying a generative AI chatbot to assist customers with account inquiries. The company wants to ensure the chatbot does not generate biased or harmful responses. Which combination of AWS services and practices should the company implement to monitor and mitigate these risks?

A.Configure the chatbot to use a pre-trained model from SageMaker JumpStart and disable logging to avoid storing sensitive customer data.
B.Use Amazon Rekognition to analyze chat logs for biased language and automatically block responses with a confidence score above 90%.
C.Use Amazon SageMaker Clarify to detect bias in model outputs and implement a human-in-the-loop workflow with Amazon A2I to review flagged responses.
D.Deploy Amazon Lex with built-in sentiment analysis to detect negative customer emotions and automatically escalate to a human agent.
AnswerC

SageMaker Clarify provides bias detection, and Amazon A2I enables human review of model outputs to ensure responsible AI.

Why this answer

Option C is correct because Amazon SageMaker Clarify is specifically designed to detect bias in machine learning models and their outputs, while Amazon Augmented AI (A2I) enables a human-in-the-loop workflow to review flagged responses. This combination directly addresses the requirement to monitor and mitigate biased or harmful responses from a generative AI chatbot, ensuring responsible AI practices.

Exam trap

Cisco often tests the distinction between services that detect customer sentiment (like Amazon Comprehend or Lex sentiment analysis) versus services that detect bias in model outputs (like SageMaker Clarify), leading candidates to mistakenly choose sentiment analysis options for bias detection.

How to eliminate wrong answers

Option A is wrong because disabling logging prevents the monitoring and auditing necessary to detect biased or harmful responses, and using a pre-trained model from SageMaker JumpStart without additional safeguards does not mitigate bias. Option B is wrong because Amazon Rekognition is an image and video analysis service, not designed for analyzing text chat logs for biased language; it cannot process text-based conversations. Option D is wrong because Amazon Lex's built-in sentiment analysis detects customer emotions but does not detect bias or harmful content in the chatbot's responses, and escalation to a human agent does not proactively mitigate biased outputs.

75
Multi-Selecthard

Which THREE considerations are essential for ensuring responsible AI in a model that predicts employee performance? (Choose 3)

Select 3 answers
A.Minimize the number of features to reduce cost
B.Publish the model's predictions publicly for transparency
C.Incorporate human review before final decisions
D.Ensure employee data privacy and consent
E.Test for bias across demographic groups
AnswersC, D, E

Human-in-the-loop reduces automation bias.

Why this answer

Option C is correct because responsible AI frameworks, such as those outlined by AWS, mandate human-in-the-loop (HITL) oversight for high-stakes decisions like employee performance predictions. This ensures that automated outputs are reviewed for context, fairness, and potential errors before affecting employment outcomes, aligning with the AIF-C01 domain's emphasis on human accountability.

Exam trap

Cisco often tests the misconception that transparency means public disclosure of all model outputs, whereas in responsible AI, transparency refers to explainability and auditability of the model's logic, not exposing sensitive predictions.

Page 1 of 2 · 77 questions totalNext →

Ready to test yourself?

Try a timed practice session using only Guidelines for Responsible AI questions.