AI0-001 Exam Questions and Answers

A company deploys an AI model to predict equipment failure. The model performs well on historical data but fails to generalize to new data from a different factory. Which concept best describes this issue?

Transfer learning

Underfitting

Overfitting

The model fits training data too closely and fails on new data.

Bias-variance tradeoff

Why: Option C (Overfitting) is correct because the model learned patterns specific to the historical data from the original factory, including noise and factory-specific nuances, rather than generalizable features. When applied to new data from a different factory, those learned patterns do not hold, causing poor performance. This is the classic symptom of overfitting: high accuracy on training data but low accuracy on unseen data.

A data scientist trains a linear regression model to predict house prices. The model has high bias and low variance. Which action would most likely reduce bias?

Apply L2 regularization

Increase the training dataset size

Add polynomial features

Adding complexity reduces bias but may increase variance.

Remove irrelevant features

Why: High bias indicates the model is underfitting the data, meaning it is too simple to capture the underlying patterns. Adding polynomial features increases model complexity by introducing non-linear terms, which allows the linear regression model to better fit the training data and thus reduce bias.

An AI engineer trains a deep learning model for image classification. After training, the training accuracy is 99% but validation accuracy is 85%. Which technique would best address this discrepancy?

Increase data augmentation

Decrease the learning rate

Increase the number of layers

Add dropout layers

Dropout reduces overfitting by preventing co-adaptation of neurons.

Why: The high training accuracy (99%) and lower validation accuracy (85%) indicate overfitting, where the model memorizes training data but fails to generalize. Dropout layers randomly deactivate neurons during training, forcing the network to learn more robust features and reducing overfitting. This technique directly addresses the discrepancy by improving validation performance without sacrificing training capacity.

A company implements a chatbot using a rule-based system. Users complain the chatbot cannot handle new queries. Which AI approach should be considered to improve flexibility?

Expert system

Natural language processing (NLP)

Robotic process automation

Machine learning

ML enables the system to learn patterns from data.

Why: Machine learning (ML) enables a chatbot to learn from new data and adapt to unseen queries, unlike a static rule-based system. By training on historical conversations, an ML model can generalize patterns and handle novel inputs without requiring explicit rules for every scenario.

An AI model for detecting fraudulent transactions has high precision but low recall. Which business impact is most likely?

The model has no impact on fraud detection

The model detects all fraudulent transactions

Many fraudulent transactions go undetected

Low recall indicates a high number of false negatives.

Many legitimate transactions are flagged as fraud

Why: High precision means that when the model flags a transaction as fraudulent, it is very likely correct. However, low recall indicates that the model misses a significant proportion of actual fraudulent transactions. Therefore, the most likely business impact is that many fraudulent transactions go undetected, leading to financial losses.

A data scientist splits a dataset into training (80%) and test (20%). After training, the model achieves 95% accuracy on training and 60% on test. Which step should the data scientist take first?

Collect more data

Use cross-validation

Apply regularization

Regularization penalizes large weights, reducing overfitting.

Increase model complexity

Why: The model shows high training accuracy (95%) but significantly lower test accuracy (60%), which is a classic sign of overfitting. Regularization (Option C) directly addresses overfitting by adding a penalty term to the loss function (e.g., L1 or L2 regularization), discouraging the model from learning overly complex patterns that do not generalize. This is the first step because it targets the core issue without requiring additional data or increasing complexity.

Want more AI Concepts and Foundations practice?

All Machine Learning and Deep Learning questions

Domain 2: Machine Learning and Deep Learning

A data scientist is building a classification model to detect fraudulent transactions. The dataset is highly imbalanced with only 1% fraudulent cases. Which approach should the scientist use to evaluate model performance most effectively?

F1 score

F1 score is the harmonic mean of precision and recall, providing a balanced measure for imbalanced datasets.

Accuracy

Recall

Precision

Why: In highly imbalanced datasets like fraud detection (1% positive class), accuracy is misleading because a model that predicts all transactions as legitimate would achieve 99% accuracy yet fail to detect any fraud. The F1 score (harmonic mean of precision and recall) is the most effective metric because it balances both false positives and false negatives, providing a single score that reflects the model's ability to correctly identify the minority class without being skewed by class imbalance.

A machine learning team is deploying a model that predicts customer churn. They notice that the model's predictions are highly sensitive to small changes in input features, leading to inconsistent outputs. Which technique should the team apply to improve model stability?

Increase learning rate

Feature scaling

Regularization

Regularization adds a penalty for large weights, reducing overfitting and sensitivity to input variations.

Cross-validation

Why: Regularization (Option C) is the correct technique because it adds a penalty term to the loss function (e.g., L1 or L2 regularization), which constrains the model's weights. This reduces variance and prevents overfitting to noise in the training data, directly addressing the high sensitivity to small input changes (brittleness). By shrinking coefficients, regularization forces the model to learn more general patterns, improving stability and consistency in predictions.

A deep learning model for image classification is overfitting the training data. The team has already tried data augmentation and dropout. Which additional technique should they implement to reduce overfitting?

Batch normalization

Increase number of epochs

Gradient clipping

Early stopping

Early stopping monitors validation loss and stops training when it starts to increase, reducing overfitting.

Why: Early stopping (Option D) is the correct additional technique because it halts training when validation performance stops improving, directly preventing the model from memorizing noise in the training data. Since data augmentation and dropout are already in use, early stopping provides a complementary regularization effect by limiting the number of training iterations before overfitting occurs.

A company wants to deploy a machine learning model that requires continuous learning as new data arrives. The model must be able to adapt to changing patterns without retraining from scratch. Which approach should be used?

Transfer learning

Online learning

Online learning updates the model incrementally, allowing adaptation to new data without full retraining.

Batch learning

Unsupervised learning

Why: Online learning (also called incremental learning) updates the model incrementally as each new data point arrives, without requiring full retraining. This makes it ideal for scenarios where data arrives continuously and patterns shift over time, as the model can adapt its parameters on the fly.

A data engineer is designing a pipeline to train a linear regression model on a dataset with 10 million rows and 50 features. The dataset fits in memory. Which approach should the engineer use to train the model efficiently?

Normal equation

Batch gradient descent

Principal component analysis

Stochastic gradient descent

SGD updates weights per sample, making it efficient for large datasets.

Why: Stochastic gradient descent (SGD) is the most efficient approach for training a linear regression model on a dataset with 10 million rows and 50 features because it updates the model parameters using only one training example per iteration, leading to much faster convergence per epoch compared to batch methods. Since the dataset fits in memory, SGD can still be implemented efficiently without the overhead of loading data in batches from disk, and it scales well to large datasets where the normal equation or batch gradient descent would be computationally prohibitive.

A data scientist is training a convolutional neural network (CNN) for object detection. The training loss decreases rapidly but then plateaus at a high value, and the validation loss starts increasing. Which action should the scientist take to improve the model?

Increase the learning rate

Increase the number of epochs

Reduce the model complexity

Reducing complexity (e.g., fewer layers) can reduce overfitting and improve validation performance.

Add more convolutional layers

Why: The training loss decreasing rapidly then plateauing at a high value while validation loss increases is classic overfitting. Reducing model complexity (Option C) directly addresses overfitting by decreasing the number of parameters or applying regularization (e.g., dropout, L2), which forces the network to learn more generalizable features rather than memorizing noise in the training data.

Want more Machine Learning and Deep Learning practice?

All AI Models and Data Engineering questions

Domain 3: AI Models and Data Engineering

A data scientist is preparing a dataset for training a classification model. The dataset contains 10,000 records with a binary target variable where 9,500 belong to class A and 500 belong to class B. Which technique should the scientist use to address the class imbalance?

SMOTE (Synthetic Minority Oversampling Technique)

SMOTE creates synthetic minority samples to balance classes.

Random undersampling of class A

Adding Gaussian noise to class B

Principal Component Analysis (PCA)

Why: SMOTE is the correct technique because it generates synthetic samples for the minority class (class B) by interpolating between existing minority instances, effectively balancing the dataset without losing information. This approach avoids the overfitting risk of simple oversampling and the information loss of undersampling, making it ideal for a 19:1 imbalance ratio.

An engineer is building a regression model to predict housing prices. The dataset includes features such as square footage, number of bedrooms, and year built. The engineer notices that the square footage values range from 500 to 10,000, while the number of bedrooms ranges from 1 to 5. Which preprocessing step is most critical before training a gradient descent-based model?

Use k-fold cross-validation

Apply log transformation to all features

Normalize or standardize the features

Scaling improves convergence of gradient descent.

One-hot encode the features

Why: Gradient descent-based models are sensitive to the scale of input features because they update weights proportionally to the gradient, which is influenced by feature magnitudes. With square footage ranging 500–10,000 and bedrooms 1–5, the larger feature will dominate the gradient, causing slow or unstable convergence. Normalizing or standardizing (e.g., Z-score or min-max scaling) ensures all features contribute equally, leading to faster and more reliable training.

A machine learning team is deploying a sentiment analysis model for customer reviews. The model was trained on reviews from an e-commerce site but will be used for a social media platform. The team observes a drop in accuracy. Which concept best explains this issue?

Data drift

The distribution of reviews differs between e-commerce and social media.

Concept drift

Bias-variance tradeoff

Overfitting

Why: Data drift occurs when the statistical properties of the input data change between the training and production environments. Here, the model was trained on e-commerce reviews but is now processing social media posts, which have different vocabulary, tone, and structure, causing a mismatch in the input distribution and leading to accuracy degradation.

A data engineer needs to design a data pipeline for a real-time fraud detection system. The system requires low-latency processing of streaming transactions. Which architecture is most appropriate?

Stream processing with Apache Kafka and Flink

Stream processing provides low-latency real-time analysis.

Data lake with Apache Spark

Batch processing with Apache Hadoop

Microservices architecture with REST APIs

Why: Apache Kafka provides a distributed, fault-tolerant event streaming platform that ingests high-throughput transaction data with low latency, while Apache Flink offers true stream processing with exactly-once semantics and sub-second event-time processing. Together, they enable real-time fraud detection by analyzing transactions as they arrive, without the delays inherent in batch or micro-batch approaches.

A team is training a deep learning model for image classification. The training loss decreases rapidly but validation loss starts increasing after a few epochs. Which regularization technique should be applied to mitigate this issue?

Data augmentation

L2 regularization

Early stopping

Early stopping prevents overfitting by stopping training when validation loss starts to rise.

Dropout

Why: Option C is correct because early stopping halts training when validation loss increases, preventing overfitting. Option A is wrong because L2 regularization penalizes large weights but doesn't stop training. Option B is wrong because dropout randomly drops neurons during training, but early stopping directly addresses the symptom. Option D is wrong because data augmentation increases data diversity, but the issue is overfitting due to training too long.

An organization needs to store sensitive customer data for training a machine learning model. The data must be encrypted at rest and in transit, and access must be audited. Which combination of practices should be implemented?

Use TLS for transfer, AES-256 for storage, and AWS CloudTrail for auditing

These provide encryption and auditing.

Use FTP for transfer, AES-128 for storage, and manual log review

Use SSH for transfer, store data in a database, and enable access logs

Use MD5 for hashing, store data in plaintext, and enable server logs

Why: Option A is correct because it combines TLS (Transport Layer Security) for encrypting data in transit, AES-256 for strong encryption at rest, and AWS CloudTrail for auditing API-level access. TLS ensures confidentiality and integrity during transmission, AES-256 provides robust symmetric encryption for stored data, and CloudTrail logs all AWS API calls for compliance and audit trails. This triad satisfies the requirements of encryption in transit, at rest, and audited access.

Want more AI Models and Data Engineering practice?

All AI Implementation and Operations questions

Domain 4: AI Implementation and Operations

A company deployed a chatbot using a pre-trained language model. Users report that the chatbot provides incorrect answers to domain-specific questions. Which approach should the AI team prioritize to improve accuracy without retraining the entire model?

Fine-tune the model on a curated dataset of domain-specific conversations.

Fine-tuning adapts the model to the domain with less data and compute.

Increase the temperature parameter to reduce randomness.

Collect more general training data and retrain the model from scratch.

Roll back to a previous version of the model that was more accurate.

Why: Fine-tuning on a curated domain-specific dataset is the most efficient way to improve accuracy for specialized queries without retraining the entire model. It adjusts the model's weights using a smaller, targeted dataset, preserving general language understanding while adapting to domain terminology and context.

An AI system misclassifies rare but critical events. The team considers using synthetic data. Which consideration is MOST important for ensuring the synthetic data improves performance on real rare events?

The synthetic data should include a wide variety of events, even if not realistic.

The synthetic data should be generated using an unsupervised generative model.

The synthetic data should accurately represent the distribution and features of real rare events.

Fidelity to real event characteristics is crucial for generalization.

The synthetic data should be as large as possible to cover all possibilities.

Why: Option C is correct because synthetic data must faithfully replicate the distribution and feature space of real rare events to enable the model to learn meaningful decision boundaries. If the synthetic data does not capture the true underlying patterns—such as specific sensor readings or transaction anomalies—the model will fail to generalize to actual rare events, defeating the purpose of augmentation.

A data scientist trains a regression model and notices the training loss is low but validation loss is high. Which technique should be applied FIRST to address this issue?

Increase the learning rate.

Add more layers to the neural network.

Increase the size of the training dataset.

Apply L1 or L2 regularization to the model.

Regularization penalizes large weights, reducing overfitting.

Why: The scenario describes overfitting, where the model memorizes the training data but fails to generalize to unseen data. Applying L1 or L2 regularization (Option D) is the correct first step because it adds a penalty to the loss function for large weights, discouraging complexity and reducing overfitting without requiring additional data or architectural changes.

A company deploys an AI model for loan approval. The model shows bias against a protected group. The team decides to use adversarial debiasing. What is the PRIMARY advantage of this approach?

It guarantees the model's predictions are private.

It reduces bias while preserving predictive performance by learning representations that are invariant to sensitive attributes.

This is the core benefit of adversarial debiasing.

It is simpler to implement than pre-processing techniques.

It ensures equal approval rates across all groups.

Why: Adversarial debiasing is an in-processing technique that trains a primary model to predict the target (e.g., loan approval) while simultaneously training an adversary to predict the sensitive attribute from the model's learned representations. The primary model is penalized when the adversary succeeds, forcing it to learn representations that are invariant to the sensitive attribute. This reduces bias while preserving predictive performance because the model retains the ability to learn task-relevant patterns that are not correlated with the protected attribute.

An AIOps platform monitors server metrics and triggers alerts. The team notices too many false positives. Which adjustment should be made to the anomaly detection model?

Use a more complex model to better fit the data.

Shorten the observation window to detect anomalies faster.

Increase the training data to include more normal patterns.

Raise the anomaly score threshold for triggering alerts.

A higher threshold means only more extreme deviations trigger alerts.

Why: Raising the anomaly score threshold (Option D) directly reduces false positives by requiring a higher deviation from normal behavior before an alert is triggered. In AIOps platforms, the anomaly score is a numeric value (e.g., 0–100) that quantifies how unusual a metric is; a higher threshold means only more extreme deviations generate alerts, filtering out minor fluctuations that were incorrectly flagged.

A team deploys a machine learning model as a REST API. They want to monitor model drift. Which metric is MOST appropriate for detecting drift in the input data distribution?

Model accuracy on a recent holdout set.

Population stability index (PSI) comparing training and recent data.

PSI directly quantifies distribution shift.

F1 score on the training data.

Root mean squared error (RMSE) on test data.

Why: Population stability index (PSI) is the most appropriate metric for detecting drift in input data distribution because it directly measures the shift between the training data distribution and the recent production data distribution. PSI is calculated by binning both distributions and computing the sum of (proportion in bin of recent data minus proportion in bin of training data) times the natural log of their ratio, making it sensitive to changes in feature distributions without requiring ground truth labels.

Want more AI Implementation and Operations practice?

All AI Security, Ethics and Governance questions

Domain 5: AI Security, Ethics and Governance

A healthcare organization deploys an AI system to analyze medical images and detect anomalies. During a routine audit, the security team discovers that the AI model occasionally returns results that include data from patients who have opted out of data sharing. Which security control should be implemented to prevent this violation?

Apply data anonymization techniques to the training dataset.

Anonymization removes personally identifiable information, ensuring that the model cannot output data linked to specific patients.

Implement role-based access control (RBAC) on the AI model's inference API.

Use differential privacy during model training.

Encrypt the training data at rest and in transit.

Why: Option B is correct because data anonymization ensures that patient identities are removed from training data, preventing re-identification of opt-out patients. Option A is incorrect because access control does not address data already in the model. Option C is incorrect because encryption protects data in transit/rest but does not prevent data leakage from model outputs. Option D is incorrect because differential privacy adds noise to queries but does not directly remove specific patient data from model results.

A financial institution is implementing an AI-based fraud detection system. The compliance officer is concerned about potential bias in the model that could lead to unfair treatment of certain customer groups. Which governance practice should be prioritized to address this concern?

Increase the diversity of the training data by collecting more samples from underrepresented groups.

Schedule regular bias audits using fairness metrics.

Bias audits with metrics like demographic parity can detect unfair treatment and guide mitigation.

Retrain the model every month with the latest transaction data.

Use SHAP values to provide explanations for each prediction.

Why: Regular bias audits using fairness metrics (Option B) are the correct governance practice because they provide a systematic, quantitative method to detect and measure disparate impact across protected groups. Unlike simply collecting more data, audits directly evaluate model outputs for statistical parity, equal opportunity, or other fairness definitions, enabling the institution to identify and remediate bias proactively. This aligns with regulatory expectations for ongoing monitoring and accountability in AI governance.

A company uses a machine learning model to recommend products to customers. The marketing team notices that the model is recommending high-profit items more frequently than low-profit items, even when customers are likely to prefer the latter. This behavior is causing customer dissatisfaction. Which approach would best align the model with customer preferences while maintaining profitability?

Train the model with a loss function that weights profit more heavily than customer satisfaction.

Use a multi-objective optimization framework to balance profit and customer satisfaction.

Multi-objective optimization explicitly considers trade-offs between competing objectives, leading to more balanced recommendations.

Adjust the model's hyperparameters to reduce the influence of profit features.

Remove profit data from the training set and only use customer preference data.

Why: Option D is correct because multi-objective optimization allows the model to balance multiple goals (e.g., profit and customer satisfaction) explicitly. Option A is incorrect because it still prioritizes profit, which may not address satisfaction. Option B is incorrect because it completely removes profit, which may harm business goals. Option C is incorrect because it only adjusts profit weights without true multi-objective framework.

An AI system used for resume screening is found to consistently rank male candidates higher than female candidates with similar qualifications. The HR director wants to remediate this bias without significantly reducing model accuracy. Which technique should be applied?

Apply adversarial debiasing to the model during training.

Adversarial debiasing reduces bias by training the model to be unable to predict protected attributes from its predictions.

Use a random selection of candidates to avoid bias.

Remove the gender feature from the dataset and retrain.

Collect more training data from underrepresented groups.

Why: Adversarial debiasing is the correct technique because it directly addresses bias during training by introducing an adversarial network that attempts to predict the protected attribute (e.g., gender) from the model's predictions. The main model is trained to maximize accuracy while minimizing the adversary's ability to infer the protected attribute, thereby reducing bias without a significant drop in predictive performance. This approach is more effective than simple feature removal or data collection because it actively learns to remove correlations between the protected attribute and the output.

A company is developing an AI chatbot for customer service. The legal team is concerned that the chatbot might generate responses that violate privacy regulations. Which governance mechanism should be implemented to mitigate this risk?

Use explainable AI techniques to understand why the chatbot generates certain responses.

Encrypt all chatbot conversations at rest and in transit.

Implement a human-in-the-loop review process for high-risk responses.

Human review can catch and block responses that violate privacy regulations before they are sent to customers.

Anonymize the training data used to train the chatbot.

Why: Option C is correct because a human-in-the-loop (HITL) review process directly addresses the risk of privacy violations by ensuring that high-risk responses are reviewed by a human before being sent to the customer. This governance mechanism provides a safety net for unpredictable outputs from the generative AI model, which may inadvertently leak personally identifiable information (PII) or violate data protection regulations like GDPR or CCPA. Unlike technical controls that only reduce the attack surface, HITL offers real-time compliance oversight for the chatbot's natural language generation (NLG) outputs.

A self-driving car company is testing an AI model for pedestrian detection. During simulation, the model fails to detect pedestrians in low-light conditions. The safety team wants to improve robustness without retraining the entire model from scratch. Which approach is most appropriate?

Replace the convolutional layers with transformer layers to improve attention.

Apply data augmentation techniques to simulate low-light conditions in the training dataset.

Data augmentation can expand the training data to include low-light scenarios, improving robustness without full retraining.

Use adversarial training to add imperceptible perturbations to training images.

Increase the model's depth by adding more convolutional layers.

Why: Option B is correct because data augmentation techniques, such as adjusting brightness, contrast, and adding noise, can synthetically create low-light training examples from existing data. This improves the model's robustness to low-light conditions without requiring a full retraining from scratch, as it directly addresses the distribution shift in the input data.

Want more AI Security, Ethics and Governance practice?

Browse all AI0-001 questions Take a timed practice test

Frequently asked questions

How many questions are on the AI0-001 exam?

The AI0-001 exam has 80 questions and must be completed in 90 minutes. The passing score is 700/1000.

What types of questions appear on the AI0-001 exam?

Multiple-choice and performance-based questions covering IT security, networking, and operations. Some questions are performance-based (PBQs), asking you to complete tasks in a simulated environment.

How are AI0-001 questions organised by domain?

The exam covers 5 domains: AI Concepts and Foundations, Machine Learning and Deep Learning, AI Models and Data Engineering, AI Implementation and Operations, AI Security, Ethics and Governance. Questions are weighted by domain — higher-weight domains appear more on your actual exam.

Are these the actual AI0-001 exam questions?

No. These are original exam-style practice questions written against the official CompTIA AI0-001 exam objectives. They are not copied from the real exam. Courseiva focuses on genuine understanding, not memorisation of braindumps.

Ready to practice all 80 AI0-001 questions?

Courseiva tracks your accuracy per domain and routes you toward weak areas automatically. Free, no account required.

CompTIA · Free Practice Questions · Last reviewed May 2026

AI0-001 Exam Questions and Answers

30real exam-style questions organised by domain, each with the correct answer highlighted and a plain-English explanation of why it's right — and why the others are wrong.

80 exam questions

90 min time limit

Pass: 700/1000 / 1000

5 exam domains

Overview Domain Blueprint Study Guide All QuestionsSample by Domain

1. AI Concepts and Foundations 2. Machine Learning and Deep Learning 3. AI Models and Data Engineering 4. AI Implementation and Operations 5. AI Security, Ethics and Governance

Domain 1: AI Concepts and Foundations

All AI Concepts and Foundations questions

Transfer learning

Underfitting

Overfitting

The model fits training data too closely and fails on new data.

Bias-variance tradeoff

A data scientist trains a linear regression model to predict house prices. The model has high bias and low variance. Which action would most likely reduce bias?

Apply L2 regularization

Increase the training dataset size

Add polynomial features

Adding complexity reduces bias but may increase variance.

Remove irrelevant features

An AI engineer trains a deep learning model for image classification. After training, the training accuracy is 99% but validation accuracy is 85%. Which technique would best address this discrepancy?

Increase data augmentation

Decrease the learning rate

Increase the number of layers

Add dropout layers

Dropout reduces overfitting by preventing co-adaptation of neurons.

A company implements a chatbot using a rule-based system. Users complain the chatbot cannot handle new queries. Which AI approach should be considered to improve flexibility?

Expert system

Natural language processing (NLP)

Robotic process automation

Machine learning

ML enables the system to learn patterns from data.

An AI model for detecting fraudulent transactions has high precision but low recall. Which business impact is most likely?

The model has no impact on fraud detection

The model detects all fraudulent transactions

Many fraudulent transactions go undetected

Low recall indicates a high number of false negatives.

Many legitimate transactions are flagged as fraud

A data scientist splits a dataset into training (80%) and test (20%). After training, the model achieves 95% accuracy on training and 60% on test. Which step should the data scientist take first?

Collect more data

Use cross-validation

Apply regularization

Regularization penalizes large weights, reducing overfitting.

Increase model complexity

Want more AI Concepts and Foundations practice?

All Machine Learning and Deep Learning questions

Domain 2: Machine Learning and Deep Learning

F1 score

F1 score is the harmonic mean of precision and recall, providing a balanced measure for imbalanced datasets.

Accuracy

Recall

Precision

Increase learning rate

Feature scaling

Regularization

Regularization adds a penalty for large weights, reducing overfitting and sensitivity to input variations.

Cross-validation

Batch normalization

Increase number of epochs

Gradient clipping

Early stopping

Early stopping monitors validation loss and stops training when it starts to increase, reducing overfitting.

Transfer learning

Online learning

Online learning updates the model incrementally, allowing adaptation to new data without full retraining.

Batch learning

Unsupervised learning

Normal equation

Batch gradient descent

Principal component analysis

Stochastic gradient descent

SGD updates weights per sample, making it efficient for large datasets.

Increase the learning rate

Increase the number of epochs

Reduce the model complexity

Reducing complexity (e.g., fewer layers) can reduce overfitting and improve validation performance.

Add more convolutional layers

Want more Machine Learning and Deep Learning practice?

All AI Models and Data Engineering questions

Domain 3: AI Models and Data Engineering

SMOTE (Synthetic Minority Oversampling Technique)

SMOTE creates synthetic minority samples to balance classes.

Random undersampling of class A

Adding Gaussian noise to class B

Principal Component Analysis (PCA)

Use k-fold cross-validation

Apply log transformation to all features

Normalize or standardize the features

Scaling improves convergence of gradient descent.

One-hot encode the features

Data drift

The distribution of reviews differs between e-commerce and social media.

Concept drift

Bias-variance tradeoff

Overfitting

A data engineer needs to design a data pipeline for a real-time fraud detection system. The system requires low-latency processing of streaming transactions. Which architecture is most appropriate?

Stream processing with Apache Kafka and Flink

Stream processing provides low-latency real-time analysis.

Data lake with Apache Spark

Batch processing with Apache Hadoop

Microservices architecture with REST APIs

Data augmentation

L2 regularization

Early stopping

Early stopping prevents overfitting by stopping training when validation loss starts to rise.

Dropout

Use TLS for transfer, AES-256 for storage, and AWS CloudTrail for auditing

These provide encryption and auditing.

Use FTP for transfer, AES-128 for storage, and manual log review

Use SSH for transfer, store data in a database, and enable access logs

Use MD5 for hashing, store data in plaintext, and enable server logs

Want more AI Models and Data Engineering practice?

All AI Implementation and Operations questions

Domain 4: AI Implementation and Operations

Fine-tune the model on a curated dataset of domain-specific conversations.

Fine-tuning adapts the model to the domain with less data and compute.

Increase the temperature parameter to reduce randomness.

Collect more general training data and retrain the model from scratch.

Roll back to a previous version of the model that was more accurate.

The synthetic data should include a wide variety of events, even if not realistic.

The synthetic data should be generated using an unsupervised generative model.

The synthetic data should accurately represent the distribution and features of real rare events.

Fidelity to real event characteristics is crucial for generalization.

The synthetic data should be as large as possible to cover all possibilities.

A data scientist trains a regression model and notices the training loss is low but validation loss is high. Which technique should be applied FIRST to address this issue?

Increase the learning rate.

Add more layers to the neural network.

Increase the size of the training dataset.

Apply L1 or L2 regularization to the model.

Regularization penalizes large weights, reducing overfitting.

A company deploys an AI model for loan approval. The model shows bias against a protected group. The team decides to use adversarial debiasing. What is the PRIMARY advantage of this approach?

It guarantees the model's predictions are private.

It reduces bias while preserving predictive performance by learning representations that are invariant to sensitive attributes.

This is the core benefit of adversarial debiasing.

It is simpler to implement than pre-processing techniques.

It ensures equal approval rates across all groups.

An AIOps platform monitors server metrics and triggers alerts. The team notices too many false positives. Which adjustment should be made to the anomaly detection model?

Use a more complex model to better fit the data.

Shorten the observation window to detect anomalies faster.

Increase the training data to include more normal patterns.

Raise the anomaly score threshold for triggering alerts.

A higher threshold means only more extreme deviations trigger alerts.

A team deploys a machine learning model as a REST API. They want to monitor model drift. Which metric is MOST appropriate for detecting drift in the input data distribution?

Model accuracy on a recent holdout set.

Population stability index (PSI) comparing training and recent data.

PSI directly quantifies distribution shift.

F1 score on the training data.

Root mean squared error (RMSE) on test data.

Want more AI Implementation and Operations practice?

All AI Security, Ethics and Governance questions

Domain 5: AI Security, Ethics and Governance

Apply data anonymization techniques to the training dataset.

Anonymization removes personally identifiable information, ensuring that the model cannot output data linked to specific patients.

Implement role-based access control (RBAC) on the AI model's inference API.

Use differential privacy during model training.

Encrypt the training data at rest and in transit.

Increase the diversity of the training data by collecting more samples from underrepresented groups.

Schedule regular bias audits using fairness metrics.

Bias audits with metrics like demographic parity can detect unfair treatment and guide mitigation.

Retrain the model every month with the latest transaction data.

Use SHAP values to provide explanations for each prediction.

Train the model with a loss function that weights profit more heavily than customer satisfaction.

Use a multi-objective optimization framework to balance profit and customer satisfaction.

Multi-objective optimization explicitly considers trade-offs between competing objectives, leading to more balanced recommendations.

Adjust the model's hyperparameters to reduce the influence of profit features.

Remove profit data from the training set and only use customer preference data.

Apply adversarial debiasing to the model during training.

Adversarial debiasing reduces bias by training the model to be unable to predict protected attributes from its predictions.

Use a random selection of candidates to avoid bias.

Remove the gender feature from the dataset and retrain.

Collect more training data from underrepresented groups.

Use explainable AI techniques to understand why the chatbot generates certain responses.

Encrypt all chatbot conversations at rest and in transit.

Implement a human-in-the-loop review process for high-risk responses.

Human review can catch and block responses that violate privacy regulations before they are sent to customers.

Anonymize the training data used to train the chatbot.

Replace the convolutional layers with transformer layers to improve attention.

Apply data augmentation techniques to simulate low-light conditions in the training dataset.

Data augmentation can expand the training data to include low-light scenarios, improving robustness without full retraining.

Use adversarial training to add imperceptible perturbations to training images.

Increase the model's depth by adding more convolutional layers.

Want more AI Security, Ethics and Governance practice?