Knowledge + Practice

CompTIA AI+ AI0-001 (AI0-001) — Questions 76–150

500 questions total · 7pages · All types, answers revealed

Take a mock exam Exam hub

Page 2 of 7

76

MCQhard

A cybersecurity firm is developing an AI system to detect zero-day malware using behavior analysis. The team collects a dataset of 1,000 malware samples and 10,000 benign files from corporate endpoints. The model is a random forest classifier. After deployment, the false positive rate is 5%, which is acceptable, but the detection rate for new malware variants drops to 30%. The security analyst suspects the model is overfitting to the specific malware families in the training set. Which improvement should the team implement first?

A.Use a boosting ensemble instead of bagging

B.Collect more malware samples from the same families

C.Replace the random forest with a deep neural network

D.Engineer features that capture generic behavioral patterns

AnswerD

Generic features (e.g., process creation frequency, registry changes) help the model learn behaviors common to malware, improving detection of new variants.

Why this answer

The core issue is that the model has overfitted to the specific malware families in the training set, causing poor generalization to unseen zero-day variants. Engineering features that capture generic behavioral patterns (e.g., API call sequences, file system interactions, network connection anomalies) reduces reliance on family-specific signatures, improving detection of novel malware. This directly addresses the root cause of the 30% detection rate drop without introducing new model complexity or data imbalance issues.

Exam trap

CompTIA often tests the misconception that more complex models (boosting, DNNs) automatically improve performance, when in reality, feature engineering to address the specific failure mode (overfitting to training families) is the most effective first step.

How to eliminate wrong answers

Option A is wrong because boosting ensembles (e.g., AdaBoost, XGBoost) are more prone to overfitting on noisy data than bagging (Random Forest), which would exacerbate the existing overfitting problem. Option B is wrong because collecting more samples from the same families reinforces the model's bias toward those specific patterns, worsening generalization to new variants. Option C is wrong because replacing Random Forest with a deep neural network (DNN) typically requires significantly more data to avoid overfitting, and with only 1,000 malware samples, a DNN would likely perform worse, not better.

Full explanation →

77

MCQmedium

A data engineer needs to design a data pipeline for a real-time fraud detection system. The system requires low-latency processing of streaming transactions. Which architecture is most appropriate?

A.Stream processing with Apache Kafka and Flink

B.Data lake with Apache Spark

C.Batch processing with Apache Hadoop

D.Microservices architecture with REST APIs

AnswerA

Stream processing provides low-latency real-time analysis.

Why this answer

Apache Kafka provides a distributed, fault-tolerant event streaming platform that ingests high-throughput transaction data with low latency, while Apache Flink offers true stream processing with exactly-once semantics and sub-second event-time processing. Together, they enable real-time fraud detection by analyzing transactions as they arrive, without the delays inherent in batch or micro-batch approaches.

Exam trap

CompTIA often tests the distinction between true stream processing (e.g., Flink, Kafka Streams) and micro-batch or near-real-time processing (e.g., Spark Streaming), where candidates mistakenly assume that any 'streaming' API (like Spark Streaming) is equivalent to low-latency stream processing.

How to eliminate wrong answers

Option B is wrong because a data lake with Apache Spark typically relies on micro-batch processing (e.g., Spark Streaming with a minimum batch interval of ~100ms), which introduces higher latency than true stream processing and is unsuitable for sub-second fraud detection. Option C is wrong because batch processing with Apache Hadoop (e.g., MapReduce) is designed for high-throughput, high-latency processing of large static datasets, not for real-time streaming where transactions must be evaluated within milliseconds. Option D is wrong because microservices architecture with REST APIs is a design pattern for building distributed services, not a data pipeline technology; REST APIs introduce synchronous request-response overhead and cannot natively handle continuous, unbounded data streams with low-latency stateful processing.

Full explanation →

78

Multi-Selecteasy

A machine learning engineer is preparing to train a deep neural network for image classification. To avoid overfitting, which TWO techniques should the engineer apply? (Select TWO.)

Select 2 answers

A.Use dropout regularization.

B.Use data augmentation.

C.Increase the number of layers.

D.Remove all non-linear activation functions.

E.Reduce the training dataset size.

AnswersA, B

Dropout is a regularization technique that helps prevent overfitting by randomly dropping units.

Why this answer

Options B and D are correct. Dropout regularization randomly drops neurons during training, preventing co-adaptation. Data augmentation increases the effective size of the training set by applying transformations, reducing overfitting.

Option A (increasing layers) increases model capacity and may worsen overfitting. Option C (removing non-linear activation) reduces model expressiveness, leading to underfitting. Option E (reducing dataset size) would increase overfitting risk.

Full explanation →

79

MCQhard

A machine learning engineer is troubleshooting a recurrent neural network that fails to learn long-range dependencies in sequential data. The gradients are computed using backpropagation through time. Which phenomenon is most likely occurring, and what architectural change would best address it?

A.Underfitting; increase the number of time steps

B.Vanishing gradients; use LSTM or GRU units

C.Exploding gradients; apply gradient clipping

D.Overfitting; reduce the number of layers

AnswerB

Vanishing gradients prevent learning long-range patterns; LSTMs and GRUs have gating mechanisms to preserve gradients.

Why this answer

Option A (Exploding gradients) causes unstable training; gradient clipping helps but not for long-range dependencies. Option C (Overfitting) would not specifically affect long-range learning. Option D (Underfitting) is too general.

Option B correctly identifies vanishing gradients and suggests LSTMs or GRUs, which maintain long-term memory.

Full explanation →

80

MCQmedium

An AI system used for hiring has been found to exhibit racial bias against certain candidates. Which step should the organization take to mitigate this?

A.Remove all demographic features from the model.

B.Use a different algorithm that is inherently unbiased.

C.Regularly audit model predictions across demographic groups and retrain with fairness constraints.

D.Hire more diverse data scientists.

AnswerC

This approach identifies and corrects bias systematically.

Why this answer

Option C is correct because bias in AI systems is often embedded in training data or model behavior, not just in feature selection. Regularly auditing predictions across demographic groups and retraining with fairness constraints (e.g., demographic parity or equalized odds) allows the organization to detect and correct disparate impact without sacrificing model performance. This aligns with the AI0-001 focus on continuous monitoring and iterative improvement in AI operations.

Exam trap

CompTIA often tests the misconception that removing sensitive attributes (like race or gender) automatically makes a model fair, when in reality proxy features and biased training data can perpetuate discrimination.

How to eliminate wrong answers

Option A is wrong because simply removing demographic features does not eliminate bias; proxy features (e.g., zip code, education level) can still encode the same discriminatory patterns, and the model may learn biased correlations from the remaining data. Option B is wrong because no algorithm is inherently unbiased; bias arises from data, labeling, and deployment context, so switching algorithms without addressing root causes will not guarantee fairness. Option D is wrong because hiring more diverse data scientists, while beneficial for broader perspectives, does not directly mitigate existing model bias; technical interventions like auditing and retraining with fairness constraints are required.

Full explanation →

81

Multi-Selecthard

An organization is implementing an AI governance framework. Which THREE components are essential for compliance with ethical AI standards?

Select 3 answers

A.Data privacy protection measures (e.g., differential privacy).

B.Open-source licensing of all models.

C.Maximizing model accuracy to increase revenue.

D.Model explainability and interpretability mechanisms.

E.Regular bias auditing of models.

AnswersA, D, E

Privacy is a key ethical requirement.

Why this answer

Options A, B, and D are correct because bias auditing, explainability, and privacy protection are core ethical requirements. Option C is wrong because maximizing revenue is a business goal, not an ethical requirement. Option E is wrong because open-source is not a mandatory component.

Full explanation →

82

MCQmedium

A data engineering team is designing a pipeline to train a model on streaming data. The data arrives in a time-series format. Which approach should they use to ensure the model reflects current trends without catastrophic forgetting?

A.Implement incremental learning with periodic validation

B.Use a sliding window of the most recent data for training

C.Deploy an ensemble of models trained on different time periods

D.Retrain the entire model from scratch every week

AnswerA

Incremental learning adapts to new data while retaining previous knowledge.

Why this answer

Incremental learning (also called online learning) allows the model to update its parameters continuously as new streaming data arrives, without requiring access to historical data. By coupling this with periodic validation on a held-out set, the team can detect concept drift and ensure the model adapts to current trends while avoiding catastrophic forgetting, which occurs when new updates overwrite previously learned patterns.

Exam trap

CompTIA often tests the misconception that a sliding window of recent data alone prevents catastrophic forgetting, but without a mechanism like elastic weight consolidation or replay buffers, the model still forgets older but recurring patterns.

How to eliminate wrong answers

Option B is wrong because a sliding window of only the most recent data discards older patterns entirely, which can cause catastrophic forgetting of long-term seasonality or trends. Option C is wrong because an ensemble of models trained on different time periods does not inherently adapt to streaming data; it requires retraining or adding new models over time and can become computationally expensive without addressing forgetting in individual models. Option D is wrong because retraining the entire model from scratch every week is inefficient for streaming data, introduces latency, and may still cause forgetting of intra-week patterns if the retraining window is too narrow.

Full explanation →

83

MCQeasy

A company is building a recommendation system for an e-commerce platform. They want the system to learn from user purchase history and browsing behavior to suggest products. Which type of machine learning is most appropriate for this task?

A.Supervised learning

B.Semi-supervised learning

C.Unsupervised learning

D.Transfer learning

AnswerC

Unsupervised learning can find patterns in user behavior without labels, suitable for recommendations.

Why this answer

Unsupervised learning is the most appropriate because the system must discover hidden patterns and groupings in user purchase history and browsing behavior without labeled outcomes. Recommendation systems often use clustering or association rule mining (e.g., market basket analysis) to identify product affinities and user segments, which are core unsupervised techniques. This allows the system to suggest products based on learned co-occurrence patterns rather than predefined categories.

Exam trap

CompTIA often tests the misconception that recommendation systems always require labeled data, leading candidates to choose supervised learning, but the key is that unsupervised learning excels at finding hidden structures in unlabeled behavioral data.

How to eliminate wrong answers

Option A is wrong because supervised learning requires labeled training data (e.g., explicit ratings or purchase/no-purchase labels), which the scenario does not provide; the system must learn from unlabeled behavioral data. Option B is wrong because semi-supervised learning still requires a small amount of labeled data to guide the learning, but the problem statement specifies only raw purchase history and browsing behavior with no labels. Option D is wrong because transfer learning involves applying knowledge from a pre-trained model on a different but related task, which is unnecessary here since the system can learn directly from the available data without needing to transfer from another domain.

Full explanation →

84

MCQeasy

A data scientist trains a linear regression model on housing prices. The training error is low, but test error is high. What is the most likely issue?

A.Overfitting

B.Multicollinearity

C.Data leakage

D.Underfitting

AnswerA

Correct: low training error and high test error is classic overfitting.

Why this answer

Option A is correct because overfitting occurs when the model fits the training data too well but performs poorly on unseen data. Options B, C, and D are incorrect because underfitting would show high training error, data leakage would cause artificially high performance, and multicollinearity affects coefficient interpretation but not necessarily test error.

Full explanation →

85

MCQhard

An e-commerce company deploys a recommendation system using collaborative filtering. After launch, the system shows high accuracy for popular items but fails to recommend niche products to users who would likely buy them. Which technique should the team implement to improve recommendations for long-tail items?

A.Apply matrix factorization with higher latent factors

B.Switch to a hybrid filtering approach that incorporates item metadata

C.Increase the weight of popular items in the recommendation score

D.Collect more user interaction data over time

AnswerB

Hybrid filtering uses item features to recommend niche items even with sparse interaction data.

Why this answer

Collaborative filtering relies on user-item interactions, which are sparse for niche products (the long tail). A hybrid filtering approach that incorporates item metadata (e.g., category, description, attributes) can bridge the gap by using content-based signals to recommend niche items even when interaction data is limited. This directly addresses the cold-start and sparsity problems for long-tail items.

Exam trap

CompTIA often tests the misconception that more data or higher model complexity (like more latent factors) automatically solves sparsity, when in fact the core issue is the lack of interaction signals for niche items, which requires a hybrid approach to incorporate auxiliary information.

How to eliminate wrong answers

Option A is wrong because increasing latent factors in matrix factorization can lead to overfitting and does not inherently solve the sparsity problem for long-tail items; it may even amplify noise. Option C is wrong because increasing the weight of popular items would further bias recommendations toward the head of the distribution, worsening the neglect of niche products. Option D is wrong because simply collecting more user interaction data over time does not guarantee that long-tail items will receive sufficient interactions; the data will still be skewed toward popular items, and the system needs a mechanism to leverage non-interaction signals like metadata.

Full explanation →

86

MCQmedium

A hospital deploys an AI diagnostic assistant that analyzes medical images. The system has been in use for six months, and radiologists have reported that the AI is increasingly confident in its predictions, but sometimes misses rare conditions. The AI ethics board is concerned about overreliance and potential harm from false negatives. They want to implement a governance framework that ensures appropriate human oversight. The hospital has a limited IT budget. What is the best approach?

A.Implement a human-in-the-loop process where the AI flags low-confidence or rare condition predictions for mandatory radiologist review

B.Add a warning to the AI interface that says 'This tool may miss rare conditions'

C.Require all AI predictions to be reviewed by a radiologist before final diagnosis

D.Increase the AI's false positive threshold to reduce missed cases

AnswerA

This balances efficiency with safety, ensuring oversight where it matters.

Why this answer

Option C is correct because establishing a human-in-the-loop process for high-risk cases ensures oversight without full automation removal. Option A is wrong because requiring all cases to be reviewed eliminates efficiency gains. Option B is wrong because increasing false positive threshold could increase false negatives.

Option D is wrong because confidence scores alone are not sufficient; mandatory human review for low-confidence cases is needed.

Full explanation →

87

MCQhard

A fraud detection model has high precision but low recall. The cost of false negatives is very high. Which threshold adjustment should be made?

A.Use class weights during training

B.Apply SMOTE to the training data

C.Decrease classification threshold

D.Increase classification threshold

AnswerC

Decreasing the threshold increases the number of positive predictions, raising recall and reducing false negatives.

Why this answer

Option B is correct because decreasing the classification threshold classifies more samples as positive, increasing recall (at the cost of some precision). Option A would increase precision but further reduce recall. Option C (class weights) is a training-time adjustment.

Option D (SMOTE) addresses imbalance but is applied to training data, not prediction threshold.

Full explanation →

88

MCQhard

A financial institution uses an AI model to approve loan applications. The model was trained on historical data that included biased lending practices. The bank's ethics committee wants to mitigate bias without removing protected attributes. Which approach best balances fairness and model performance?

A.Retrain the model using a balanced dataset

B.Remove all protected attributes from the training data

C.Post-process model outputs to adjust for demographic parity

D.Apply adversarial debiasing during training

AnswerD

Adversarial debiasing reduces bias by learning non-discriminatory representations.

Why this answer

Adversarial debiasing is the best approach because it directly optimizes the model to reduce bias during training while preserving predictive accuracy. It uses an adversarial network that tries to predict the protected attribute from the model's predictions, forcing the main model to learn representations that are less correlated with that attribute. This allows the bank to keep protected attributes in the data (as required by the ethics committee) while actively mitigating bias.

Exam trap

CompTIA often tests the misconception that simply removing protected attributes (Option B) is sufficient to eliminate bias, when in reality proxy features and correlated variables can perpetuate discrimination.

How to eliminate wrong answers

Option A is wrong because retraining on a balanced dataset only addresses representation bias (e.g., equal numbers of approved/rejected loans across groups) but does not remove the underlying biased correlations learned from historical lending practices; it may also reduce model performance by discarding real-world data distributions. Option B is wrong because removing all protected attributes does not eliminate bias—correlated features (e.g., zip code, income) can act as proxies for race or gender, leading to indirect discrimination, and the ethics committee explicitly wants to keep protected attributes. Option C is wrong because post-processing adjusts outputs after the model is trained, which can improve demographic parity but often at the cost of significant accuracy loss and does not address bias embedded in the model's internal representations.

Full explanation →

89

MCQmedium

A data scientist is training a neural network to classify images of animals. The training accuracy is 99%, but validation accuracy is only 65%. Which technique should the data scientist use to address this issue?

A.Apply batch normalization

B.Increase the number of training epochs

C.Add dropout layers to the network

D.Increase the learning rate

AnswerC

Dropout randomly deactivates neurons, which reduces overfitting by making the model less sensitive to specific weights.

Why this answer

The high training accuracy (99%) and low validation accuracy (65%) indicate overfitting, where the model memorizes the training data but fails to generalize. Adding dropout layers randomly drops neurons during training, which forces the network to learn more robust features and reduces overfitting. This technique is specifically designed to improve generalization without requiring more data or altering the learning rate.

Exam trap

CompTIA often tests the distinction between techniques that improve training speed (batch normalization, learning rate tuning) versus those that improve generalization (dropout, regularization), and the trap here is that candidates may confuse overfitting with underfitting or assume that more training always helps.

How to eliminate wrong answers

Option A is wrong because batch normalization normalizes layer inputs to stabilize and accelerate training, but it does not directly address overfitting; it can even slightly reduce the need for dropout but is not the primary solution for this gap. Option B is wrong because increasing the number of training epochs would likely worsen overfitting, as the model would have more opportunities to memorize the training data, further increasing the accuracy gap. Option D is wrong because increasing the learning rate can cause the model to converge too quickly to a suboptimal solution or diverge, and it does not target the root cause of overfitting.

Full explanation →

90

MCQhard

A global retailer uses an AI model to forecast demand across thousands of stores. After deployment, the model's predictions become less accurate during holiday seasons. The training data included two years of holiday periods. What is the most effective operational strategy to handle this recurring seasonal drift?

A.Deploy an anomaly detection system to flag holiday prediction outliers

B.Implement a scheduled retraining cycle just before each holiday period

C.Use an ensemble of models trained on different time periods

D.Increase the volume of training data by including five years of history

AnswerB

Proactive retraining with recent holiday data mitigates seasonal drift.

Why this answer

Scheduled retraining just before each holiday season directly addresses the recurring seasonal drift by updating the model with the most recent holiday data patterns. This is the most effective operational strategy because it proactively aligns the model with the known, periodic shift in demand behavior, rather than reacting to errors or relying on static historical data.

Exam trap

CompTIA often tests the misconception that more data or anomaly detection is the universal solution to drift, but the trap here is that candidates overlook the need for proactive, scheduled updates tailored to known recurring patterns rather than reactive or static fixes.

How to eliminate wrong answers

Option A is wrong because anomaly detection only flags outliers after predictions are made, it does not correct the underlying model drift or improve forecast accuracy during the holiday period. Option C is wrong because an ensemble of models trained on different time periods may reduce variance but does not specifically target the recurring seasonal pattern; it could still suffer from drift if none of the models are updated for the current holiday context. Option D is wrong because simply adding more historical data (five years) does not guarantee the model will adapt to the most recent seasonal shifts; older data may even introduce outdated patterns that dilute the relevance of recent holiday trends.

Full explanation →

91

MCQmedium

A startup is building a chatbot to handle customer inquiries. They want the chatbot to understand context and provide accurate responses without requiring extensive labeled data. Which AI approach is most suitable?

A.Reinforcement learning from human feedback

B.Rule-based natural language processing

C.Convolutional neural networks (CNNs)

D.Transfer learning with a pre-trained transformer model

AnswerD

Transfer learning leverages pre-trained language models and fine-tunes with small data.

Why this answer

Transfer learning with a pre-trained transformer model (e.g., BERT, GPT) is the most suitable approach because it allows the chatbot to understand context and generate accurate responses using knowledge learned from vast general-domain text, requiring only minimal fine-tuning on the startup's specific customer inquiry data. This eliminates the need for extensive labeled datasets, as the model already captures nuanced language patterns and contextual relationships through its self-attention mechanism.

Exam trap

CompTIA often tests the misconception that RLHF alone reduces the need for labeled data, when in fact it requires a pre-trained model and a reward model trained on human preferences, making transfer learning the more direct solution for minimizing labeled data requirements.

How to eliminate wrong answers

Option A is wrong because reinforcement learning from human feedback (RLHF) is a fine-tuning technique that still requires a substantial initial labeled dataset or a reward model, and it is typically applied on top of a pre-trained model rather than being a standalone solution for reducing labeled data needs. Option B is wrong because rule-based NLP relies on handcrafted rules and pattern matching, which cannot handle the variability and contextual ambiguity of natural language in customer inquiries without extensive manual effort and brittle maintenance. Option C is wrong because convolutional neural networks (CNNs) are primarily designed for spatial pattern recognition (e.g., images) and, while they can be applied to text, they lack the sequential context modeling and long-range dependency capture that transformer architectures provide, making them less effective for conversational understanding.

Full explanation →

92

MCQhard

Refer to the exhibit. Which model is NOT in full compliance with the policy?

A.ChurnPredict v1

B.FraudDetect v4

C.CreditScorer v2

D.LoanApproval v3

AnswerC

CreditScorer v2 is explicitly granted an exception, meaning it does not need to comply.

Why this answer

CreditScorer v2 is listed in the exceptions, meaning it is not required to meet the policy requirements.

Full explanation →

93

Multi-Selectmedium

Which THREE are key principles of trustworthy AI according to the OECD?

Select 3 answers

A.Profitability

B.Robustness

C.Transparency

D.Scalability

E.Accountability

AnswersB, C, E

Robustness ensures AI systems perform reliably under varied conditions.

Why this answer

Transparency, accountability, and robustness are three of the five OECD principles (along with inclusive growth and human-centred values). Profitability and scalability are not principles.

Full explanation →

94

Multi-Selecthard

Which THREE of the following are key components of an AI governance framework?

Select 3 answers

A.Cloud infrastructure configuration

B.Model accuracy benchmarks

C.Risk assessment and mitigation plans

D.Transparency and explainability policies

E.Data management and privacy controls

AnswersC, D, E

Essential for identifying and managing AI risks.

Why this answer

Options A, B, and D are correct because risk assessment, data management, and transparency are core pillars. Option C is wrong because model accuracy is a performance metric, not a governance component. Option E is wrong because cloud infrastructure is deployment, not governance.

Full explanation →

95

MCQmedium

A manufacturing company uses a predictive maintenance AI system to schedule equipment repairs. The system was trained on sensor data from machinery. Recently, the system has been missing failures, leading to unexpected downtime. An investigation reveals that the sensor data from one plant has been corrupted due to a sensor malfunction. The corrupted data was used in retraining. The company needs to restore system accuracy quickly. The data science team can access the training logs. What is the best course of action?

A.Roll back to the previous model version before the corrupt data was ingested, then clean the sensor data and retrain

B.Switch to a simpler linear regression model that is less sensitive to data quality issues

C.Retrain the model using all available data, including the corrupted sensor data

D.Apply a weight to sensor data from that plant to reduce its influence

AnswerA

Reverting removes the damage, cleaning ensures future data is correct, and retraining updates the model.

Why this answer

Option D is correct because reverting to the last good model and then cleaning the data ensures the corruption is removed. Option A is wrong because retraining with both corrupted and good data may not remove the influence. Option B is wrong because reweighting may not fully correct.

Option C is wrong because using a simpler model may not capture complex patterns.

Full explanation →

96

MCQhard

A machine learning team is developing a model to predict server failure from telemetry data. They use a deep neural network with 3 hidden layers. After training, the model achieves 99% accuracy on training data but only 85% on validation data. Which technique should the team apply to reduce the generalization error?

A.Increase the number of hidden layers

B.Apply L2 regularization

C.Increase the learning rate

D.Add more training data

AnswerB

Regularization adds a penalty on large weights, reducing overfitting and improving generalization.

Why this answer

The large gap between training and validation accuracy indicates overfitting. L2 regularization penalizes large weights and reduces overfitting. Increasing layers or learning rate would exacerbate overfitting, and adding data helps but may not be immediately available.

Full explanation →

97

Multi-Selectmedium

Which TWO of the following are common methods for mitigating bias in AI models?

Select 2 answers

A.Using adversarial training

B.Reweighting training samples based on sensitive attributes

C.Applying L1 regularization

D.Adding fairness constraints during training

E.Performing k-fold cross-validation

AnswersB, D

Reweighting can adjust for underrepresented groups to reduce bias.

Why this answer

Reweighting training samples based on sensitive attributes is a common pre-processing bias mitigation technique. It assigns higher weights to underrepresented groups or lower weights to overrepresented groups to balance the dataset, thereby reducing the model's reliance on biased correlations. This method directly addresses data-level bias before model training begins.

Exam trap

CompTIA often tests the distinction between bias mitigation techniques (pre-processing, in-processing, post-processing) and general ML best practices like regularization or cross-validation, leading candidates to confuse L1 regularization or k-fold cross-validation with fairness methods.

Full explanation →

98

MCQhard

A data scientist is training a convolutional neural network (CNN) for object detection. The training loss decreases rapidly but then plateaus at a high value, and the validation loss starts increasing. Which action should the scientist take to improve the model?

A.Increase the learning rate

B.Increase the number of epochs

C.Reduce the model complexity

D.Add more convolutional layers

AnswerC

Reducing complexity (e.g., fewer layers) can reduce overfitting and improve validation performance.

Why this answer

The training loss decreasing rapidly then plateauing at a high value while validation loss increases is classic overfitting. Reducing model complexity (Option C) directly addresses overfitting by decreasing the number of parameters or applying regularization (e.g., dropout, L2), which forces the network to learn more generalizable features rather than memorizing noise in the training data.

Exam trap

CompTIA often tests the misconception that high training loss plateau means underfitting or insufficient learning, leading candidates to increase model complexity or epochs, when the real issue is overfitting indicated by the validation loss increase.

How to eliminate wrong answers

Option A is wrong because increasing the learning rate would likely cause the loss to oscillate or diverge, not fix the plateau or overfitting; it addresses convergence speed, not generalization. Option B is wrong because increasing epochs would continue training on an already overfitting model, worsening the validation loss divergence. Option D is wrong because adding more convolutional layers increases model complexity, which would exacerbate overfitting by adding more parameters to memorize training data.

Full explanation →

99

Multi-Selectmedium

Which THREE of the following are key components of an AI governance framework?

Select 3 answers

A.Regular auditing and monitoring for compliance.

B.Cloud-based deployment for scalability.

C.Ethical guidelines for AI development and deployment.

D.Explainability mechanisms for model decisions.

E.Model accuracy thresholds for production deployment.

AnswersA, C, D

Auditing ensures ongoing adherence to policies and regulations.

Why this answer

Regular auditing and monitoring for compliance (A) is a key component of an AI governance framework because it ensures that AI systems operate within legal, ethical, and organizational policies over time. Continuous monitoring detects drift, bias, or security violations, while audits provide evidence of adherence to standards such as ISO/IEC 42001 or internal governance rules. Without this, governance becomes a static policy with no enforcement or verification.

Exam trap

CompTIA often tests the distinction between governance components (policies, ethics, oversight) and operational or technical metrics (deployment, accuracy thresholds), leading candidates to confuse performance requirements with governance pillars.

Full explanation →

100

MCQhard

A credit risk model is being developed to predict loan defaults. The dataset has 95% non-default and 5% default instances. The data scientist trains a logistic regression model and obtains 95% accuracy, but the recall for defaults is only 10%. Which action is most appropriate to improve the model's ability to identify defaults?

A.Apply principal component analysis (PCA) to reduce dimensionality

B.Collect more data from loan applicants to increase dataset size

C.Undersample the non-default class to match the number of defaults

D.Use SMOTE to oversample the default class

AnswerD

SMOTE creates synthetic samples, balancing classes and improving recall.

Why this answer

SMOTE (Synthetic Minority Oversampling Technique) is the most appropriate action because it generates synthetic samples for the minority class (defaults) rather than simply duplicating existing ones. This directly addresses the severe class imbalance (95% non-default vs. 5% default) that causes the logistic regression model to achieve high accuracy by predicting nearly all instances as non-default, while failing to identify actual defaults (recall of only 10%). By creating realistic synthetic default instances, SMOTE balances the training data and forces the model to learn decision boundaries that better capture the minority class.

Exam trap

CompTIA often tests the misconception that undersampling the majority class is always better than oversampling the minority class, but in this scenario, undersampling would discard valuable non-default patterns and reduce model robustness, whereas SMOTE generates new, realistic default samples without data loss.

How to eliminate wrong answers

Option A is wrong because PCA reduces dimensionality by projecting data onto principal components, which does not address class imbalance and can even discard variance that distinguishes defaults from non-defaults. Option B is wrong because simply collecting more data does not guarantee a better ratio of defaults; if the underlying population imbalance remains, the model will still be biased toward the majority class. Option C is wrong because undersampling the non-default class discards a large amount of potentially useful data, which can lead to loss of information and reduced model performance, especially when the majority class contains important patterns.

Full explanation →

101

MCQmedium

A financial institution uses an AI model to approve small business loans. The model has a high approval rate for women-owned businesses but low for minority-owned businesses. The compliance officer is concerned about disparate impact. Which governance process should be implemented first?

A.Remove gender and ethnicity features from the model

B.Conduct a bias audit and fairness assessment using relevant metrics

C.Publish the model's decision-making criteria to the public

D.Immediately adjust the approval threshold to equalize rates

AnswerB

An audit identifies the extent and sources of bias to inform remediation.

Why this answer

Option D is correct: a bias audit and fairness assessment should be conducted to quantify impact. Option A is wrong because without analysis, altering thresholds is arbitrary. Option B is wrong because equalizing rates may lead to reverse discrimination or mask root causes.

Option C is wrong because full disclosure may expose the company to liability without remediation.

Full explanation →

102

MCQhard

A large e-commerce company has deployed a real-time product recommendation system using a neural collaborative filtering model. The model was trained on six months of user click and purchase data. For the first three months after deployment, the click-through rate (CTR) improved by 15%. However, starting in the fourth month, CTR began decreasing steadily despite no changes to the system infrastructure or data pipeline. The product manager suspects model decay but the engineering team insists the model is static and should not degrade. The data science lead suggests investigating further. They have access to production logs, A/B testing framework, and historical model versions. What is the BEST course of action to diagnose and address the issue?

A.Re-deploy the model with additional features such as time of day and user device.

B.Increase the frequency of batch inference from hourly to every 10 minutes to improve responsiveness.

C.Set up an A/B test comparing the current model against the original baseline model using recent traffic.

D.Retrain the model on only the most recent 30 days of data and replace the current model.

AnswerC

A/B testing isolates whether the current model underperforms relative to a known good version, confirming decay.

Why this answer

Option C is correct because setting up an A/B test comparing the current model against the original baseline model using recent traffic directly isolates whether the model's predictive performance has degraded due to concept drift (changes in user behavior over time). Since the model is static but the data distribution has shifted, the A/B test provides empirical evidence of decay by measuring CTR differences under identical conditions, which is the standard diagnostic step before any retraining or feature engineering.

Exam trap

CompTIA often tests the principle that diagnosing model decay requires a controlled comparison (A/B test) rather than immediately retraining or adding features, and the trap here is assuming that a static model cannot degrade when the underlying data distribution changes.

How to eliminate wrong answers

Option A is wrong because adding features like time of day or user device without first diagnosing the root cause of CTR decline may introduce noise or overfitting, and does not address the likely concept drift. Option B is wrong because increasing batch inference frequency improves latency but does not affect model accuracy or counteract data distribution shifts; the model's predictions remain unchanged regardless of inference cadence. Option D is wrong because retraining on only the most recent 30 days of data could discard valuable long-term patterns and may cause catastrophic forgetting, and it bypasses the necessary diagnostic step of confirming that model decay is indeed the issue.

Full explanation →

103

MCQmedium

A team is deploying an AI model that predicts patient readmission risk. The model was trained on data from three hospitals but will be used in a fourth hospital with different patient demographics. What is the most important security risk to assess?

A.Data poisoning during training

B.Adversarial attacks that cause misclassification

C.Model inversion to extract patient data

D.Data breach of the inference API

AnswerB

The shift in demographics can make the model more vulnerable to adversarial examples that cause incorrect readmission predictions.

Why this answer

Using a model on data from a different distribution (population shift) can degrade performance, but from a security perspective, the main risk is adversarial attacks that exploit the model's unfamiliarity with new data. Model inversion and poisoning are training-time attacks; data breach is an operational risk but not specific to this scenario.

Full explanation →

104

MCQeasy

A healthcare startup is developing a diagnostic system using medical images. The team has collected 10,000 labeled images of skin lesions. They plan to train a convolutional neural network (CNN) from scratch. However, training converges slowly, and the validation accuracy plateaus at 70%. The data scientist suspects overfitting. The dataset contains 8,000 images of benign lesions and 2,000 of malignant. The team has limited GPU resources. Which of the following is the MOST effective course of action to improve validation accuracy? A. Reduce the number of convolutional layers. B. Apply transfer learning using a pre-trained model on ImageNet. C. Increase the learning rate by a factor of 10. D. Add more dropout after every convolutional layer.

A.Increase the learning rate by a factor of 10.

B.Reduce the number of convolutional layers.

C.Add more dropout after every convolutional layer.

D.Apply transfer learning using a pre-trained model on ImageNet.

AnswerD

Transfer learning provides a strong feature extractor learned from a large dataset, which can significantly improve performance with limited data.

Why this answer

Option B is correct. Transfer learning leverages a model pre-trained on a large dataset (e.g., ImageNet), which provides useful features for medical images and reduces the need for large amounts of data and computational resources. It is particularly effective when the dataset is small and imbalanced.

Option A (reducing layers) may reduce capacity and underfit. Option C (increasing learning rate) might cause divergence or overshoot minima. Option D (adding dropout) can help with overfitting but is unlikely to jump from 70% to a significantly higher accuracy given limited data; transfer learning provides a stronger boost.

Full explanation →

105

MCQhard

A company's AI governance board requires each model to have a model card documenting intended use, performance metrics, and limitations. What is the primary purpose of a model card?

A.To provide transparent documentation of model capabilities and limitations

B.To specify the exact training algorithm and hyperparameters

C.To outline a complete risk assessment framework

D.To serve as a legal contract between developers and users

AnswerA

They promote accountability and informed deployment.

Why this answer

Option B is correct because model cards communicate key information to stakeholders for informed use and transparency. Option A is wrong because model cards are not legal contracts. Option C is wrong because model cards are documentation, not training methodology.

Option D is wrong because they describe limitations but not a full risk assessment framework.

Full explanation →

106

MCQeasy

A company streams sensor data from IoT devices. The data arrives as JSON messages at high velocity. Which data pipeline architecture is BEST suited to handle this streaming data for near-real-time analytics?

A.Batch processing using Hadoop MapReduce every 24 hours.

B.Batch processing using nightly ETL jobs.

C.Single-node database with periodic inserts.

D.Stream processing using Apache Kafka and Spark Streaming.

AnswerD

Kafka ingests streaming data, Spark Streaming processes it with low latency.

Why this answer

Apache Kafka acts as a distributed, fault-tolerant ingestion layer that can handle high-velocity JSON messages, while Spark Streaming processes the data in micro-batches for near-real-time analytics. This combination provides the low-latency, scalable pipeline required for streaming IoT sensor data, unlike batch or single-node approaches.

Exam trap

CompTIA often tests the distinction between batch and stream processing by presenting batch options that seem 'reliable' or 'traditional,' trapping candidates who overlook the explicit 'near-real-time' requirement in the question.

How to eliminate wrong answers

Option A is wrong because Hadoop MapReduce is designed for batch processing of large static datasets, not for continuous high-velocity streaming data, and a 24-hour cycle cannot meet near-real-time requirements. Option B is wrong because nightly ETL jobs introduce hours of latency, making them unsuitable for near-real-time analytics on streaming data. Option C is wrong because a single-node database with periodic inserts cannot scale to handle high-velocity IoT data streams and will become a bottleneck, failing to provide near-real-time processing.

Full explanation →

107

MCQmedium

An organization uses a machine learning model to approve loans. The model shows higher false positive rates for a protected group. Which data engineering step should be taken to mitigate this?

A.Remove the protected attribute from training data

B.Use adversarial debiasing technique

C.Increase model complexity

D.Add synthetic data to balance groups

AnswerB

Adversarial debiasing forces the model to be invariant to protected attributes, reducing bias.

Why this answer

Option C is correct because adversarial debiasing explicitly reduces bias during training. Option A (removing the attribute) often fails due to correlated features. Option B (synthetic data) can help but may not be sufficient.

Option D increases complexity, potentially worsening bias.

Full explanation →

108

MCQmedium

A batch inference pipeline fails intermittently with out-of-memory errors when processing large datasets. The pipeline uses pandas DataFrames and feeds a pre-trained model. Which change would most effectively reduce memory consumption?

A.Increase the instance size of the compute node

B.Use a database instead of CSV files

C.Convert the model to use half-precision

D.Split the data into smaller chunks and process sequentially

AnswerD

Chunking reduces peak memory by processing subsets of the data at a time.

Why this answer

Option D is correct because splitting a large dataset into smaller chunks and processing them sequentially directly addresses the root cause of the out-of-memory error: the entire dataset is loaded into memory at once via pandas DataFrames. By processing data in batches, each chunk fits within the available RAM, preventing memory exhaustion while still allowing the pipeline to complete the full inference workload.

Exam trap

CompTIA often tests the misconception that scaling up hardware (Option A) is the best solution, when in fact architectural changes like chunking (Option D) are more effective and cost-efficient for batch processing workloads.

How to eliminate wrong answers

Option A is wrong because increasing the instance size merely adds more memory, which is a temporary workaround that does not fix the underlying inefficiency and increases cost; the pipeline will still fail if the dataset grows beyond the new limit. Option B is wrong because using a database instead of CSV files changes the storage layer but does not inherently reduce memory consumption during inference—pandas still loads the entire result set into a DataFrame unless chunked queries are explicitly used. Option C is wrong because converting the model to half-precision (FP16) reduces model memory footprint but does not address the primary memory consumer, which is the pandas DataFrame holding the large dataset; the model is typically much smaller than the data.

Full explanation →

109

MCQeasy

A company uses linear regression to predict sales based on advertising spend. The model's residuals show a pattern of increasing variance as spend increases. Which assumption of linear regression is violated?

A.Normality

B.Homoscedasticity

C.Linearity

D.Independence

AnswerB

Homoscedasticity requires constant variance of residuals; increasing variance violates it.

Why this answer

Option D is correct because homoscedasticity assumes constant variance of residuals; increasing variance indicates heteroscedasticity. Option A is incorrect because linearity is about the relationship, not residual variance. Option B is incorrect because independence refers to errors being independent.

Option C is incorrect because normality is about the distribution of residuals, not variance.

Full explanation →

110

MCQeasy

A data scientist is training a model to classify customer support tickets into categories. The dataset has 10,000 labeled examples, but the 'billing' category contains 8,000 examples while the 'technical' category contains 2,000. Which technique is most appropriate to address this imbalance before training?

A.Apply random oversampling on the 'technical' category.

B.Remove all examples except 'billing' and use a one-class classifier.

C.Use accuracy as the only evaluation metric.

D.Train the model as is, then adjust thresholds post-training.

AnswerA

Correct; oversampling balances the classes.

Why this answer

Option A is correct because random oversampling duplicates examples from the minority class ('technical') to balance the class distribution, preventing the model from becoming biased toward the majority class ('billing'). This technique directly addresses the class imbalance before training, which is critical for classification tasks where the minority class is underrepresented.

Exam trap

CompTIA often tests the misconception that adjusting thresholds post-training can compensate for class imbalance, but the trap here is that the model's internal weights are already skewed by the imbalanced training data, making threshold tuning ineffective without prior balancing.

How to eliminate wrong answers

Option B is wrong because removing all 'billing' examples discards the majority of the data, forcing a one-class classifier that cannot learn to distinguish between categories, which defeats the purpose of multi-class classification. Option C is wrong because accuracy is a misleading metric for imbalanced datasets; a model that always predicts 'billing' would achieve 80% accuracy without learning anything about 'technical' tickets. Option D is wrong because training the model as is on imbalanced data will bias the model toward the majority class, and post-training threshold adjustment alone cannot fix the underlying skewed decision boundary learned during training.

Full explanation →

111

MCQeasy

An organization wants to classify support tickets into categories (billing, technical, etc.). Which type of machine learning is most suitable?

A.Unsupervised learning

B.Reinforcement learning

C.Supervised learning

D.Regression

AnswerC

Classification uses labeled data to predict categories.

Why this answer

Supervised learning is the correct choice because the organization has labeled historical support tickets (e.g., 'billing' or 'technical') and wants to train a model to map new tickets to these predefined categories. This is a classic classification task, where the algorithm learns from input-output pairs to predict the correct label for unseen data.

Exam trap

CompTIA often tests the distinction between classification (supervised) and clustering (unsupervised), so the trap here is that candidates mistakenly choose unsupervised learning because they think 'grouping tickets' is clustering, ignoring that the categories are predefined and labeled.

How to eliminate wrong answers

Option A is wrong because unsupervised learning discovers hidden patterns or clusters in unlabeled data, but here the categories are known and labeled, so clustering is unnecessary. Option B is wrong because reinforcement learning involves an agent learning through trial-and-error interactions with an environment to maximize a reward signal, which is not applicable to static ticket classification. Option D is wrong because regression predicts continuous numerical values (e.g., ticket resolution time), not discrete categorical labels like 'billing' or 'technical'.

Full explanation →

112

MCQhard

A company uses a neural network for fraud detection. The dataset has 99% legitimate, 1% fraudulent. The model achieves 99% accuracy but fails to detect most frauds. Which metric should they focus on?

A.Precision

B.F1-score

C.Recall

D.AUC-ROC

AnswerC

Correct: Recall measures the proportion of actual frauds that are correctly identified.

Why this answer

Option B is correct because recall measures the ability to find all positive samples (frauds), which is critical in fraud detection. Options A, C, and D are incorrect: precision is important but not as crucial as recall in this imbalanced scenario, F1-score balances precision and recall but recall directly addresses the issue, and AUC-ROC is not as intuitive for this specific problem.

Full explanation →

113

MCQmedium

A hospital uses an AI system to prioritize patient triage based on vital signs and medical history. During a trial, the system consistently assigns lower urgency to elderly patients with chronic conditions, even when their symptoms suggest high risk. Which approach best addresses this bias?

A.Use a different dataset from a similar hospital without checking demographics

B.Manually increase the weight of age-related features in the model

C.Replace the neural network with a decision tree to simplify decision logic

D.Audit the training data for representation of elderly patients and retrain with balanced data

AnswerD

Auditing and retraining with balanced data addresses the root cause of bias.

Why this answer

Option D is correct because the bias originates from the training data underrepresenting elderly patients with chronic conditions, causing the model to learn skewed urgency patterns. Auditing the data for representation and retraining with balanced data directly addresses the root cause by ensuring the model learns from a fair distribution of cases, which is a standard bias mitigation technique in AI systems.

Exam trap

CompTIA often tests the misconception that changing the model architecture (e.g., switching to a decision tree) or manually tweaking feature weights can fix bias, when the real solution lies in auditing and rebalancing the training data.

How to eliminate wrong answers

Option A is wrong because using a different dataset from a similar hospital without checking demographics merely shifts the problem; it does not guarantee balanced representation and may introduce new biases. Option B is wrong because manually increasing the weight of age-related features is a form of ad hoc feature engineering that can overcorrect and introduce new biases, and it does not address the underlying data imbalance. Option C is wrong because replacing the neural network with a decision tree does not inherently fix bias; the decision tree will still learn from the same biased data, and its simpler logic does not prevent it from replicating the skewed patterns.

Full explanation →

114

MCQhard

A self-driving car company is testing an AI model for pedestrian detection. During simulation, the model fails to detect pedestrians in low-light conditions. The safety team wants to improve robustness without retraining the entire model from scratch. Which approach is most appropriate?

A.Replace the convolutional layers with transformer layers to improve attention.

B.Apply data augmentation techniques to simulate low-light conditions in the training dataset.

C.Use adversarial training to add imperceptible perturbations to training images.

D.Increase the model's depth by adding more convolutional layers.

AnswerB

Data augmentation can expand the training data to include low-light scenarios, improving robustness without full retraining.

Why this answer

Option B is correct because data augmentation techniques, such as adjusting brightness, contrast, and adding noise, can synthetically create low-light training examples from existing data. This improves the model's robustness to low-light conditions without requiring a full retraining from scratch, as it directly addresses the distribution shift in the input data.

Exam trap

CompTIA often tests the distinction between improving robustness to natural distribution shifts (e.g., low-light) via augmentation versus defending against adversarial perturbations, causing candidates to mistakenly choose adversarial training for non-adversarial scenarios.

How to eliminate wrong answers

Option A is wrong because replacing convolutional layers with transformer layers would require significant architectural changes and likely full retraining, not a lightweight fix, and transformers are not inherently more robust to low-light conditions without specific training. Option C is wrong because adversarial training focuses on imperceptible perturbations that cause misclassification, which is a different problem (adversarial attacks) than natural low-light degradation; it does not simulate the global brightness reduction or noise patterns of low-light environments. Option D is wrong because increasing model depth by adding more convolutional layers does not address the specific data distribution shift (low-light) and may lead to overfitting or vanishing gradients without corresponding training data changes.

Full explanation →

115

MCQhard

A team is designing an AI system for autonomous driving. They need to decide between an end-to-end deep learning approach versus a modular pipeline (perception, planning, control). Which is a key advantage of the modular approach?

A.It typically has lower inference latency.

B.Each module can be validated separately.

C.It handles novel scenarios better due to joint training.

D.It requires less engineering effort.

AnswerB

Correct; separability improves safety and troubleshooting.

Why this answer

Option C is correct because modular systems allow independent testing and debugging of each component. Option A (end-to-end simplicity) is not true; modular is more complex. Option B (lower latency) is not inherent.

Option D (end-to-end can be more robust to novel situations) is debated but modular offers better interpretability.

Full explanation →

116

MCQmedium

While training a deep neural network, the loss function fails to converge and oscillates wildly. Which adjustment is most likely to stabilize training?

A.Increase the number of hidden layers

B.Decrease the batch size

C.Reduce the learning rate

D.Use a test set

AnswerC

Lower learning rate reduces step size, stabilizing training.

Why this answer

When the loss function oscillates wildly and fails to converge, it typically indicates that the learning rate is too high, causing the optimizer to overshoot the minima. Reducing the learning rate allows the gradient descent updates to take smaller, more stable steps, which helps the loss converge smoothly. This is a fundamental hyperparameter tuning step in deep learning training.

Exam trap

CompTIA often tests the misconception that increasing model complexity (more layers) or using more data (test set) directly fixes training instability, when in fact the learning rate is the primary culprit for oscillation and non-convergence.

How to eliminate wrong answers

Option A is wrong because increasing the number of hidden layers adds more parameters and non-linearity, which can exacerbate instability and overfitting, not stabilize training. Option B is wrong because decreasing the batch size increases the variance in gradient estimates, which often leads to noisier updates and can worsen oscillation, not reduce it. Option D is wrong because using a test set is for evaluating generalization performance after training, not for stabilizing the training process itself.

Full explanation →

117

MCQhard

Refer to the exhibit. The training pod is using 2 GPUs. During training, the GPU utilization is only 30% each. What is the most likely cause?

A.The learning rate is too high

B.The image is missing CUDA libraries

C.The number of epochs is too high

D.The batch size is too small to fully utilize GPUs

AnswerD

Small batch size leads to low compute-to-overhead ratio, underutilizing GPU resources.

Why this answer

Option A is correct because a batch size of 32 is small for two GPUs, leading to underutilization as GPUs spend time on kernel launches and synchronization. Option B is incorrect because learning rate does not directly impact GPU utilization. Option C is incorrect because number of epochs does not affect utilization per step.

Option D is incorrect because missing CUDA would cause errors, not low utilization.

Full explanation →

118

MCQmedium

Refer to the exhibit. A stream processor ingests events. One event arrives with missing "user_id". What will happen?

A.The event will be stored in a dead-letter queue automatically.

B.The event will be accepted but user_id will be set to null.

C.The event will be accepted with a default user_id of 0.

D.The event will be rejected because user_id is required.

AnswerD

Validation against the required field causes rejection.

Why this answer

Option D is correct because in stream processing systems like Apache Kafka or AWS Kinesis, if a required field such as 'user_id' is missing and the schema (e.g., Avro, JSON Schema) defines it as required, the event will be rejected at ingestion. The stream processor typically validates the event against the schema; if validation fails, the event is not accepted into the stream, preventing downstream processing errors.

Exam trap

CompTIA often tests the misconception that stream processors automatically handle missing fields by setting defaults or using dead-letter queues, but the correct behavior is strict rejection when the field is required by the schema.

How to eliminate wrong answers

Option A is wrong because a dead-letter queue is not automatically used for missing required fields; it is typically configured for events that fail processing after ingestion, not for schema validation failures at ingestion time. Option B is wrong because setting user_id to null would violate the required constraint; stream processors do not automatically coerce missing required fields to null unless explicitly configured with a default value. Option C is wrong because assigning a default user_id of 0 is not standard behavior; default values must be explicitly defined in the schema, and without such definition, the event is rejected.

Full explanation →

119

Multi-Selecthard

Which THREE factors are most critical to consider when designing a continuous integration/continuous deployment (CI/CD) pipeline for machine learning?

Select 3 answers

A.Data quality and schema validation

B.A/B testing framework for comparing models

C.Automated model performance benchmarking

D.Automated unit testing of application code

E.Versioning of datasets, models, and training code

AnswersA, C, E

Data validation ensures reliable model inputs.

Why this answer

Data quality and schema validation (A) are critical because ML pipelines are highly sensitive to data drift, missing values, and schema mismatches that can silently degrade model performance. Without automated validation at the CI stage, bad data can pass through and corrupt model training or inference, leading to unreliable outputs in production.

Exam trap

CompTIA often tests the distinction between ML-specific pipeline requirements and general DevOps practices, so candidates mistakenly select generic options like unit testing (D) or A/B testing (B) instead of the ML-critical factors of data validation, model benchmarking, and versioning.

Full explanation →

120

MCQmedium

A company wants to roll out a new recommendation model to production. They decide to run an A/B test where 10% of users see the new model and 90% see the old model. After one week, the new model shows a 5% improvement in click-through rate. What is the next best action?

A.Immediately roll out the new model to 100% of users

B.Revert to the old model because the improvement is minimal

C.Run the test for another month to ensure statistical significance

D.Increase the testing percentage gradually while monitoring performance metrics and guardrails

AnswerD

This approach captures benefits while controlling risk.

Why this answer

Option D is correct because a 5% improvement observed over only one week with a 10% traffic split is insufficient to confirm statistical significance or rule out novelty effects, data drift, or seasonal bias. The recommended best practice in AI deployment is to gradually increase the testing percentage (e.g., 10% → 25% → 50% → 100%) while continuously monitoring performance metrics and guardrails (e.g., click-through rate, conversion rate, latency, and error rates) to ensure the new model generalizes safely across the full user population.

Exam trap

CompTIA often tests the misconception that a short-term observed improvement is automatically statistically significant, tempting candidates to choose immediate full rollout (Option A) or premature reversion (Option B), when the correct answer emphasizes incremental deployment with continuous monitoring.

How to eliminate wrong answers

Option A is wrong because immediately rolling out to 100% of users risks exposing the entire user base to a model that may have only shown a temporary or statistically insignificant improvement, potentially causing negative business impact if the model fails under full load or exhibits unexpected behavior. Option B is wrong because reverting to the old model based on a 'minimal' improvement is premature; a 5% uplift could be meaningful depending on the baseline, and the test should be allowed to run longer to gather sufficient data for a valid statistical conclusion. Option C is wrong because running the test for another month without adjusting the traffic split or monitoring guardrails is inefficient and may still not guarantee statistical significance if the sample size remains too small; the correct approach is to increase the traffic percentage gradually while verifying performance at each step.

Full explanation →

121

Multi-Selecteasy

A data scientist is preparing a dataset for supervised learning. Which TWO steps are essential?

Select 2 answers

A.One-hot encoding all features

B.Normalizing features

C.Labeling the data

D.Removing outliers

E.Splitting into training and test sets

AnswersC, E

Correct; supervised learning requires labeled examples.

Why this answer

Labeling the data is essential for supervised learning because the algorithm requires input-output pairs to learn a mapping function. Without labeled data, the model cannot be trained to predict outcomes, as supervised learning relies on ground-truth targets for error correction during training.

Exam trap

CompTIA often tests the distinction between mandatory preprocessing steps and optional optimizations, trapping candidates who confuse best practices (like normalization or outlier removal) with absolute requirements for supervised learning.

Full explanation →

122

MCQmedium

A team is building a regression model to predict house prices. The dataset includes numerical features (square footage, number of bedrooms) and categorical features (neighborhood, roof type). The categorical features have high cardinality (neighborhood has 200+ unique values). Which encoding strategy should the team use to avoid overfitting and maintain model interpretability?

A.Target encoding with regularization.

B.Label encoding.

C.Binary encoding.

D.One-hot encoding with feature selection.

AnswerA

Target encoding condenses categories using target mean, and regularization prevents overfitting.

Why this answer

Target encoding with regularization is the best choice because it replaces each categorical value with the mean of the target variable for that category, which captures the relationship between the category and house prices. Regularization (e.g., adding a prior or using cross-validation) shrinks the encoded values toward the global mean, preventing overfitting on rare categories (e.g., neighborhoods with only a few houses). This maintains interpretability because each encoded value directly reflects the average price impact of that category, unlike black-box embeddings.

Exam trap

CompTIA often tests the misconception that one-hot encoding is always safe for categorical variables, but here the high cardinality (200+ neighborhoods) makes one-hot encoding impractical and prone to overfitting, leading candidates to overlook target encoding with regularization as the correct solution.

How to eliminate wrong answers

Option B (Label encoding) is wrong because it assigns arbitrary integer labels to categories (e.g., neighborhood 1, 2, 3), which implies an ordinal relationship that does not exist, misleading the regression model into treating categories as ordered numeric features. Option C (Binary encoding) is wrong because while it reduces dimensionality compared to one-hot, it still creates multiple binary columns per category, which can lead to overfitting with high-cardinality features and does not directly capture the target relationship, reducing interpretability. Option D (One-hot encoding with feature selection) is wrong because one-hot encoding with 200+ neighborhoods would create over 200 dummy variables, causing extreme sparsity and high risk of overfitting even after feature selection, and feature selection methods (e.g., Lasso) may discard important rare categories, losing signal.

Full explanation →

123

MCQhard

An e-commerce company needs to update its recommendation model continuously as user preferences change. The model currently retrains from scratch every night, but the training time is too long. Which approach would reduce training time while keeping the model up-to-date?

A.Use dimensionality reduction on features.

B.Implement incremental learning using online gradient descent.

C.Switch to a simpler model.

D.Increase the batch size for retraining.

AnswerB

Online learning updates the model incrementally, avoiding full retrain.

Why this answer

Incremental learning using online gradient descent updates the model parameters with each new data point or mini-batch, avoiding the need to retrain from scratch. This approach significantly reduces training time while continuously adapting to changing user preferences, making it ideal for real-time recommendation systems.

Exam trap

CompTIA often tests the misconception that dimensionality reduction or simpler models are the primary solution for reducing training time, when in fact incremental learning directly addresses the need for continuous updates without full retraining.

How to eliminate wrong answers

Option A is wrong because dimensionality reduction reduces the number of features but does not eliminate the need to retrain the entire model from scratch each night; the training time savings are marginal and the core problem of full retraining remains. Option C is wrong because switching to a simpler model may reduce training time but typically sacrifices model accuracy and expressiveness, which is critical for capturing nuanced user preferences in recommendations. Option D is wrong because increasing the batch size for retraining can actually increase memory usage and may not reduce overall training time if the model still retrains from scratch nightly; it does not address the fundamental inefficiency of full retraining.

Full explanation →

124

Multi-Selecteasy

Which THREE are common pitfalls when operationalizing AI models? (Select THREE.)

Select 3 answers

A.Training-serving skew due to differences in data preprocessing

B.Using simpler models that are easier to debug

C.Lack of monitoring for model performance drift

D.Ignoring infrastructure scalability requirements

E.Automating the model retraining process

AnswersA, C, D

Causes model inaccuracies in production.

Why this answer

Option A is correct because training-serving skew occurs when the data preprocessing logic used during model training differs from that used during inference in production. This is a common pitfall in operationalizing AI models, as even minor discrepancies in feature engineering, normalization, or encoding can cause significant performance degradation. For example, using different libraries or versions for tokenization between training and serving pipelines directly leads to skew.

Exam trap

CompTIA often tests the distinction between operational pitfalls and best practices, so the trap here is that candidates may mistake a recommended practice (like using simpler models or automating retraining) for a pitfall, when in fact the pitfall is the lack of monitoring or ignoring scalability.

Full explanation →

125

MCQeasy

A healthcare startup is building an AI system to predict patient readmission risk. The team collects structured data from electronic health records (EHR) including age, diagnosis codes, lab results, and previous admissions. During initial training, the model achieves 95% accuracy on the validation set but only 60% accuracy on a holdout test set from a different hospital. The data scientist suspects overfitting. Which action should the team take first to improve generalization?

A.Apply L2 regularization to the model

B.Switch to a linear regression model

C.Increase the model complexity by adding more layers

D.Collect more data from the same hospital

AnswerA

Regularization penalizes large coefficients, reducing overfitting and improving generalization to new data.

Why this answer

The model's high accuracy on the validation set but poor accuracy on a holdout test set from a different hospital indicates overfitting to the training data's specific patterns, which do not generalize to new data. L2 regularization (ridge regression) adds a penalty proportional to the square of the weights, discouraging the model from fitting noise and encouraging simpler, more generalizable decision boundaries. This directly addresses overfitting by reducing variance without requiring more data or reducing model capacity too drastically.

Exam trap

CompTIA often tests the misconception that overfitting is always solved by more data, but the trap here is that collecting more data from the same source does not fix distribution shift—regularization directly penalizes model complexity to improve generalization to unseen distributions.

How to eliminate wrong answers

Option B is wrong because switching to a linear regression model would reduce model capacity, potentially underfitting the complex relationships in EHR data, and does not specifically target the overfitting caused by high variance. Option C is wrong because increasing model complexity by adding more layers would exacerbate overfitting, making the model even more sensitive to training data noise and further reducing generalization. Option D is wrong because collecting more data from the same hospital would reinforce the same distributional biases and does not address the core issue of the model failing to generalize to a different hospital's data distribution.

Full explanation →

126

MCQeasy

A company wants to deploy a chatbot that uses natural language understanding (NLU) to answer customer queries. Which AI technique is most suitable for understanding the intent of user input?

A.K-means clustering

B.Linear regression

C.Sequence-to-sequence model with attention

D.Decision tree

AnswerC

This architecture effectively models sequences and captures important parts of input via attention, ideal for understanding user intent.

Why this answer

Option C is correct because sequence-to-sequence models with attention are specifically designed to handle variable-length input sequences (like user queries) and map them to output sequences (like intent labels or responses). The attention mechanism allows the model to focus on the most relevant parts of the input when determining intent, which is critical for understanding nuanced or long user queries in NLU tasks.

Exam trap

The trap here is that candidates often confuse clustering (K-means) or simple classification (decision trees) with NLU, failing to recognize that understanding intent requires modeling sequential dependencies and context, which only sequence-to-sequence models with attention provide.

How to eliminate wrong answers

Option A is wrong because K-means clustering is an unsupervised learning algorithm used for grouping similar data points into clusters, not for understanding the intent of user input, which requires supervised learning or sequence modeling. Option B is wrong because linear regression is a regression technique for predicting continuous numerical values, not for classifying or interpreting the intent of natural language text. Option D is wrong because decision trees are simple rule-based classifiers that lack the ability to capture sequential dependencies and context in natural language, making them unsuitable for intent recognition in chatbot NLU.

Full explanation →

127

MCQmedium

An operations team monitors a classification model in production. The confusion matrix for the model shows the following values: TP=1500, FN=500, FP=600, TN=2400. Which metric should the team calculate to assess the model's ability to avoid false positives?

A.Accuracy

B.F1-score

C.Recall

D.Precision

AnswerD

Precision = TP/(TP+FP) = 1500/2100 = 0.714, directly relates to false positives.

Why this answer

Precision (TP / (TP + FP)) directly measures the proportion of positive identifications that were actually correct, making it the ideal metric to assess the model's ability to avoid false positives. With TP=1500 and FP=600, precision is 1500/(1500+600)=0.714, indicating that 71.4% of predicted positives are true positives, while 28.6% are false positives. The question explicitly asks about avoiding false positives, which is the inverse of precision's focus on the correctness of positive predictions.

Exam trap

CompTIA often tests the distinction between precision and recall by framing a question about 'avoiding false positives' or 'avoiding false negatives,' leading candidates to confuse precision with recall or to default to F1-score as a balanced metric.

How to eliminate wrong answers

Option A is wrong because Accuracy ((TP+TN)/(TP+TN+FP+FN)) measures overall correctness across all classes, not specifically the model's ability to avoid false positives; it can be high even when false positives are numerous if the negative class dominates. Option B is wrong because F1-score is the harmonic mean of precision and recall, balancing both false positives and false negatives, but it does not isolate the ability to avoid false positives as precision does. Option C is wrong because Recall (TP/(TP+FN)) measures the model's ability to find all positive samples, focusing on false negatives, not false positives.

Full explanation →

128

MCQhard

An organization has a dataset with categorical features having high cardinality (e.g., ZIP codes). They plan to use a tree-based model. Which encoding method is most appropriate?

A.Label encoding

B.One-hot encoding

C.Target encoding (mean encoding)

D.Frequency encoding

AnswerC

Target encoding maps categories to the mean target, preserving predictive information compactly.

Why this answer

Option B is correct because target encoding replaces categories with the mean target value, capturing predictive power without explosion of features. Option A is incorrect because one-hot encoding creates many sparse features, inefficient for trees. Option C is incorrect because label encoding implies ordinality, which is misleading.

Option D is incorrect because frequency encoding may lose target relationship.

Full explanation →

129

MCQhard

A medical imaging team is developing an AI model to detect tumors from CT scans. They have 10,000 labeled scans, but the labels were created by a semi-automated process with an estimated 20% error rate (mislabeled tumor vs. no tumor). The team trains a convolutional neural network (CNN) and achieves 90% accuracy on a held-out test set that was carefully validated by an expert radiologist. However, when deployed to a new hospital's patient population, the accuracy drops to 70%. The team suspects domain shift and label noise. Which strategy is most likely to improve model robustness for the new hospital?

A.Use active learning to select the most uncertain predictions from the new hospital's data, then have an expert radiologist correct those labels

B.Randomly select 1,000 scans from the new hospital and have them re-labeled by the radiologist

C.Collect 20,000 more scans with the same semi-automated labeling process

D.Reduce the CNN's number of layers and apply dropout to combat overfitting

AnswerA

Active learning targets the most informative samples, maximizing improvement per expert effort.

Why this answer

Option C is correct. Active learning helps select the most informative samples (e.g., uncertain predictions) for expert review, efficiently improving the model with limited expert effort. Option A is wrong because simply adding more noisy labels will amplify errors.

Option B is wrong because random sampling may not capture the most valuable corrections. Option D is wrong because reducing model complexity may underfit, and dropouts fine-tuning might not address label noise or domain shift.

Full explanation →

130

MCQeasy

A data scientist is training a binary classification model to detect fraudulent transactions. The dataset is highly imbalanced with 99% legitimate and 1% fraudulent. Which evaluation metric should be prioritized to assess model performance?

A.Accuracy

B.F1-score

C.Mean Squared Error

D.Log Loss

AnswerB

F1-score balances precision and recall, making it ideal for imbalanced classification.

Why this answer

Option A (Accuracy) is misleading because a model that always predicts 'legitimate' would achieve 99% accuracy but fail to detect fraud. Option C (Mean Squared Error) is for regression, not classification. Option D (Log Loss) can be used but is less interpretable for imbalanced data.

Option B (F1-score) balances precision and recall, making it ideal for imbalanced datasets.

Full explanation →

131

MCQmedium

A data engineer is preprocessing text data for sentiment analysis. Which technique preserves word order while converting text to numeric features?

A.TF-IDF

B.N-grams with frequency counts

C.Word2Vec

D.Bag-of-words

AnswerB

N-grams represent contiguous sequences of n words, thus preserving local word order.

Why this answer

Option D is correct because n-grams capture sequences of words, preserving local order. Option A and B ignore order entirely. Option C (word2vec) produces embeddings that encode semantic similarity but not exact word order.

Full explanation →

132

MCQmedium

Refer to the exhibit. An AI auditor reviews the fairness configuration. What is the purpose of this policy?

A.Ensure equal error rates across groups

B.Ensure equal positive prediction rates across groups

C.Ensure equal accuracy across groups

D.Ensure model interpretability

AnswerB

Correct; demographic parity aims for similar selection rates.

Why this answer

The policy sets a fairness constraint that requires the model's positive prediction rate (the fraction of instances predicted as the positive class) to be equal across all defined groups. This is a standard demographic parity requirement, which is implemented by adjusting the decision threshold or reweighting training data to ensure that each group receives the same proportion of positive predictions, regardless of the actual outcome distribution.

Exam trap

CompTIA often tests the distinction between demographic parity (equal positive prediction rates) and equalized odds (equal error rates), so candidates mistakenly choose 'equal error rates' when they see a fairness policy that actually enforces demographic parity.

How to eliminate wrong answers

Option A is wrong because equal error rates across groups refer to equalized odds (equal false positive and false negative rates), not equal positive prediction rates. Option C is wrong because equal accuracy across groups is a different fairness metric (accuracy parity) that does not guarantee equal positive prediction rates. Option D is wrong because model interpretability is a separate concern about understanding model decisions, not a fairness constraint on prediction rates.

Full explanation →

133

MCQhard

A large e-commerce company uses a recommendation engine trained on millions of user interactions. Recently, the marketing team noticed a sharp increase in click-through rates for a particular product category. Upon investigation, an engineer found that a competitor had injected fake user profiles that consistently clicked on their products, skewing the training data. The company needs to remediate the attack and prevent future occurrences. The team has limited time and budget. Which course of action should the company take first?

A.Identify and remove the fake user profiles from the training dataset, then retrain the model

B.Implement adversarial training to make the model robust to future poisoning attempts

C.Decrease the frequency of model retraining to limit exposure to new data

D.Add differential privacy noise to the training data to mask the injected profiles

AnswerA

This directly eliminates the poisoned data and restores model accuracy.

Why this answer

Option A is correct because removing the injected data and retraining directly removes the poison and restores model integrity. Option B is wrong because adversarial training focuses on evasion attacks, not poisoning. Option C is wrong because adding noise does not remove existing poisoned data.

Option D is wrong because reducing retraining frequency does not address the attack and could allow more damage.

Full explanation →

134

MCQhard

An AI system is being developed to diagnose diseases from medical images. The model achieves 99% accuracy on the test set, but when deployed in a different hospital, performance drops significantly. Which of the following is the MOST likely cause?

A.The model is being attacked by adversarial examples.

B.The training data does not represent the new hospital's population or imaging equipment.

C.The model is overfitted to the training data.

D.Data leakage occurred during preprocessing.

AnswerB

Correct; domain shift is a common cause of performance degradation.

Why this answer

The model's high accuracy on the test set but poor performance in a different hospital indicates a distribution shift between the training data and the deployment environment. This is a classic case of dataset shift, where the training data does not represent the new hospital's patient population or imaging equipment, leading to degraded model generalization.

Exam trap

CompTIA often tests the distinction between overfitting and dataset shift, where candidates mistakenly attribute a deployment performance drop to overfitting even when test accuracy is high, missing the real issue of distribution mismatch.

How to eliminate wrong answers

Option A is wrong because adversarial examples are deliberately crafted inputs designed to fool a model, but the scenario describes a general performance drop across all images, not targeted attacks. Option C is wrong because overfitting would cause poor performance on the test set as well, not just on deployment; here the test accuracy is high, ruling out overfitting. Option D is wrong because data leakage would inflate test accuracy artificially, but the drop in deployment is due to distribution mismatch, not leakage during preprocessing.

Full explanation →

135

MCQhard

An AI system for autonomous vehicles uses reinforcement learning (RL) to navigate. The reward function encourages reaching the destination quickly but penalizes collisions heavily. The agent learns to drive aggressively, causing minor accidents. Which modification to the reward function would best align the agent's behavior with desired safe driving?

A.Increase the collision penalty to a very large negative value.

B.Remove the time-based reward and only reward reaching the destination.

C.Use a potential-based reward shaping to encourage progress toward destination.

D.Add a penalty term for high acceleration and jerky movements.

AnswerD

Penalizing aggressive actions directly encourages smooth driving.

Why this answer

Option D is correct because adding a penalty for high acceleration and jerky movements directly addresses the root cause of the aggressive driving behavior—smoothness and safety—without undermining the primary goal of reaching the destination. This modification shapes the reward function to penalize unsafe driving patterns, aligning the agent's learned policy with desired safe navigation while preserving the time-based incentive for efficiency.

Exam trap

CompTIA often tests the misconception that simply increasing the penalty for collisions (option A) is sufficient to ensure safe driving, when in reality it can lead to reward hacking or overly conservative policies, and the correct solution requires shaping the reward to penalize the specific unsafe behaviors (e.g., high acceleration) that cause accidents.

How to eliminate wrong answers

Option A is wrong because simply increasing the collision penalty to a very large negative value may cause the agent to become overly cautious, potentially leading to freezing behavior or failure to navigate effectively, and does not address the underlying aggressive driving patterns that cause minor accidents. Option B is wrong because removing the time-based reward eliminates the incentive for efficiency, which could result in the agent taking excessively long routes or failing to prioritize timely arrival, thus not aligning with the desired safe driving behavior. Option C is wrong because potential-based reward shaping encourages progress toward the destination but does not penalize aggressive maneuvers; it may still allow the agent to drive aggressively as long as it makes progress, failing to mitigate the unsafe driving patterns.

Full explanation →

136

MCQeasy

A team is implementing a machine learning pipeline to classify images for a defect detection system. They are considering using a pre-trained convolutional neural network (CNN) and fine-tuning it on their small dataset. What is the primary advantage of transfer learning in this scenario?

A.It ensures the model is not biased toward the original dataset

B.It eliminates the need for data preprocessing

C.It allows the model to leverage learned features from a large dataset, reducing training time and required data

D.It reduces the risk of overfitting by using a larger model

AnswerC

Transfer learning uses features from a large dataset, so fine-tuning requires less data and time.

Why this answer

Option C is correct because transfer learning leverages features learned from a large dataset, enabling effective training with a small dataset and reducing training time. Option A is incorrect because pre-trained models are often smaller, not larger. Option B is incorrect because preprocessing is still needed.

Option D is incorrect because the model may retain biases from the original dataset.

Full explanation →

137

Multi-Selectmedium

A team is deploying an AI model for credit approval. Which TWO ethical considerations must be addressed?

Select 2 answers

A.Training speed

B.Model interpretability

C.Model accuracy

D.Model fairness to avoid bias

E.Model size

AnswersB, D

Correct; interpretability helps ensure transparency and accountability.

Why this answer

Model interpretability (B) is essential for credit approval because financial decisions must be explainable to regulators and customers under laws like GDPR or ECOA. A black-box model that cannot justify why a loan was denied violates compliance requirements, making interpretability a core ethical and legal necessity.

Exam trap

CompTIA often tests the distinction between ethical requirements (interpretability, fairness) and technical performance metrics (accuracy, speed, size), leading candidates to mistakenly select accuracy as an ethical consideration.

Full explanation →

138

MCQhard

A financial institution deploys an AI credit scoring model. After six months, the model's performance drops significantly. Analysis shows that the relationship between features and labels has changed. Which term describes this phenomenon?

A.Concept drift

B.Model decay

C.Overfitting

D.Data drift

AnswerA

Concept drift directly refers to changes in the relation between inputs and outputs.

Why this answer

Concept drift occurs when the statistical relationship between input features and the target label changes over time, which is exactly what happened when the credit scoring model's performance dropped due to a shift in the feature-label relationship. This is distinct from data drift, which only involves changes in the input data distribution without affecting the label mapping.

Exam trap

CompTIA often tests the distinction between concept drift and data drift, and the trap here is that candidates confuse a change in input data distribution (data drift) with a change in the underlying relationship between features and labels (concept drift), leading them to incorrectly select data drift.

How to eliminate wrong answers

Option B (Model decay) is wrong because model decay is a general term for performance degradation over time, but it does not specifically describe a change in the feature-label relationship; it could be caused by data drift, concept drift, or other factors. Option C (Overfitting) is wrong because overfitting refers to a model learning noise or specific patterns in the training data that do not generalize, not a post-deployment shift in the underlying relationship. Option D (Data drift) is wrong because data drift only describes changes in the distribution of input features (e.g., customer income shifts), not a change in the mapping from features to the target label (e.g., what constitutes a good credit risk).

Full explanation →

139

MCQeasy

Refer to the exhibit. A security auditor identifies a critical vulnerability that could allow an attacker to manipulate model inputs to cause misclassification. Which configuration setting is most directly responsible for this vulnerability?

A.enable_input_sanitization = true

B.audit_level = basic

C.enable_adversarial_defense = false

D.pii_detection = enabled

AnswerC

Disabling adversarial defense leaves the model vulnerable.

Why this answer

Option C (enable_adversarial_defense = false) is correct because adversarial defense specifically prevents input manipulation. Option A (enable_input_sanitization = true) helps but is not specific. Option B (audit_level = basic) is for logging.

Option D (pii_detection = enabled) is for privacy.

Full explanation →

140

MCQeasy

A retail company wants to build a model to predict customer churn based on purchase history and demographics. The dataset includes categorical features like region and gender, and numerical features like total spend. What is the best initial step before training the model?

A.Train a deep neural network directly on raw data

B.One-hot encode categorical variables and normalize numerical variables

C.Remove all categorical features to simplify the model

D.Perform principal component analysis (PCA) on all features

AnswerB

This is the correct initial step to prepare the data for most machine learning models.

Why this answer

One-hot encoding categorical variables and normalizing numerical variables is standard preprocessing to convert categorical data into numeric format and scale features, which many algorithms require for optimal performance.

Full explanation →

141

Multi-Selecthard

An ML operations team needs to monitor a deployed model's performance. Which TWO metrics are most useful for detecting concept drift in a regression model? (Choose two.)

Select 2 answers

A.Distribution of input features

B.Distribution of residuals between predictions and actuals

C.Classification accuracy

D.Model inference latency

E.Mean absolute error (MAE) over a sliding time window

AnswersB, E

Changing residual distribution can indicate concept drift.

Why this answer

Option B is correct because monitoring the distribution of residuals (predicted vs. actual values) directly reveals when the relationship between inputs and outputs has shifted, which is the essence of concept drift. In a regression model, if the residuals become systematically biased or their variance changes over time, it indicates that the underlying data-generating process has changed, even if input feature distributions remain stable.

Exam trap

CompTIA often tests the distinction between covariate drift and concept drift, trapping candidates who think monitoring input features is sufficient for detecting all types of model degradation.

Full explanation →

142

MCQhard

An AI engineer trains a deep learning model for image classification. After training, the training accuracy is 99% but validation accuracy is 85%. Which technique would best address this discrepancy?

A.Increase data augmentation

B.Decrease the learning rate

C.Increase the number of layers

D.Add dropout layers

AnswerD

Dropout reduces overfitting by preventing co-adaptation of neurons.

Why this answer

The high training accuracy (99%) and lower validation accuracy (85%) indicate overfitting, where the model memorizes training data but fails to generalize. Dropout layers randomly deactivate neurons during training, forcing the network to learn more robust features and reducing overfitting. This technique directly addresses the discrepancy by improving validation performance without sacrificing training capacity.

Exam trap

CompTIA often tests the distinction between techniques that address overfitting (like dropout) versus those that improve convergence (like learning rate adjustment) or model capacity (like adding layers), trapping candidates who confuse regularization with optimization.

How to eliminate wrong answers

Option A is wrong because increasing data augmentation can help reduce overfitting by creating more varied training samples, but it is not the best technique here as it may not sufficiently address the already severe overfitting and could introduce noise; dropout is a more direct regularization method. Option B is wrong because decreasing the learning rate addresses convergence issues (e.g., slow training or oscillation) but does not directly combat overfitting; it may even worsen the gap if the model continues to memorize. Option C is wrong because increasing the number of layers adds more parameters, which typically exacerbates overfitting by increasing model capacity, making the discrepancy worse.

Full explanation →

143

MCQhard

An ML team uses the model registry above. After deploying version 3 to production, they discover it has a critical bug. What is the fastest way to roll back to a stable version without re-deploying from scratch?

A.Promote version 2 from Staging to Production

B.Redeploy version 1 by updating the production deployment to use its artifact

C.Retrain the model with corrected data

D.Delete version 3 from the registry

AnswerB

Version 1 is still in Production and can be quickly redeployed.

Why this answer

Option B is correct because it directly updates the production deployment to reference the artifact of version 1, which is a stable, previously validated version. This avoids the overhead of re-deploying from scratch by simply pointing the existing deployment to a known good artifact in the model registry, leveraging the registry's artifact storage and deployment integration.

Exam trap

CompTIA often tests the misconception that deleting a model version or promoting a staging version is the fastest rollback, when in reality the fastest method is to update the existing deployment's artifact reference to a known stable version.

How to eliminate wrong answers

Option A is wrong because promoting version 2 from Staging to Production would require a new deployment process (e.g., updating the deployment configuration or triggering a CI/CD pipeline), which is not the fastest rollback method; it also assumes version 2 is stable, which may not be the case if it was never validated in production. Option C is wrong because retraining the model with corrected data is a time-consuming process that involves data preparation, training, and validation, and does not address the immediate need to roll back to a stable version. Option D is wrong because deleting version 3 from the registry does not affect the running production deployment; the deployment continues to use the artifact of version 3 until the deployment configuration is explicitly updated to reference a different version.

Full explanation →

144

Multi-Selecthard

Which THREE components are essential for implementing a successful MLOps pipeline for a continuously deployed AI system?

Select 3 answers

A.Manual approval gates for each deployment

B.Canary deployment strategy

C.Model registry for version control and metadata management

D.Automated testing and validation of models and pipelines

E.Data and model versioning

AnswersC, D, E

Registry is critical for tracking and managing model versions.

Why this answer

A model registry (C) is essential for MLOps because it provides version control, metadata management, and lineage tracking for all trained models. This enables reproducibility, auditability, and seamless rollback in a continuously deployed AI system, ensuring that only validated models are promoted to production.

Exam trap

CompTIA often tests the distinction between operational strategies (like canary deployments) and foundational pipeline components (like versioning and registries), leading candidates to confuse deployment tactics with essential infrastructure.

Full explanation →

145

MCQmedium

An organization wants to automate the detection of defective products on an assembly line using computer vision. They have a limited number of labeled images for defective items. Which approach would be most effective?

A.Use a support vector machine with handcrafted features

B.Train a convolutional neural network from scratch on the limited data

C.Synthesize additional defective images using GANs

D.Use transfer learning with a pre-trained model like ResNet and fine-tune on the defect data

AnswerD

Transfer learning leverages knowledge from large datasets and fine-tunes on small data.

Why this answer

Option A (Train CNN from scratch) requires large datasets. Option C (SVM with handcrafted features) is less effective for image data. Option D (GAN synthesis) is complex and may not guarantee improvement.

Option B (Transfer learning) leverages pre-trained models and fine-tuning, ideal for small datasets.

Full explanation →

146

MCQeasy

Based on the exhibit, which action is most likely to resolve the memory issue?

A.Add more training data.

B.Increase the learning rate.

C.Switch to a CPU.

D.Reduce the batch size.

AnswerD

Smaller batches reduce the memory allocated for intermediate tensors.

Why this answer

The exhibit shows an out-of-memory (OOM) error during training. Reducing the batch size decreases the memory footprint per iteration, allowing the model to fit within available GPU memory. This directly resolves the memory issue without altering the model architecture or data.

Exam trap

CompTIA often tests the misconception that memory errors are solved by adding more data or changing hardware, when in fact the simplest and most common fix is adjusting the batch size to fit within available GPU memory.

How to eliminate wrong answers

Option A is wrong because adding more training data increases the dataset size, which does not reduce per-batch memory consumption and may even exacerbate memory pressure during data loading. Option B is wrong because increasing the learning rate affects convergence behavior and gradient magnitudes, not memory usage; it can cause instability or divergence but does not free GPU memory. Option C is wrong because switching to a CPU would typically use system RAM instead of GPU memory, but CPUs are far slower for deep learning training and do not resolve the underlying memory constraint—they just shift the bottleneck, often making training impractically slow.

Full explanation →

147

MCQhard

Refer to the exhibit. A data scientist is training a binary classifier. Based on the training log, which problem is the model experiencing?

A.Underfitting

B.Data leakage

C.Overfitting

D.Vanishing gradient

AnswerC

Training loss decreases while validation loss increases, a classic sign of overfitting.

Why this answer

The training log shows that the model's training accuracy continues to improve while the validation accuracy plateaus or degrades after a certain number of epochs. This divergence between training and validation performance is the hallmark of overfitting, where the model memorizes the training data noise rather than learning generalizable patterns.

Exam trap

CompTIA often tests the distinction between overfitting and underfitting by showing a training log where training accuracy is high but validation accuracy is low, leading candidates to mistakenly think the model is underfitting because validation performance is poor.

How to eliminate wrong answers

Option A is wrong because underfitting would show poor performance on both training and validation sets, not the divergence seen here. Option B is wrong because data leakage typically causes unrealistically high performance on both sets or sudden jumps in metrics, not a gradual divergence after convergence. Option D is wrong because vanishing gradient affects deep networks by causing gradients to approach zero, preventing weight updates and stalling training, which would manifest as flat loss curves, not the overfitting pattern observed.

Full explanation →

148

MCQeasy

An AI development team is building a system to detect fraudulent transactions. They want to ensure the model complies with regulations requiring that individuals can question automated decisions. Which governance element is most relevant?

A.Right to explanation

B.Model versioning

C.Differential privacy

D.Data minimization

AnswerA

Right to explanation allows individuals to question and understand automated decisions.

Why this answer

The right to explanation is a key element of AI governance, requiring that individuals be given a reason for automated decisions. This directly relates to the scenario.

Full explanation →

149

MCQhard

A company develops an AI model that recommends job candidates. The model inadvertently discriminates against a protected group. Which approach is most effective for mitigating this bias?

A.Remove the protected attribute from the training data

B.Use a fairness-aware machine learning algorithm

C.Analyze model predictions after deployment

D.Collect more training data from the protected group

AnswerB

Fairness-aware algorithms incorporate constraints to reduce disparate impact.

Why this answer

Option B is correct because fairness-aware machine learning algorithms explicitly incorporate fairness constraints or objectives during model training, directly addressing and mitigating bias against protected groups. Unlike simple removal of protected attributes, these algorithms can detect and correct for proxy discrimination and disparate impact, ensuring the model's recommendations are equitable by design.

Exam trap

CompTIA often tests the misconception that removing a protected attribute from training data is sufficient to eliminate bias, but the trap is that models can still discriminate through correlated proxy features, making fairness-aware algorithms necessary.

How to eliminate wrong answers

Option A is wrong because simply removing the protected attribute from training data does not eliminate bias; the model can still learn proxies for that attribute from correlated features (e.g., zip code correlating with race), leading to indirect discrimination. Option C is wrong because analyzing model predictions after deployment is a detection step, not a mitigation approach; it can identify bias but does not prevent or correct it in the model's behavior. Option D is wrong because collecting more training data from the protected group does not inherently address bias; it may even amplify existing disparities if the data collection process or underlying societal biases remain unchanged, and it does not adjust the model's learning process to ensure fairness.

Full explanation →

150

MCQeasy

A hospital wants to deploy a machine learning model to predict patient readmission risk within 30 days. They have a dataset with 10,000 records, 70 features including demographics, lab results, and past admissions. The target variable is binary (readmitted or not). The data scientist trains a logistic regression model and achieves an AUC of 0.85 on the test set. However, the hospital's clinicians require interpretability of predictions to trust the model. Which action should the data scientist take to ensure the model meets the interpretability requirement while maintaining performance?

A.Reduce the number of features to 10 using PCA and retrain the logistic regression

B.Replace logistic regression with a random forest model and use feature importance plots

C.Train a deep neural network and apply LIME or SHAP for explanations

D.Use the logistic regression model as is, since it is inherently interpretable with coefficients

AnswerD

Logistic regression coefficients provide direct interpretability for each feature.

Why this answer

Option A (random forest) offers feature importance but is less interpretable. Option C (deep neural network with LIME/SHAP) adds complexity and may reduce transparency. Option D (PCA and retrain) loses information and may degrade performance.

Option B (keep logistic regression) provides inherent interpretability through coefficients, meeting requirements without sacrificing performance.

Full explanation →

Page 2 of 7

All pages

Practice AI0-001 by domain

Target a specific domain to shore up weak areas.

AI Concepts and Foundations Machine Learning and Deep Learning AI Models and Data Engineering AI Implementation and Operations AI Security, Ethics and Governance

See all domains with question counts →