Knowledge + Practice

CompTIA AI+ AI0-001 (AI0-001) — Questions 1–75

500 questions total · 7pages · All types, answers revealed

Take a mock exam Exam hub

Page 1 of 7

1

MCQeasy

A data scientist is training a binary classification model to detect fraudulent transactions. The dataset has 99% legitimate transactions and 1% fraudulent. The model achieves 99% accuracy but fails to catch most fraud. Which metric should the team prioritize to evaluate model performance?

A.F1 score

B.Precision

C.Accuracy

D.Recall

AnswerD

Recall measures the ability to catch fraudulent transactions, which is the primary goal.

Why this answer

Recall (sensitivity) measures the proportion of actual positive cases (fraud) correctly identified. With 99% accuracy but failing to catch most fraud, the model is biased toward the majority class (legitimate transactions), so recall is the critical metric to ensure fraud detection improves.

Exam trap

CompTIA often tests the misconception that high accuracy implies good model performance, especially in imbalanced datasets, leading candidates to overlook recall as the appropriate metric for minority class detection.

How to eliminate wrong answers

Option A is wrong because F1 score is the harmonic mean of precision and recall; while useful, it does not isolate the model's ability to catch fraud, and in this imbalanced dataset, a high F1 could still mask poor recall if precision is high. Option B is wrong because precision measures how many predicted frauds are actually fraud, but the model's failure to catch most fraud means recall is the primary concern, not the false positive rate. Option C is wrong because accuracy is misleading in imbalanced datasets; 99% accuracy can be achieved by simply predicting 'legitimate' for all transactions, which explains why the model fails to detect fraud.

Full explanation →

2

Multi-Selectmedium

A team is designing a deep learning pipeline for a computer vision task. They want to reduce overfitting. Which two techniques are specifically effective for this purpose? (Select TWO.)

Select 2 answers

A.Dropout

B.Using a smaller batch size

C.Adding more layers

D.L2 weight regularization

E.Increasing the learning rate

AnswersA, D

Dropout randomly deactivates neurons, reducing overfitting by preventing reliance on specific features.

Why this answer

Options A and B are correct. Dropout randomly drops neurons during training, preventing co-adaptation. L2 regularization adds a penalty on weights, discouraging complexity.

Option C, increasing learning rate, can hinder convergence. Option D, adding more layers, typically increases overfitting. Option E, smaller batch size, can have a regularizing effect but is not as direct or commonly cited as the primary techniques.

Full explanation →

3

Multi-Selectmedium

When evaluating a binary classification model, which two metrics are most appropriate for imbalanced datasets? (Choose two.)

Select 2 answers

A.Accuracy

B.Mean absolute error

C.Recall

D.R-squared

E.Precision

AnswersC, E

Recall measures the proportion of actual positives correctly identified, essential for capturing minority class.

Why this answer

Recall (Option C) is correct because it measures the proportion of actual positive cases correctly identified, which is critical in imbalanced datasets where the minority class is of primary interest. Precision (Option E) is correct because it measures the accuracy of positive predictions, helping to avoid false positives when the positive class is rare. Together, recall and precision provide a balanced view of model performance on the minority class, unlike accuracy which can be misleadingly high by simply predicting the majority class.

Exam trap

CompTIA often tests the misconception that accuracy is always the best metric, but the trap here is that accuracy fails on imbalanced datasets, and candidates must recognize that recall and precision are the appropriate pair for evaluating minority class performance.

Full explanation →

4

MCQhard

A data scientist trains a deep learning model on a large dataset. The training loss decreases steadily but the validation loss starts increasing after 20 epochs. The scientist uses early stopping with patience=5. Which of the following is the MOST likely cause and best corrective action?

A.Model is overfitting; add dropout regularization.

B.Training data is not representative; collect more data.

C.Model is underfitting; increase model capacity.

D.Learning rate too high; reduce learning rate.

AnswerA

Diverging validation loss after training loss decrease is classic overfitting; dropout helps.

Why this answer

The training loss decreasing while validation loss increasing after 20 epochs is a classic sign of overfitting, where the model memorizes training data noise instead of generalizing. Early stopping with patience=5 would halt training after 5 epochs of no validation improvement, but the root cause is overfitting. Adding dropout regularization randomly drops neurons during training, forcing the network to learn more robust features and reducing overfitting.

Exam trap

CompTIA often tests the distinction between overfitting and underfitting by showing a diverging validation loss curve, and the trap here is that candidates may confuse overfitting with a learning rate issue or data quality problem, leading them to choose 'reduce learning rate' or 'collect more data' instead of the correct regularization technique.

How to eliminate wrong answers

Option B is wrong because the validation loss increasing while training loss decreases indicates overfitting, not unrepresentative data; collecting more data might help but is not the most direct corrective action for overfitting. Option C is wrong because underfitting would show high training loss that does not decrease, not a decreasing training loss with increasing validation loss. Option D is wrong because a high learning rate would typically cause training loss to oscillate or diverge, not steadily decrease; reducing learning rate addresses convergence issues, not overfitting.

Full explanation →

5

MCQeasy

A healthcare organization uses an AI model to recommend treatment plans. The model was trained on data from a single hospital, and now treats patients from multiple demographics. Which ethical concern is most critical?

A.Accountability for treatment outcomes

B.Lack of transparency in model decisions

C.Privacy violations in training data

D.Fairness and bias in predictions

AnswerD

The model trained on a single hospital's data may not generalize, leading to unfair treatment recommendations for other demographics.

Why this answer

Training on homogeneous data leads to biased predictions for underrepresented groups, raising fairness concerns. While transparency and explainability are important, fairness is the primary issue due to demographic shift. Privacy and accountability are secondary in this context.

Full explanation →

6

Multi-Selecteasy

A data scientist is preparing a dataset for a binary classification neural network. The dataset contains both numerical and categorical features, and some rows have identical entries. Which TWO preprocessing steps are most essential to improve model performance and avoid overfitting?

Select 2 answers

A.Removing duplicate records

B.Scaling numerical features to have zero mean and unit variance

C.Increasing the batch size

D.Applying PCA for dimensionality reduction

E.Using dropout regularization in the model

AnswersA, B

Duplicate records can cause the model to overfit to repeated patterns.

Why this answer

Removing duplicate records prevents the model from being biased toward repeated instances. Scaling numerical features to zero mean and unit variance ensures that features with larger ranges do not dominate the gradient updates, which is especially important for neural networks. Increasing batch size and dropout regularization are hyperparameter choices, not preprocessing steps, and PCA is not always essential.

Full explanation →

7

MCQeasy

A data engineer is splitting a dataset into training, validation, and test sets for a machine learning project. The dataset is large and representative of the population. Which split ratio is commonly recommended?

A.90% training, 5% validation, 5% test

B.70% training, 20% validation, 10% test

C.50% training, 25% validation, 25% test

D.80% training, 10% validation, 10% test

AnswerD

This is a standard split, providing ample training data and reliable validation and test sets.

Why this answer

Option B is correct because 80/10/10 is a typical split for large datasets, ensuring enough training data while having separate validation and test sets. Option A is wrong because 50/25/25 has too little training data. Option C is wrong because 90/5/5 gives too little validation data.

Option D is wrong because 70/20/10 is also reasonable but 80/10/10 is more common for large datasets; however, the question asks for 'commonly recommended' and among the options, 80/10/10 is standard.

Full explanation →

8

MCQhard

Refer to the exhibit. A deep learning model is being trained. Based on the training log, which problem is most evident?

A.Vanishing gradients

B.Overfitting

C.Underfitting

D.Data leakage

AnswerB

Training loss decreases, validation loss increases.

Why this answer

The training log shows that the training loss continues to decrease while the validation loss increases after a certain epoch, which is a classic sign of overfitting. The model is memorizing the training data rather than learning generalizable patterns, leading to poor performance on unseen data.

Exam trap

CompTIA often tests the distinction between overfitting and underfitting by showing loss curves where training loss decreases but validation loss increases, which candidates may misinterpret as a normal training progression or as vanishing gradients.

How to eliminate wrong answers

Option A is wrong because vanishing gradients typically manifest as stagnant or very slow learning across both training and validation metrics, not as diverging loss curves. Option C is wrong because underfitting would show high training loss and high validation loss without improvement, not a decreasing training loss with an increasing validation loss. Option D is wrong because data leakage usually causes unusually high performance on both training and validation sets from the start, not a divergence after initial improvement.

Full explanation →

9

Multi-Selecteasy

A data scientist is cleaning a dataset. Which TWO actions are appropriate for handling missing data?

Select 2 answers

A.Ignore missing values and train the model directly.

B.Use a predictive model to estimate missing values.

C.Impute missing values with the mean of the entire dataset.

D.Delete rows with missing values if the missing rate is low.

E.Replace missing values with the most frequent value always.

AnswersB, D

Predictive imputation uses relationships in data, a valid advanced method.

Why this answer

Option B is correct because using a predictive model to estimate missing values is a sophisticated imputation technique that leverages relationships between features to fill gaps, preserving data integrity and avoiding bias. This approach is particularly useful when data is not missing completely at random, as it can capture complex patterns that simpler methods miss.

Exam trap

CompTIA often tests the misconception that simple imputation methods like mean or mode are always safe, when in fact they can introduce bias and distort the dataset, making predictive imputation or deletion of rows with low missing rates more appropriate depending on the context.

Full explanation →

10

MCQeasy

A logistics company uses a machine learning model to predict delivery times based on historical data. The model was performing well, but recently it started making inaccurate predictions, especially for routes that have experienced new traffic patterns and road closures. The data engineering team receives an alert that the model's accuracy has dropped by 15% over the last week. They suspect data drift. The team has access to the original training data and a continuous stream of new data. What is the most appropriate first step for the team to take?

A.Roll back the model to the previous stable version and schedule a full audit of the data pipeline.

B.Compare the distributions of key features between the training data and the recent data to quantify data drift.

C.Immediately retrain the model using the most recent data to adapt to the new patterns.

D.Add more features to the model to capture the new traffic patterns and road closures.

AnswerB

Identifying drift by comparing distributions is the standard first step to diagnose the problem before taking corrective action.

Why this answer

Option B is correct because the first step in diagnosing a suspected data drift is to statistically compare the distributions of key features between the training data and the recent streaming data. This quantifies whether the input data distribution has changed, which directly explains the accuracy drop. Without this analysis, any corrective action (like retraining or rollback) would be premature and could mask the root cause.

Exam trap

CompTIA often tests the misconception that the immediate response to a performance drop should be retraining or rollback, rather than first diagnosing the type of drift (data drift vs. concept drift) through distribution comparison.

How to eliminate wrong answers

Option A is wrong because rolling back the model without first confirming data drift wastes time and may not address the new traffic patterns; it assumes the previous model is still valid, which is false if drift is present. Option C is wrong because immediately retraining on recent data without verifying drift could introduce bias or overfit to transient noise, and it ignores the need to first understand what changed. Option D is wrong because adding features without first analyzing drift is a blind attempt that may not solve the distribution shift and could increase model complexity unnecessarily.

Full explanation →

11

MCQeasy

An organization deploys an AI system that processes personal data of EU citizens. Which regulatory framework imposes strict requirements on automated decision-making and profiling?

A.Payment Card Industry Data Security Standard (PCI DSS)

B.General Data Protection Regulation (GDPR)

C.Health Insurance Portability and Accountability Act (HIPAA)

D.Sarbanes-Oxley Act (SOX)

AnswerB

GDPR specifically addresses automated individual decision-making and profiling.

Why this answer

Option A is correct: the GDPR provides rules on automated decision-making and profiling. Option B is wrong because HIPAA is for healthcare data in the US. Option C is wrong because PCI DSS is for payment card data.

Option D is wrong because SOX is for financial reporting.

Full explanation →

12

MCQeasy

Refer to the exhibit. What is the recall of the model?

A.0.44

B.0.80

C.0.90

D.0.99

AnswerA

Recall = 400/(400+500) = 0.4444, so 0.44.

Why this answer

Recall = TP / (TP + FN) = 400 / (400 + 500) = 400 / 900 ≈ 0.444, which rounds to 0.44.

Full explanation →

13

Multi-Selecteasy

A data scientist is evaluating a logistic regression model for binary classification on highly imbalanced data. Which TWO metrics are most appropriate to assess model performance? (Choose TWO.)

Select 2 answers

A.Accuracy

B.Recall

C.Precision

D.Mean squared error (MSE)

E.F1 score

AnswersB, C

Recall measures the proportion of actual positives correctly identified, critical for minority class performance.

Why this answer

Precision and Recall directly measure the model's ability to correctly identify positive (minority) instances and avoid false positives. Accuracy is misleading when classes are imbalanced. MSE is for regression.

F1 score combines precision and recall, but the question asks for two metrics, and precision and recall are fundamental.

Full explanation →

14

MCQhard

A financial institution uses an AI model to approve loans. The model uses features including credit score and ZIP code. During an audit, it is discovered that the model has a high false positive rate for loan default predictions in certain ZIP codes. What should the institution do to address this?

A.Remove the ZIP code feature from the model

B.Increase the decision threshold for those ZIP codes

C.Discontinue use of the model for those ZIP codes

D.Retrain the model with fairness constraints

AnswerD

Fairness constraints can reduce bias while maintaining overall performance, a more comprehensive solution.

Why this answer

The false positive disparity in certain ZIP codes likely indicates bias related to demographics. The best course is to retrain the model with fairness constraints to mitigate disparate impact while maintaining performance.

Full explanation →

15

MCQeasy

A company is developing an AI chatbot for customer service. They want to ensure the bot does not generate offensive or harmful responses. Which governance practice should be implemented first?

A.Set up a human-in-the-loop review process

B.Implement a content filter to screen responses before delivery

C.Create a usage policy for acceptable bot behavior

D.Sanitize training data to remove toxic examples

AnswerB

Content filtering immediately prevents harmful outputs from reaching users.

Why this answer

Option B is correct because a content filter acts as a real-time safety gate that screens every response generated by the AI model before it reaches the customer. This is the first line of defense against offensive or harmful outputs, as it can catch toxic language, PII leaks, or policy violations immediately, even if the underlying model has not been fully sanitized. Without such a filter, harmful responses could be delivered before any other governance measure (like human review or policy creation) can intervene.

Exam trap

CompTIA often tests the principle of 'defense in depth' and the order of implementation, where candidates mistakenly choose data sanitization (D) as the first step, overlooking that runtime controls are more immediate and practical for preventing harm in a deployed system.

How to eliminate wrong answers

Option A is wrong because a human-in-the-loop review process introduces latency and cannot scale to handle high-volume chatbot traffic; it is a secondary safeguard, not the first implementation. Option C is wrong because creating a usage policy defines acceptable behavior but does not technically prevent the model from generating offensive responses—it is a documentation step, not an enforcement mechanism. Option D is wrong because sanitizing training data is a proactive but time-consuming and imperfect process; even with clean data, large language models can still generate toxic outputs due to emergent behaviors or adversarial prompts, so a runtime filter is needed first.

Full explanation →

16

MCQmedium

Refer to the exhibit. The model is a neural network for 10-class classification. The training log shows no improvement over 5 epochs. Which of the following is the most likely root cause?

A.The batch size is too large, making gradient updates insignificant.

B.The output layer uses sigmoid activation instead of softmax.

C.The learning rate is too high, causing the loss to oscillate.

D.The model is suffering from vanishing gradients, preventing weight updates.

AnswerD

Vanishing gradients can cause no learning, leading to constant loss and random accuracy.

Why this answer

The training log shows no improvement over 5 epochs, which is a classic symptom of vanishing gradients in deep neural networks. When gradients become extremely small during backpropagation, weight updates are negligible, causing the loss to stagnate. This is especially common in deep networks with sigmoid or tanh activations, where gradients saturate in the tails of the activation function.

Exam trap

CompTIA often tests the distinction between symptoms of high learning rate (oscillation/divergence) and vanishing gradients (flat loss), so candidates mistakenly choose 'learning rate too high' when they see no improvement, but the key clue is the absence of oscillation or divergence in the loss curve.

How to eliminate wrong answers

Option A is wrong because a batch size that is too large typically leads to noisy or less effective gradient updates, but it does not cause complete stagnation; the loss would still fluctuate or decrease slowly. Option B is wrong because using sigmoid activation in the output layer for 10-class classification would produce outputs that do not sum to 1, making it unsuitable for multi-class probability estimation, but it would not prevent the loss from changing entirely—the model would still update weights, albeit incorrectly. Option C is wrong because a learning rate that is too high causes the loss to oscillate or diverge, not to remain flat with no improvement; the loss would show erratic behavior or NaN values, not a steady plateau.

Full explanation →

17

MCQeasy

A marketing team wants to segment customers into groups based on purchasing behavior without predefined categories. Which algorithm should they use?

A.K-means clustering

B.Naive Bayes classifier

C.Logistic regression

D.Support vector machine

AnswerA

K-means is an unsupervised algorithm that groups data into clusters based on similarity, perfect for segmentation.

Why this answer

K-means clustering is an unsupervised learning algorithm that groups data points into clusters based on similarity without requiring predefined labels. Since the marketing team wants to segment customers based on purchasing behavior without predefined categories, K-means is the correct choice as it discovers natural groupings in the data.

Exam trap

CompTIA often tests the distinction between supervised and unsupervised learning, and the trap here is that candidates may confuse clustering (unsupervised) with classification (supervised) algorithms, leading them to pick a classifier like Naive Bayes or logistic regression instead of K-means.

How to eliminate wrong answers

Option B (Naive Bayes classifier) is wrong because it is a supervised learning algorithm that requires labeled training data to classify instances into predefined categories, making it unsuitable for discovering unknown segments. Option C (Logistic regression) is wrong because it is a supervised learning algorithm used for binary classification tasks, not for unsupervised clustering or segmentation without predefined groups. Option D (Support vector machine) is wrong because it is a supervised learning algorithm that separates data into predefined classes using hyperplanes, not for discovering hidden patterns or groupings in unlabeled data.

Full explanation →

18

Multi-Selectmedium

Which TWO techniques are commonly used to handle missing data in a machine learning dataset? (Choose TWO.)

Select 2 answers

A.Normalization

B.Imputation with mean or median

C.Deletion of rows with missing values

D.One-hot encoding

E.Dimensionality reduction

AnswersB, C

Replacing missing values with mean/median is a common imputation method.

Why this answer

Imputation with mean or median is a standard technique for handling missing numerical data because it preserves the dataset size and avoids introducing bias from simply discarding rows. By replacing missing values with the central tendency of the observed data, the model can still learn patterns without losing information, though it may reduce variance slightly.

Exam trap

CompTIA often tests the distinction between data preprocessing techniques (like normalization and encoding) and actual missing data handling methods, so candidates mistakenly select normalization or one-hot encoding as solutions for missing values.

Full explanation →

19

Multi-Selectmedium

A natural language processing (NLP) team is building a sentiment analysis model. The raw text data contains punctuation, stop words, and URLs. Which TWO preprocessing steps are most appropriate to improve model performance? (Choose two.)

Select 2 answers

A.Remove all punctuation and URLs

B.Apply stemming to reduce words to root forms

C.Remove common stop words

D.Convert all text to lowercase

E.Tokenize the text into individual words

AnswersA, C

Punctuation and URLs are typically not useful for sentiment and add noise.

Why this answer

Options A and B are correct. Removing stop words reduces noise, and removing punctuation and URLs helps standardize text. Option C (stemming) is useful but not always necessary; Option D (tokenization) is fundamental but the question asks for specific preprocessing to improve performance; Option E (converting to lowercase) is standard but not among the best two for this scenario; however, converting to lowercase is a basic step.

Actually, typical preprocessing includes lowercasing, removing punctuation, removing stop words, and tokenization. The question asks for 'most appropriate'. Lowercasing is common but technically could be considered basic.

I'll choose A and B as the two best. But to be safe, let's adjust: Many might argue lowercasing is essential. I'll make correct options A and D? Let's think: The stem says 'improve model performance'.

Stop word removal and punctuation removal are often performed. Lowercasing is also standard. But I need exactly two.

I'll go with A (remove stop words) and B (remove punctuation and URLs) as they directly reduce noise. Lowercasing is also important but often done anyway. I'll keep A and B.

Full explanation →

20

MCQeasy

A data scientist is building a classification model to detect fraudulent transactions. The dataset is highly imbalanced with only 1% fraudulent cases. Which approach should the scientist use to evaluate model performance most effectively?

A.F1 score

B.Accuracy

C.Recall

D.Precision

AnswerA

F1 score is the harmonic mean of precision and recall, providing a balanced measure for imbalanced datasets.

Why this answer

In highly imbalanced datasets like fraud detection (1% positive class), accuracy is misleading because a model that predicts all transactions as legitimate would achieve 99% accuracy yet fail to detect any fraud. The F1 score (harmonic mean of precision and recall) is the most effective metric because it balances both false positives and false negatives, providing a single score that reflects the model's ability to correctly identify the minority class without being skewed by class imbalance.

Exam trap

CompTIA often tests the misconception that accuracy is always the best metric for classification, but in imbalanced datasets, accuracy is a trap because it does not reflect performance on the minority class, leading candidates to overlook metrics like F1 score that directly address class imbalance.

How to eliminate wrong answers

Option B (Accuracy) is wrong because it is dominated by the majority class (99% legitimate transactions), so a trivial model that never predicts fraud can still achieve 99% accuracy, masking poor fraud detection performance. Option C (Recall) is wrong because it only measures the proportion of actual fraud cases correctly identified (true positives / (true positives + false negatives)), ignoring false positives; a model that flags every transaction as fraud would have perfect recall but be unusable in practice. Option D (Precision) is wrong because it only measures the proportion of predicted fraud cases that are actually fraud (true positives / (true positives + false positives)), ignoring false negatives; a model that makes very few fraud predictions but with high precision would miss many actual frauds, which is unacceptable in fraud detection.

Full explanation →

21

MCQmedium

A machine learning team is deploying a sentiment analysis model for customer reviews. The model was trained on reviews from an e-commerce site but will be used for a social media platform. The team observes a drop in accuracy. Which concept best explains this issue?

A.Data drift

B.Concept drift

C.Bias-variance tradeoff

D.Overfitting

AnswerA

The distribution of reviews differs between e-commerce and social media.

Why this answer

Data drift occurs when the statistical properties of the input data change between the training and production environments. Here, the model was trained on e-commerce reviews but is now processing social media posts, which have different vocabulary, tone, and structure, causing a mismatch in the input distribution and leading to accuracy degradation.

Exam trap

CompTIA often tests the distinction between data drift (input distribution change) and concept drift (relationship change), and candidates mistakenly choose concept drift when the scenario describes a change in the input data source rather than a change in the underlying mapping from inputs to outputs.

How to eliminate wrong answers

Option B is wrong because concept drift refers to a change in the underlying relationship between input features and the target variable over time, not a change in the input data distribution itself. Option C is wrong because bias-variance tradeoff is a model selection concept describing the balance between underfitting and overfitting, not an explanation for performance drop due to data distribution shift. Option D is wrong because overfitting occurs when a model learns training data too well, including noise, and fails to generalize to new data from the same distribution, not to a different distribution.

Full explanation →

22

MCQhard

An AI team is deploying a predictive maintenance model for industrial equipment. The model predicts failure within a 30-day window. The cost of a false positive is 10% of the cost of a false negative. Which evaluation metric should the team prioritize?

A.F2 score (beta=2) to prioritize recall over precision.

B.Area under the ROC curve (AUC-ROC) to measure overall discrimination.

C.F1 score to balance precision and recall equally.

D.Precision to minimize false positives.

AnswerA

F2 score puts more weight on recall, aligning with the higher cost of false negatives.

Why this answer

Given asymmetric costs, F-beta with beta>1 weights recall (false negatives) higher. Precision and recall individually ignore the cost trade-off. AUC-ROC summarizes performance but does not incorporate costs; F1 gives equal weight, which is not suitable when false negatives are costlier.

Full explanation →

23

MCQmedium

A healthcare organization is deploying an AI system to analyze patient records and recommend treatment plans. To comply with data privacy regulations, what is the most important security measure to implement?

A.Enable detailed audit logging

B.Anonymize patient data before processing

C.Encrypt all data at rest and in transit

D.Implement role-based access control

AnswerB

Anonymization removes identifying information, reducing privacy risks while allowing analysis.

Why this answer

Anonymizing patient data before processing is the most important security measure because it directly addresses data privacy regulations like HIPAA and GDPR by removing personally identifiable information (PII) from the dataset. This ensures that even if a breach occurs, the data cannot be linked back to an individual, thereby minimizing compliance risk. While other measures like encryption and access control are essential, anonymization is the foundational step for lawful AI processing of sensitive health data.

Exam trap

CompTIA often tests the distinction between security controls that protect data in transit/at rest versus those that protect the data's content itself; the trap here is that candidates confuse encryption with anonymization, thinking encryption alone satisfies privacy regulations, when in fact it only protects confidentiality, not identifiability.

How to eliminate wrong answers

Option A is wrong because detailed audit logging is a detective control that records who accessed what and when, but it does not prevent exposure of PII or ensure compliance with privacy regulations like HIPAA or GDPR. Option C is wrong because encrypting data at rest (e.g., AES-256) and in transit (e.g., TLS 1.3) protects against unauthorized interception but does not remove PII from the data; if an authorized user or AI model processes encrypted data, the plaintext still contains identifiable information. Option D is wrong because role-based access control limits who can view or process data but does not alter the data itself; a user with the appropriate role can still access raw PII, violating privacy regulations if the data is used for AI training without anonymization.

Full explanation →

24

MCQmedium

Refer to the exhibit. A machine learning pipeline configuration is shown. During a deployment, the model evaluation passes with accuracy 0.86 and precision 0.79. However, the pipeline proceeds to deploy. What is the most likely reason for this behavior?

A.The precision metric is not included in the evaluation script

B.The deployment only checks the accuracy threshold for rollback condition

C.The deployment target is set to staging instead of production

D.The operator manually overrode the threshold

AnswerB

The rollback_condition only mentions accuracy, so precision threshold is ignored.

Why this answer

The pipeline configuration shows a rollback condition that only checks the accuracy metric (accuracy < 0.85). Since the model achieved accuracy 0.86, which is above the threshold, the condition is not triggered, and the pipeline proceeds to deploy regardless of the precision value. The precision metric is not part of the rollback evaluation logic in this configuration.

Exam trap

CompTIA often tests the misconception that all evaluation metrics automatically trigger rollback conditions, when in fact only metrics explicitly listed in the condition logic are checked.

How to eliminate wrong answers

Option A is wrong because the evaluation script clearly outputs precision (0.79), and the exhibit shows precision is being calculated; the issue is that the rollback condition does not reference precision. Option C is wrong because the deployment target (staging vs. production) does not affect whether a rollback condition is evaluated; the pipeline proceeds based on the condition logic, not the environment name. Option D is wrong because there is no evidence or indication in the exhibit or scenario that an operator manually overrode the threshold; the behavior is fully explained by the configured rollback condition.

Full explanation →

25

MCQeasy

Refer to the exhibit. An AI developer implements the above neural network architecture for handwritten digit recognition. The model achieves 85% training accuracy and 83% test accuracy. Which modification is most likely to improve training accuracy?

A.Increase the dropout rate to 0.7

B.Increase the number of filters in the first Conv2D layer

C.Add another dense layer before the output

D.Remove the dropout layer

AnswerD

Dropout adds regularization; removing it can increase training accuracy, especially if the model is underfitting.

Why this answer

Removing the dropout layer reduces regularization, allowing the model to fit the training data better and increase training accuracy.

Full explanation →

26

MCQhard

A retail company deploys a machine learning model to predict customer churn. The model outputs a probability between 0 and 1, and churn is predicted if probability > 0.5. After deployment, the model has a high false positive rate (many non-churning customers labeled as churn), which leads to unnecessary retention offers and increased costs. The data science team confirms the model was trained on historical data with a balanced class distribution. The business team wants to reduce false positives while maintaining a reasonable true positive rate. However, they cannot retrain the model because the original training data is no longer available. What is the best course of action to reduce false positives?

A.Retrain the model using only the most recent three months of data.

B.Increase the decision threshold to a higher value, such as 0.7.

C.Collect new labeled data and perform transfer learning from the original model.

D.Decrease the decision threshold to a lower value, such as 0.3.

AnswerB

A higher threshold requires stronger evidence for churn, thus reducing false positives.

Why this answer

Raising the decision threshold (e.g., to 0.7) will reduce false positives because only high-confidence predictions will be classified as churn. This does not require retraining or new data. Reducing the threshold would increase false positives.

Retraining is not possible without data. Collecting new data would take time and still require retraining.

Full explanation →

27

MCQmedium

An image classification model misclassifies a stop sign as a speed limit sign after a few pixels are altered. What is the most effective defense against such attacks?

A.Use a larger validation dataset

B.Reduce the input image resolution

C.Increase the model's complexity

D.Adversarial training

AnswerD

Adversarial training explicitly trains on perturbed examples to improve robustness.

Why this answer

Adversarial training incorporates adversarial examples during training, making the model more robust.

Full explanation →

28

MCQhard

A healthcare startup is deploying a machine learning model to predict patient readmission within 30 days using electronic health records (EHR). The data pipeline uses Apache Spark for preprocessing and training on an Amazon EMR cluster. The training dataset is 50 GB and composed of structured numeric and categorical features, along with unstructured clinical notes. The data scientist observes that training takes over 12 hours and frequently fails due to out-of-memory (OOM) errors, especially when processing the clinical notes via TF-IDF vectorization. The cluster has 10 nodes with 64 GB RAM each. The data engineer has already tried increasing spark.sql.shuffle.partitions to 400 and using Kryo serialization, but OOM persists. Which action should the data engineer take next to resolve the OOM errors?

A.Broadcast the TF-IDF model to all executors to avoid shuffling

B.Repartition the clinical notes data into 2000 partitions before TF-IDF

C.Add 10 more nodes to the cluster to increase total memory

D.Use a single executor with 64 GB and increase driver memory to 128 GB

AnswerB

More partitions reduce the data per executor, mitigating OOM during vectorization.

Why this answer

Option B is correct because repartitioning the clinical notes data into 2000 partitions before TF-IDF vectorization increases parallelism and reduces the memory pressure per partition. The default partition count (often based on spark.default.parallelism) is too low for 50 GB of data, causing individual partitions to exceed executor memory limits. By increasing partitions, each executor processes smaller chunks, preventing OOM errors during the memory-intensive TF-IDF stage.

Exam trap

CompTIA often tests the misconception that increasing cluster resources (nodes or memory) alone solves OOM errors, when the real fix is to optimize data partitioning and parallelism within Spark's execution model.

How to eliminate wrong answers

Option A is wrong because broadcasting the TF-IDF model does not address the root cause of OOM; the model itself is typically small, but the issue is the large volume of raw text data being processed per partition, not the model size. Option C is wrong because adding more nodes increases total cluster memory but does not fix the per-partition memory imbalance; without repartitioning, the same skewed partitions will still cause OOM on individual executors. Option D is wrong because using a single executor with 64 GB and increasing driver memory to 128 GB ignores the distributed nature of Spark; it would force all processing into one executor, causing severe memory contention and likely worse OOM, while also losing parallelism.

Full explanation →

29

MCQmedium

A company deploys a chatbot using a large language model (LLM). After launch, users report that the chatbot sometimes generates plausible but false information. This phenomenon is known as:

A.Gradient explosion

B.Overfitting

C.Concept drift

D.Hallucination

AnswerD

Correct; LLMs often produce false information convincingly.

Why this answer

Option D is correct because hallucination in LLMs refers to the generation of plausible but factually incorrect or nonsensical information. This occurs when the model's probabilistic next-token prediction produces confident-sounding outputs that deviate from training data or real-world facts, often due to insufficient grounding or training data gaps.

Exam trap

The trap here is that candidates may confuse hallucination with overfitting, thinking the model is 'making up' data due to memorization errors, but overfitting is about poor generalization to new inputs, not confident false outputs from a well-generalized model.

How to eliminate wrong answers

Option A is wrong because gradient explosion is a training instability issue in deep neural networks where gradients become excessively large, causing weight updates to diverge; it does not relate to post-deployment output inaccuracies. Option B is wrong because overfitting describes a model that memorizes training data too well, performing poorly on unseen data, not generating false information that seems plausible. Option C is wrong because concept drift refers to a change in the statistical properties of the target variable over time, requiring model retraining, not a static LLM generating false outputs.

Full explanation →

30

MCQmedium

An e-commerce company uses a gradient boosting model to forecast daily sales. Recently, the model's predictions have become less accurate, showing a significant drop in R-squared on validation data. The data scientist checks for data drift but finds no significant changes in feature distributions. The model was trained on data from the past 24 months and is retrained monthly. Upon inspecting the feature importance, the data scientist notices that the top feature 'promotion_flag' has decreased in importance over time. What is the most likely cause of the performance degradation, and what should be done?

A.The model is overfitting to historical promotions; apply more regularization

B.Concept drift has occurred; retrain the model more frequently with recent data only, or use an online learning approach

C.The model's hyperparameters need tuning; perform a grid search

D.The promotion_flag feature is leaking future information; remove it

AnswerB

Concept drift changes the relationship between features and target; frequent retraining adapts to new patterns.

Why this answer

Option A (overfitting to promotions) does not explain the drop over time. Option C (hyperparameter tuning) is unlikely to fix the temporal change. Option D (leakage) would have caused issues from the start.

Option B correctly identifies concept drift (changing relationship) and suggests retraining more frequently or using online learning to adapt.

Full explanation →

31

MCQeasy

A social media company's AI recommendation system pushes extreme content to users, causing harm. Which ethical principle is most violated?

A.Autonomy

B.Justice

C.Beneficence

D.Non-maleficence

AnswerD

Non-maleficence requires avoiding harm.

Why this answer

Option C (Non-maleficence) is correct because it means 'do no harm'. Option A (Beneficence) is about doing good, but here harm is done. Option B (Autonomy) is about user choice.

Option D (Justice) is about fairness.

Full explanation →

32

MCQmedium

A security analyst reviews the log file from an AI model server. What is the most likely cause of the crash?

A.The server ran out of memory due to high traffic

B.The model weights were corrupted during loading

C.The model encountered an unknown data type

D.A malicious input triggered a buffer overflow

AnswerD

Special characters and memory allocation error suggest an injection attack.

Why this answer

A malicious input triggering a buffer overflow is the most likely cause of the crash because AI model servers often process user-supplied data in native code (e.g., C/C++ extensions or TensorFlow ops) that lack bounds checking. An attacker can craft an input that overflows a fixed-size buffer, corrupting memory and causing a segmentation fault or denial of service. This is a well-known AI security vulnerability, distinct from generic resource exhaustion or data-type errors.

Exam trap

CompTIA often tests the misconception that AI crashes are always due to resource exhaustion or data format errors, but the trap here is that a buffer overflow from malicious input is a distinct security-specific crash vector that candidates overlook in favor of more generic operational issues.

How to eliminate wrong answers

Option A is wrong because running out of memory due to high traffic would typically cause gradual performance degradation or an out-of-memory (OOM) kill, not a sudden crash from a single request, and the log would show memory allocation failures rather than a buffer overflow. Option B is wrong because corrupted model weights during loading would usually result in a checksum mismatch or loading error at startup, not a crash during inference from a specific input. Option C is wrong because an unknown data type would typically raise a type error or exception in the model's preprocessing layer, not cause a memory corruption crash like a buffer overflow.

Full explanation →

33

MCQhard

A company uses a large language model (LLM) to generate customer support responses. They notice the model sometimes produces harmful outputs. Which implementation strategy best reduces this risk while maintaining performance?

A.Implement a keyword-based output filter

B.Use a smaller, less capable model

C.Add system prompts instructing the model to be safe

D.Fine-tune the model using reinforcement learning from human feedback

AnswerD

RLHF effectively aligns model outputs with human preferences.

Why this answer

Option D is correct because reinforcement learning from human feedback (RLHF) directly trains the model to align its outputs with human preferences for safety and helpfulness, reducing harmful outputs while preserving performance. Unlike superficial filters or prompts, RLHF adjusts the model's internal behavior through reward modeling and policy optimization, making it the most effective strategy for sustained safety improvements.

Exam trap

CompTIA often tests the misconception that simple output filtering or prompt engineering is sufficient for safety, when in fact only training-based alignment methods like RLHF can meaningfully change model behavior without sacrificing performance.

How to eliminate wrong answers

Option A is wrong because keyword-based output filters are brittle and can be bypassed by paraphrasing or context-dependent harmful content, while also risking false positives that degrade performance by blocking legitimate responses. Option B is wrong because using a smaller, less capable model reduces overall performance and may still produce harmful outputs if not specifically trained for safety, as capability and safety are not directly correlated. Option C is wrong because system prompts are easily overridden by the model's training distribution and do not provide robust, consistent safety alignment, especially against adversarial or nuanced harmful inputs.

Full explanation →

34

Multi-Selectmedium

Which TWO are key requirements for AI governance under the EU AI Act for high-risk AI systems? (Choose two.)

Select 2 answers

A.Regular performance benchmarks

B.Human oversight

C.Open-source licensing

D.Transparency and documentation

E.Mandatory use of cloud

AnswersB, D

Required for high-risk AI systems.

Why this answer

Options A (Human oversight) and C (Transparency and documentation) are correct. The EU AI Act mandates human oversight and transparency for high-risk systems. Option B (Open-source licensing) is not required.

Option D (Mandatory use of cloud) is not specified. Option E (Regular performance benchmarks) is good practice but not a specific requirement.

Full explanation →

35

Multi-Selecthard

A data scientist is evaluating a trained binary classification model. The model has high accuracy but the precision is low and recall is high. Which three actions are most appropriate to improve precision? (Choose three.)

Select 3 answers

A.Collect more training data for the minority class

B.Apply oversampling to the majority class

C.Increase the classification threshold

D.Use a different algorithm that penalizes false positives more

E.Decrease the classification threshold

AnswersA, C, D

More minority data helps the model learn better boundaries, often improving precision.

Why this answer

Increasing the classification threshold reduces false positives, using a different algorithm that penalizes false positives more, and collecting more data for the minority class can all improve precision.

Full explanation →

36

MCQeasy

A company deploys a computer vision model for quality inspection on a manufacturing line. After deployment, the model's accuracy drops from 95% to 80% over two weeks. Which action is most likely to address this issue?

A.Retrain the model using recently collected production data.

B.Increase the confidence threshold for predictions.

C.Decrease the learning rate of the training algorithm.

D.Deploy an additional ensemble of models for redundancy.

AnswerA

Retraining with current data adapts the model to new data distributions, countering drift.

Why this answer

Option B is correct because data drift is a common cause of performance degradation over time, and retraining with recent data realigns the model. Option A is wrong because increasing threshold may reduce false positives but does not address drift. Option C is wrong because adjusting learning rate is irrelevant for inference.

Option D is wrong because adding redundant models increases complexity without solving drift.

Full explanation →

37

Multi-Selecteasy

A data analyst needs to select two appropriate unsupervised learning techniques for clustering unlabeled data. (Choose two.)

Select 2 answers

A.Linear regression

B.Support vector machine

C.Hierarchical clustering

D.Decision tree

E.K-means

AnswersC, E

Hierarchical clustering is an unsupervised algorithm that builds a hierarchy of clusters.

Why this answer

K-means and hierarchical clustering are both unsupervised learning algorithms for clustering data into groups without labels.

Full explanation →

38

Multi-Selectmedium

A data scientist is building a natural language processing model to classify customer reviews as positive or negative. Which TWO preprocessing steps are most essential before tokenization? (Select two.)

Select 2 answers

A.Perform stemming or lemmatization.

B.Remove punctuation and special characters.

C.Convert all text to lowercase.

D.Remove stop words from the text.

E.Replace missing values with a placeholder.

AnswersB, C

Removing punctuation helps tokens become clean words.

Why this answer

Removing punctuation and special characters (Option B) is essential because tokenizers typically split on whitespace, so punctuation attached to words (e.g., 'great!', 'bad.') would create noisy tokens like 'great!' and 'bad.' instead of clean tokens 'great' and 'bad'. Converting all text to lowercase (Option C) ensures that words like 'Great', 'great', and 'GREAT' are all mapped to the same token, preventing the model from treating them as distinct features and reducing vocabulary size.

Exam trap

CompTIA often tests the ordering of preprocessing steps, and the trap here is that candidates mistakenly believe stemming, lemmatization, or stop word removal should be done before tokenization, when in fact tokenization must come first to split the text into tokens for those later steps to operate on.

Full explanation →

39

MCQhard

Refer to the exhibit. A batch inference job fails with the given logs. What is the most likely root cause of the failure?

A.The input data has values that exceed the model's expected range

B.The input data contains missing values that are not handled in preprocessing

C.The model was not trained to handle categorical features

D.The model version is outdated and incompatible with the current preprocessing pipeline

AnswerB

The log clearly shows a NaN value for 'age' causing an error in normalization.

Why this answer

The logs indicate a 'ValueError' or similar exception when the batch inference job attempts to process the input data. This error typically arises when the preprocessing pipeline encounters missing values (e.g., NaN or None) that it cannot handle, causing the job to fail. Option B is correct because missing values not handled in preprocessing are a common root cause for such failures, especially when the training data had no missing values but the inference data does.

Exam trap

CompTIA often tests the distinction between data quality issues (missing values) and model compatibility issues (version mismatches or feature encoding), so candidates may incorrectly choose option D because they assume a version mismatch is the cause, when the logs clearly point to a preprocessing failure.

How to eliminate wrong answers

Option A is wrong because values exceeding the model's expected range would typically cause a different error, such as a 'ValueError' about clipping or scaling, not a generic failure from missing data. Option C is wrong because the model not being trained to handle categorical features would manifest as a 'TypeError' or 'KeyError' during feature encoding, not a missing-value-related error. Option D is wrong because an outdated model version incompatible with the preprocessing pipeline would likely cause a 'ShapeError' or 'AttributeError' due to mismatched feature names or dimensions, not a missing-value error.

Full explanation →

40

Multi-Selectmedium

Which THREE practices are recommended for versioning machine learning models in a production environment?

Select 3 answers

A.Use a model registry like MLflow or DVC.

B.Store model metadata such as hyperparameters and training data hash.

C.Automate model deployment based on version tags.

D.Use Git to version model binaries.

E.Keep only the latest model to save storage.

AnswersA, B, C

Model registries provide centralized versioning and lifecycle management.

Why this answer

Option A is correct because a model registry like MLflow or DVC provides a centralized repository for tracking model versions, metadata, and lineage. This enables reproducibility, rollback, and auditability in production, which is essential for managing the lifecycle of machine learning models.

Exam trap

CompTIA often tests the misconception that Git is suitable for versioning all artifacts, including large binary model files, when in fact Git's architecture is optimized for text diffs and cannot efficiently manage model binaries in a production ML pipeline.

Full explanation →

41

MCQeasy

An AI system for fraud detection shows a gradual decline in precision over several weeks, though recall remains stable. Which type of model drift is most likely occurring?

A.Data drift

B.Covariate shift

C.Label drift

D.Concept drift

AnswerD

Concept drift alters the decision boundary, often increasing false positives while recall remains stable.

Why this answer

Precision dropping while recall stable indicates increased false positives. Concept drift (Option B) changes the underlying relationships, causing the model to misclassify more legitimate transactions as fraud. Data drift (Option A) would affect both precision and recall.

Label drift (Option C) changes ground truth definitions, also affecting both. Covariate shift (Option D) is similar to data drift.

Full explanation →

42

MCQmedium

A deep learning model for sentiment analysis uses a softmax output layer. The hidden layers currently use tanh activation. Which activation function should replace tanh to mitigate vanishing gradients in deeper networks?

A.Sigmoid

B.Softmax

C.ReLU

D.Linear

AnswerC

ReLU is non-saturating and helps mitigate vanishing gradients.

Why this answer

ReLU does not saturate for positive values, helping avoid vanishing gradient issues common with tanh and sigmoid.

Full explanation →

43

MCQhard

A team is training a deep learning model for image classification. The training loss decreases rapidly but validation loss starts increasing after a few epochs. Which regularization technique should be applied to mitigate this issue?

A.Data augmentation

B.L2 regularization

C.Early stopping

D.Dropout

AnswerC

Early stopping prevents overfitting by stopping training when validation loss starts to rise.

Why this answer

Option C is correct because early stopping halts training when validation loss increases, preventing overfitting. Option A is wrong because L2 regularization penalizes large weights but doesn't stop training. Option B is wrong because dropout randomly drops neurons during training, but early stopping directly addresses the symptom.

Option D is wrong because data augmentation increases data diversity, but the issue is overfitting due to training too long.

Full explanation →

44

MCQhard

An AI system used for autonomous driving is found to have a lower accuracy in detecting pedestrians with darker skin tones. The development team wants to address this ethical issue. Which action is most effective?

A.Conduct additional testing to measure the disparity

B.Augment the training dataset with more images of pedestrians with darker skin

C.Replace the object detection algorithm with a different one

D.Adjust the model's decision threshold for pedestrian detection

AnswerB

Diverse data helps the model learn robust features for all skin tones.

Why this answer

Option B is correct because augmenting the training dataset with more images of pedestrians with darker skin directly addresses the root cause of the bias: underrepresentation in the training data. By providing a more balanced and diverse dataset, the model can learn more robust features for all skin tones, reducing accuracy disparity without altering the algorithm's core logic or introducing arbitrary thresholds.

Exam trap

CompTIA often tests the misconception that bias can be fixed by simply changing the algorithm or threshold, when in reality the most effective first step is to address data imbalance through targeted augmentation.

How to eliminate wrong answers

Option A is wrong because additional testing only measures the disparity but does not fix it; it is a diagnostic step, not a corrective action. Option C is wrong because replacing the object detection algorithm does not guarantee improved fairness—bias often stems from training data distribution, not the algorithm itself, and a different algorithm may still exhibit similar biases if trained on the same skewed data. Option D is wrong because adjusting the decision threshold can trade off precision and recall but does not address the underlying data imbalance; it may reduce false negatives for one group at the expense of increased false positives for another, without resolving the root cause.

Full explanation →

45

MCQhard

A data scientist is training a random forest model on a large dataset and notices that the model is overfitting. Which hyperparameter adjustment is most likely to reduce overfitting?

A.Increase the maximum features

B.Decrease the maximum depth of trees

C.Decrease the minimum samples split

D.Increase the number of trees

AnswerB

Shorter trees are less complex and generalize better.

Why this answer

Decreasing the maximum depth of trees reduces model complexity, preventing overfitting.

Full explanation →

46

MCQeasy

What is the primary function of an AI ethics board within an organization?

A.Developing algorithms

B.Managing cloud infrastructure

C.Marketing AI products

D.Reviewing AI projects for ethical compliance

AnswerD

The board provides oversight and guidance on ethical matters.

Why this answer

An AI ethics board reviews AI projects to ensure they align with ethical principles and regulatory requirements.

Full explanation →

47

MCQeasy

A company wants to deploy an AI model for real-time inference on edge devices with limited computational resources. Which model architecture would be MOST suitable?

A.YOLOv4

B.MobileNet

C.ResNet-152

D.BERT

AnswerB

MobileNet uses depthwise separable convolutions to reduce computation, ideal for edge deployment.

Why this answer

MobileNet is specifically designed for mobile and edge devices using depthwise separable convolutions, which drastically reduce the number of parameters and computational cost while maintaining acceptable accuracy. This makes it the most suitable choice for real-time inference on resource-constrained edge hardware.

Exam trap

CompTIA often tests the misconception that any 'lightweight' or 'fast' model (like YOLOv4) is suitable for edge devices, ignoring the specific architectural optimizations (e.g., depthwise separable convolutions) that MobileNet uniquely provides for extreme resource constraints.

How to eliminate wrong answers

Option A is wrong because YOLOv4, while fast for object detection, is still a large convolutional network requiring significant GPU memory and compute, making it impractical for low-power edge devices. Option C is wrong because ResNet-152 is a very deep residual network with 152 layers, optimized for high accuracy on powerful hardware, not for limited-resource edge deployment. Option D is wrong because BERT is a transformer-based NLP model with hundreds of millions of parameters, requiring substantial memory and compute, and is not designed for real-time inference on edge devices.

Full explanation →

48

MCQhard

A company is building a computer vision system to detect defects in manufactured parts. They have 10,000 labeled images per class (defective and non-defective). They want to achieve high accuracy with limited computational resources. Which deep learning architecture and approach is most appropriate?

A.Train a custom CNN from scratch with many layers

B.Use a decision tree ensemble

C.Use a pre-trained VGG16 and fine-tune the last few layers

D.Use an RNN to process image sequences

AnswerC

Transfer learning with fine-tuning is efficient and effective for moderate datasets.

Why this answer

Transfer learning using a pre-trained CNN like VGG16, fine-tuning only the last few layers, leverages existing features and reduces training time and resource requirements.

Full explanation →

49

MCQmedium

A healthcare AI system misdiagnosed patients due to adversarial inputs. What security measure should be prioritized?

A.Encrypt all patient data

B.Use stronger authentication

C.Regular software updates

D.Implement adversarial training

AnswerD

Directly mitigates adversarial examples.

Why this answer

Option C (Implement adversarial training) is correct because adversarial training makes the model robust to input manipulation. Option A (Encrypt all patient data) protects data privacy but not model integrity. Option B (Use stronger authentication) is for access control.

Option D (Regular software updates) is general maintenance.

Full explanation →

50

MCQhard

A manufacturing company is using a convolutional neural network (CNN) to detect defects on an assembly line. The model was trained on a balanced dataset of defective and non-defective parts. In production, the model shows high precision (95%) but very low recall (50%). The production line manager wants to minimize missed defects (false negatives). The data scientist has access to the original training data and can retrain the model. Which strategy is most effective for increasing recall while maintaining acceptable precision?

A.Apply data augmentation to defective images

B.Lower the classification threshold for the defective class

C.Use a bagging ensemble of CNNs

D.Oversample the defective class in training

AnswerB

Lowering the threshold increases sensitivity (recall) as more instances are classified as defective, directly reducing false negatives.

Why this answer

Lowering the classification threshold for the defective class directly addresses the recall issue by allowing more samples to be classified as defective, which reduces false negatives. This is the most immediate and effective method because it does not require retraining and can be tuned to balance precision and recall based on the manager's priority of minimizing missed defects.

Exam trap

CompTIA often tests the misconception that retraining with data augmentation or oversampling is the only way to fix recall issues, when in fact threshold tuning is a simpler and more direct post-training adjustment that does not require model retraining.

How to eliminate wrong answers

Option A is wrong because data augmentation on defective images primarily helps with generalization and overfitting, not with shifting the decision boundary to increase recall; it may improve model robustness but does not directly increase the number of true positives at inference time. Option C is wrong because a bagging ensemble of CNNs reduces variance and can improve overall accuracy, but it does not specifically target the recall-precision trade-off and may even lower recall if the ensemble's voting threshold remains unchanged. Option D is wrong because oversampling the defective class in training addresses class imbalance but the model was already trained on a balanced dataset; oversampling would not solve the underlying issue of the model's conservative decision boundary, and it could lead to overfitting on defective samples without guaranteeing higher recall.

Full explanation →

51

MCQhard

You are a security engineer at a large e-commerce company that uses an AI-based recommendation system. The system is deployed on a Kubernetes cluster and uses a TensorFlow model served via REST API. Recently, the security team detected unusual API calls that caused the model to return incorrect recommendations. Analysis shows that the inputs were crafted to maximize prediction error. The team suspects an adversarial attack. You need to implement a solution that detects and mitigates such attacks in real-time without requiring model retraining. Which approach should you take?

A.Implement an input validation filter to detect and block anomalous inputs

B.Increase the number of model replicas to distribute the load

C.Retrain the model with adversarial examples

D.Roll back the model to a previous version that was not attacked

AnswerA

Input validation can identify adversarial examples based on statistical anomalies.

Why this answer

Option A is correct because an input validation filter can detect and block adversarial inputs in real-time by analyzing statistical properties (e.g., outlier detection, perturbation magnitude) without modifying the model. This approach is lightweight, operates at the API gateway level, and does not require retraining, making it suitable for immediate deployment against crafted inputs that maximize prediction error.

Exam trap

CompTIA often tests the misconception that retraining or scaling can solve security issues, but the key constraint here is 'real-time detection without retraining,' which eliminates options that require model modification or do not address the attack vector.

How to eliminate wrong answers

Option B is wrong because increasing model replicas only distributes load and improves throughput, but does not detect or block malicious inputs; adversarial attacks exploit model vulnerabilities, not resource exhaustion. Option C is wrong because retraining with adversarial examples requires model retraining, which violates the constraint of 'without requiring model retraining' and is a longer-term solution, not real-time mitigation. Option D is wrong because rolling back to a previous version does not address the root cause; the same adversarial inputs would still be effective against the older model, and the attack vector remains unmitigated.

Full explanation →

52

MCQmedium

A company uses an AI model to screen job applications. The model is trained on historical hiring data that reflects past biases. After deployment, the model disproportionately rejects candidates from certain demographics. Which concept does this best illustrate?

A.Overfitting

B.Model drift

C.Algorithmic bias

D.Underfitting

AnswerC

Correct; this describes the biased outcome due to biased data.

Why this answer

Option C is correct because algorithmic bias refers to systematic and unfair discrimination in AI outputs due to biased training data or model design. Option A (overfitting) is about a model that performs well on training data but poorly on new data due to excessive complexity. Option B (underfitting) is when a model is too simple to capture patterns.

Option D (model drift) is about performance degradation over time due to changes in data distribution.

Full explanation →

53

MCQmedium

A self-driving car uses an AI model that learns by trial and error, receiving rewards for correct actions and penalties for mistakes. This type of learning is:

A.Supervised learning

B.Unsupervised learning

C.Transfer learning

D.Reinforcement learning

AnswerD

Correct; RL uses rewards to learn optimal actions.

Why this answer

Reinforcement learning (RL) is the correct answer because the self-driving car's AI model learns through trial and error, receiving rewards for correct actions and penalties for mistakes. This feedback-driven process, where an agent interacts with an environment to maximize cumulative reward, is the defining characteristic of reinforcement learning, not supervised or unsupervised learning.

Exam trap

CompTIA often tests the distinction between reinforcement learning and supervised learning by describing a scenario with feedback (rewards/penalties) but no labeled dataset, leading candidates to mistakenly choose supervised learning because they associate 'feedback' with 'labels'.

How to eliminate wrong answers

Option A is wrong because supervised learning requires labeled input-output pairs (e.g., images tagged with 'stop sign') to train a model, not trial-and-error feedback. Option B is wrong because unsupervised learning finds hidden patterns in unlabeled data (e.g., clustering sensor readings) without any reward or penalty signals. Option C is wrong because transfer learning applies knowledge from a pre-trained model to a new but related task, not learning from scratch via rewards and punishments.

Full explanation →

54

Multi-Selecthard

Which TWO techniques are most effective for ensuring model explainability in a production loan approval AI system subject to regulatory review? (Select TWO.)

Select 2 answers

A.Replace the model with a decision tree for transparency

B.Use SHAP values to understand feature contributions

C.Rely on the model's internal attention weights (if transformer-based)

D.Apply LIME to generate local explanations for each prediction

E.Calculate global feature importance using permutation importance

AnswersB, D

SHAP provides consistent and theoretically grounded explanations.

Why this answer

Options C and D are correct. Option C is correct because LIME provides local explanations for individual predictions. Option D is correct because SHAP values quantify feature contributions globally and locally.

Option A is wrong because feature importance gives global view but not per-instance. Option B is wrong while decision trees are interpretable, they may not be the deployed model. Option E is wrong because black-box models are inherently uninterpretable without post-hoc methods.

Full explanation →

55

Multi-Selectmedium

Which TWO of the following are appropriate uses of unsupervised learning?

Select 2 answers

A.Classifying emails as spam or not spam

B.Predicting the sale price of a house given its features

C.Detecting unusual patterns in network traffic that may indicate a cyberattack

D.Identifying a person from a photo

E.Segmenting customers into groups based on purchasing behavior

AnswersC, E

Anomaly detection often uses unsupervised methods.

Why this answer

Unsupervised learning discovers hidden patterns or structures in unlabeled data. Detecting unusual patterns in network traffic (option C) is a classic anomaly detection task, often performed using clustering or autoencoders, where the model learns 'normal' behavior and flags deviations without requiring labeled attack data.

Exam trap

CompTIA often tests the distinction between supervised and unsupervised learning by presenting tasks that seem 'automatic' but actually require labeled data, tricking candidates into choosing supervised tasks as unsupervised uses.

Full explanation →

56

MCQhard

The exhibit shows the output of a drift monitoring command for a fraud detection model. The team has an automated pipeline that triggers retraining when the overall average drift score exceeds 0.10. Based on the exhibit, what should the operations team do next?

A.Force retraining on all features to ensure the model adapts to the new data distribution.

B.Manually analyze the drift in 'amount' and 'location' and investigate potential causes.

C.No action is needed because the model is performing within acceptable drift limits.

D.Initiate the automated retraining pipeline since the average drift exceeds 0.05.

AnswerB

Investigating root causes of drift helps determine if retraining or data correction is appropriate.

Why this answer

The correct action is to analyze the drift in the 'amount' and 'location' features and investigate root causes before retraining. The overall average drift is (0.12+0.08+0.03+0.02)/4 = 0.0625, which is below the threshold of 0.10, so retraining is not automatically triggered. However, individual features show significant drift, which could degrade performance.

Option C is correct because understanding why those features drifted helps decide if retraining or data correction is needed. Option A is wrong because the average drift is below threshold. Option B is wrong because ignoring drift could lead to performance degradation.

Option D is wrong because manual retraining without investigation may not address the root cause.

Full explanation →

57

MCQeasy

A data scientist is deploying a machine learning model to production. The model was trained on an imbalanced dataset. Which technique should be used during deployment to mitigate bias without retraining the model?

A.Apply post-processing calibration to adjust decision thresholds

B.Use an ensemble of models trained on balanced subsets

C.Rebalance the dataset using SMOTE before inference

D.Remove sensitive features from the input data

AnswerA

Post-processing calibration adjusts thresholds to improve fairness without retraining.

Why this answer

Post-processing calibration adjusts the decision threshold of the model to account for the class imbalance present in the training data. This technique modifies the output probabilities or classification boundary without requiring access to the original training data or retraining the model, making it suitable for deployment scenarios where the model is already fixed.

Exam trap

CompTIA often tests the distinction between techniques applied during training versus deployment, and the trap here is that candidates mistakenly choose SMOTE or ensemble methods, which require retraining, instead of recognizing that threshold adjustment is a valid post-deployment bias mitigation strategy.

How to eliminate wrong answers

Option B is wrong because using an ensemble of models trained on balanced subsets requires retraining or modifying the model architecture, which violates the constraint of not retraining the model. Option C is wrong because SMOTE (Synthetic Minority Over-sampling Technique) is a data preprocessing method applied before training to balance the dataset, not during inference; applying it at inference time would require access to the original training data and would alter the input distribution, which is not feasible or correct. Option D is wrong because simply removing sensitive features does not mitigate bias caused by imbalanced data; bias can still propagate through correlated features, and this approach does not address the class imbalance issue directly.

Full explanation →

58

MCQmedium

A company deploys a machine learning model for fraud detection. After one month, the false positive rate has increased significantly. The model is retrained weekly on all historical data. What is the MOST effective immediate action?

A.Replace the model with a simpler logistic regression model.

B.Continue retraining weekly on all historical data.

C.Adjust the classification threshold to reduce false positives.

D.Retrain the model on only the most recent 30 days of data.

AnswerD

Recent data captures current fraud patterns, reducing false positives.

Why this answer

The false positive rate increase suggests the model is reacting to a shift in the underlying data distribution (concept drift). Retraining on only the most recent 30 days of data (option D) is the most effective immediate action because it focuses the model on the current fraud patterns, discarding stale historical data that may no longer be representative. This approach directly addresses the drift by adapting the model to the latest behavior.

Exam trap

CompTIA often tests the misconception that adjusting the classification threshold is a sufficient fix for model degradation, when in reality it only trades off error types without addressing the underlying data drift that caused the false positive increase.

How to eliminate wrong answers

Option A is wrong because replacing the model with a simpler logistic regression model does not address the root cause of concept drift and may reduce predictive performance without solving the false positive issue. Option B is wrong because continuing to retrain weekly on all historical data will dilute the influence of recent patterns with outdated data, likely perpetuating the high false positive rate. Option C is wrong because adjusting the classification threshold is a post-hoc fix that reduces false positives at the cost of increasing false negatives, and it does not correct the underlying model drift or data quality issue.

Full explanation →

59

MCQmedium

A team is developing a recommendation system for an e-commerce platform. They want to use collaborative filtering but are concerned about cold-start problems for new users. Which approach would best mitigate the cold-start problem?

A.Incorporate user demographic features as side information

B.Increase the number of latent factors in matrix factorization

C.Use a popularity-based baseline for all recommendations

D.Use only item-based collaborative filtering

AnswerA

Demographic features enable recommendations for cold-start users by using profile information.

Why this answer

Option A (Use only item-based collaborative filtering) still suffers cold-start for new items. Option C (Increase latent factors) does not address cold-start. Option D (Use a popularity baseline) lacks personalization.

Option B (Incorporate user demographic features as side information) allows recommendations even for new users by leveraging profile data.

Full explanation →

60

MCQhard

A financial institution uses a deep learning model for loan approvals. Under the EU AI Act, this is considered a high-risk AI system. Which mandatory requirement must the institution fulfill before deployment?

A.Obtain certification from an ISO 27001 auditor

B.Publish the model's source code publicly

C.Register the AI system with the national data protection authority

D.Conduct a risk assessment and bias testing

AnswerD

The EU AI Act mandates a risk management system and bias audits for high-risk systems.

Why this answer

High-risk AI systems under the EU AI Act require a conformity assessment, including a risk assessment and bias testing.

Full explanation →

61

Multi-Selecthard

Which THREE are effective methods for ensuring data privacy in AI training? (Choose three.)

Select 3 answers

A.Data encryption at rest

B.Data anonymization

C.Differential privacy

D.Data replication

E.Federated learning

AnswersB, C, E

Removes personally identifiable information.

Why this answer

Options A (Differential privacy), C (Data anonymization), and D (Federated learning) are correct. Differential privacy adds noise to protect individual records, anonymization removes PII, and federated learning trains without centralizing data. Option B (Data encryption at rest) protects storage but not against model inference.

Option E (Data replication) increases risk.

Full explanation →

62

MCQeasy

A data scientist notices that a binary classification model consistently predicts the majority class. Which data engineering technique should be applied?

A.Feature scaling

B.Dimensionality reduction

C.Polynomial features

D.Oversampling

AnswerD

Oversampling (e.g., SMOTE) creates synthetic samples of the minority class to balance the dataset.

Why this answer

Oversampling (Option D) is correct because the model's bias toward the majority class indicates a class imbalance problem. By synthetically increasing the number of minority class samples (e.g., using SMOTE or random oversampling), the training data becomes more balanced, allowing the classifier to learn decision boundaries that are not skewed toward the majority class.

Exam trap

CompTIA often tests the misconception that feature scaling or dimensionality reduction can fix class imbalance, when in reality these techniques address different issues like feature magnitude or curse of dimensionality, not skewed target distributions.

How to eliminate wrong answers

Option A is wrong because feature scaling normalizes the range of input features (e.g., via min-max scaling or standardization) but does not address class imbalance; it only prevents features with larger magnitudes from dominating gradient-based optimization. Option B is wrong because dimensionality reduction (e.g., PCA or t-SNE) reduces the number of features to combat overfitting or noise, but it does not alter the class distribution, so the majority class bias remains. Option C is wrong because polynomial features create interaction or higher-degree terms from existing features to capture non-linear relationships, but they do not change the ratio of majority to minority samples, leaving the imbalance untouched.

Full explanation →

63

MCQmedium

A data scientist is building a regression model to predict house prices. The dataset contains features such as square footage, number of bedrooms, and year built. Initial model performance is poor, and the scientist suspects that feature engineering could help. Which approach is most likely to improve model accuracy?

A.Use only linear features because polynomial terms overfit

B.Remove all features except square footage to reduce noise

C.Create interaction terms such as bedrooms times square footage

D.Add random noise to the target variable to increase variance

AnswerC

Interaction terms capture combined effects of features, often improving regression models.

Why this answer

Creating interaction terms like bedrooms × square footage captures non-linear relationships and synergies between features that a linear model alone cannot represent. In real estate, the effect of square footage on price often depends on the number of bedrooms (e.g., a large house with few bedrooms may be less valuable), so interaction terms allow the model to learn these conditional patterns, directly improving predictive accuracy.

Exam trap

CompTIA often tests the misconception that adding more features always causes overfitting, when in fact carefully engineered interaction terms can reduce bias without excessive variance if regularized properly.

How to eliminate wrong answers

Option A is wrong because restricting to only linear features ignores potentially valuable non-linear patterns; polynomial terms can be regularized to avoid overfitting and are often necessary for complex relationships. Option B is wrong because removing all features except square footage discards important predictors like bedrooms and year built, which carry significant signal for house prices, thus increasing bias and reducing accuracy. Option D is wrong because adding random noise to the target variable artificially increases variance and corrupts the ground truth, making it harder for the model to learn the true underlying patterns and degrading performance.

Full explanation →

64

MCQmedium

A team is reviewing a neural network model summary. The input layer expects 784 features (e.g., 28x28 images). How many parameters does the first dense layer have?

A.100,224

B.109,258

C.100,352

D.8,256

AnswerC

Calculated as (784 * 128) + 128 = 100,352, matching the exhibit.

Why this answer

The first dense layer has 784 input features and 128 output units (a common default). Each of the 784 inputs connects to each of the 128 neurons, giving 784 * 128 = 100,352 weight parameters, plus 128 bias parameters (one per neuron), for a total of 100,480 parameters. However, the question asks for the number of parameters in the dense layer itself, and the correct answer is 100,352, which corresponds to the weight parameters only, as biases are often listed separately or the layer uses no bias.

In typical Keras summaries, the parameter count for a Dense layer with bias is (input_dim * units) + units, but here the provided correct answer matches the weight count alone, indicating the model summary excludes biases or uses a bias-less configuration.

Exam trap

CompTIA often tests whether candidates remember to include bias parameters in the total count, but here the trap is that the correct answer matches the weight-only count, leading candidates to overcount by adding biases and selecting a wrong option like 100,480 (not listed) or miscalculating the product.

How to eliminate wrong answers

Option A (100,224) is wrong because it likely results from miscalculating the product of 784 and 128 as 100,224, which is off by 128 (the bias count), or from using an incorrect input dimension. Option B (109,258) is wrong because it does not correspond to any standard calculation for a dense layer with 784 inputs and 128 outputs; it may arise from mistakenly using 854 inputs or a different layer size. Option D (8,256) is wrong because it represents only the bias parameters if there were 128 units (128 * 64 = 8,192, close but not exact) or a confusion with the number of outputs squared, not the full parameter count.

Full explanation →

65

Multi-Selecthard

Which TWO strategies are effective for handling missing values in a dataset when the missingness is not random (MNAR)?

Select 2 answers

A.Multiple imputation using chained equations

B.Treat missing as a separate category (e.g., for categorical features)

C.Listwise deletion

D.KNN imputation

E.Mean imputation

AnswersA, B

Multiple imputation can handle MNAR if the imputation model incorporates variables that predict missingness.

Why this answer

Multiple imputation using chained equations (MICE) is effective for MNAR because it models each variable with missing values as a function of other variables, iteratively generating plausible values that preserve the relationships and uncertainty in the data. This approach can account for the systematic pattern of missingness by incorporating auxiliary variables that are correlated with both the missing values and the missingness mechanism, making it robust even when missingness depends on unobserved data.

Exam trap

CompTIA often tests the misconception that mean imputation or KNN imputation are safe defaults for any missing data pattern, but the trap here is that MNAR requires methods that explicitly model the missingness mechanism, which simple imputation techniques fail to do.

Full explanation →

66

MCQhard

Refer to the exhibit. A data engineer notices that the batch processing step is taking too long and causing delays. Which change would most likely reduce the latency?

A.Increase the parallelism of the Spark job

B.Move feature engineering to the stream processing step in Flink

C.Replace Apache Flink with Apache Storm for stream processing

D.Change the output format from Parquet to CSV

AnswerB

Performing feature engineering in stream reduces batch processing time and overall latency.

Why this answer

Moving feature engineering from the batch Spark job to the stream processing Flink job reduces the workload on the batch step, making it faster. Replacing Flink, increasing parallelism, or changing output format do not address the bottleneck as effectively.

Full explanation →

67

MCQmedium

A data engineer is designing a pipeline to train a linear regression model on a dataset with 10 million rows and 50 features. The dataset fits in memory. Which approach should the engineer use to train the model efficiently?

A.Normal equation

B.Batch gradient descent

C.Principal component analysis

D.Stochastic gradient descent

AnswerD

SGD updates weights per sample, making it efficient for large datasets.

Why this answer

Stochastic gradient descent (SGD) is the most efficient approach for training a linear regression model on a dataset with 10 million rows and 50 features because it updates the model parameters using only one training example per iteration, leading to much faster convergence per epoch compared to batch methods. Since the dataset fits in memory, SGD can still be implemented efficiently without the overhead of loading data in batches from disk, and it scales well to large datasets where the normal equation or batch gradient descent would be computationally prohibitive.

Exam trap

CompTIA often tests the misconception that the normal equation is always the best for small feature sets, but the trap here is that candidates overlook the massive computational cost of the O(n * f^2) matrix multiplication when n is large (10 million rows), even though f is small (50 features).

How to eliminate wrong answers

Option A is wrong because the normal equation requires computing (X^T X)^{-1} X^T y, which involves inverting a 50x50 matrix (feasible) but also computing X^T X, which is O(n * f^2) = 10 million * 2500 = 25 billion operations, making it extremely slow and memory-intensive for 10 million rows. Option B is wrong because batch gradient descent processes the entire 10-million-row dataset in each iteration, requiring O(n * f) = 500 million operations per epoch, which is computationally expensive and converges slowly compared to SGD. Option C is wrong because principal component analysis (PCA) is a dimensionality reduction technique used for feature reduction or visualization, not a method for training a linear regression model; it does not perform parameter optimization.

Full explanation →

68

MCQhard

A data scientist trains a deep neural network for image classification. The training loss decreases but validation loss starts increasing after 50 epochs. What should the data scientist do to improve generalization?

A.Decrease batch size

B.Apply dropout and early stopping

C.Add more hidden layers

D.Increase learning rate

AnswerB

Dropout randomly ignores neurons during training to reduce overfitting, and early stops when validation loss worsens, preventing further overfitting.

Why this answer

The increasing validation loss while training loss decreases is a classic sign of overfitting. Dropout randomly deactivates neurons during training, which prevents co-adaptation and forces the network to learn more robust features. Early stopping halts training when validation performance stops improving, directly addressing the overfitting by selecting the model with the best generalization before it degrades.

Exam trap

CompTIA often tests the misconception that increasing model complexity (more layers) or adjusting batch size/learning rate can fix overfitting, when in reality these changes either exacerbate the problem or address unrelated training dynamics.

How to eliminate wrong answers

Option A is wrong because decreasing batch size introduces more noise into the gradient estimates, which can actually hurt generalization and may lead to slower convergence or instability, not a direct cure for overfitting. Option C is wrong because adding more hidden layers increases model capacity and complexity, which typically worsens overfitting by allowing the network to memorize the training data even more. Option D is wrong because increasing the learning rate can cause the optimizer to overshoot minima, leading to divergence or poor convergence, and does not address the fundamental issue of the model fitting noise in the training data.

Full explanation →

69

MCQhard

A media company uses a natural language processing (NLP) model to classify news articles into topics. The model was trained on articles from 2015-2018. In 2023, the model's F1 score drops significantly. The data scientists find that the word embeddings no longer capture the meaning of some terms (e.g., 'covid', 'metaverse'). The model uses static word embeddings (Word2Vec) trained on the original corpus. Which solution BEST addresses the observed degradation? A. Replace static embeddings with contextual embeddings from a transformer model like BERT, then fine-tune the classifier. B. Retrain the static Word2Vec embeddings on a larger corpus from 2023. C. Apply data augmentation to the original training data by replacing words with synonyms. D. Increase the dimensionality of the static embeddings.

A.Retrain the static Word2Vec embeddings on a larger corpus from 2023.

B.Increase the dimensionality of the static embeddings.

C.Replace static embeddings with contextual embeddings from a transformer model like BERT, then fine-tune the classifier.

D.Apply data augmentation to the original training data by replacing words with synonyms.

AnswerC

Contextual embeddings dynamically represent words based on context, handling semantic shift effectively.

Why this answer

Option A is correct. Contextual embeddings (e.g., BERT) capture meaning based on context, adapting to new uses of words like 'covid' meaning pandemic. Fine-tuning the classifier on new data would update the model.

Option B (retraining static embeddings) might capture new word senses but still assigns a single vector per word, missing context. Option C (data augmentation) does not introduce new word meanings. Option D (increasing dimensionality) does not address the semantic shift.

Full explanation →

70

Multi-Selecthard

Which TWO of the following are techniques used for reducing overfitting in neural networks? (Choose two.)

Select 2 answers

A.Dropout

B.Boosting

C.L2 regularization

D.Increasing the learning rate

E.Increasing the number of hidden layers

AnswersA, C

Dropout randomly drops neurons to reduce overfitting.

Why this answer

Dropout is a regularization technique that randomly drops a fraction of neurons during training, which prevents the network from relying too heavily on any single neuron and forces it to learn more robust features. This reduces overfitting by introducing noise that improves generalization.

Exam trap

CompTIA often tests the distinction between regularization techniques and other training strategies, so the trap here is that candidates may confuse boosting (an ensemble method) with regularization, or assume that increasing model complexity (more layers) or learning rate can help reduce overfitting when they actually do the opposite.

Full explanation →

71

MCQhard

An organization is developing an AI system to approve loan applications. They want to ensure the model does not discriminate based on race or gender. Which technique BEST addresses this concern?

A.Remove race and gender features from the training data.

B.Use a more complex model to capture nuances.

C.Apply adversarial debiasing during model training.

D.Collect more training data from diverse populations.

AnswerC

Correct; adversarial debiasing learns fair representations.

Why this answer

Adversarial debiasing is a technique that explicitly trains the model to remove sensitive information (like race or gender) from its internal representations, preventing the model from learning discriminatory patterns even if correlated features remain. This directly addresses fairness by making the model's predictions independent of protected attributes, which is more robust than simply removing features (which can still allow proxy discrimination).

Exam trap

CompTIA often tests the misconception that removing protected attributes is sufficient to eliminate bias, when in reality proxy features and correlated variables can still cause discrimination, making adversarial debiasing or other fairness-aware algorithms necessary.

How to eliminate wrong answers

Option A is wrong because simply removing race and gender features does not prevent the model from learning proxies for these attributes (e.g., zip code, income bracket) that can still lead to discriminatory outcomes. Option B is wrong because using a more complex model increases the risk of overfitting to spurious correlations and does not inherently address fairness; it may even amplify biases present in the data. Option D is wrong because collecting more diverse data does not guarantee fairness; biased labeling, historical discrimination, or imbalanced representation can persist, and the model may still learn to discriminate unless debiasing techniques are applied.

Full explanation →

72

MCQeasy

A bank uses an AI model to approve loans. During an audit, it is found that the model denies loans at a higher rate for a certain ethnic group. Which governance principle is primarily violated?

A.Accountability

B.Fairness

C.Transparency

D.Privacy

AnswerB

Fairness requires non-discrimination, which is violated here.

Why this answer

Option D (Fairness) is correct because fairness requires that AI systems do not discriminate against protected groups. Option A (Privacy) is about data protection, not bias. Option B (Transparency) is about explaining decisions, not the discrimination itself.

Option C (Accountability) is about responsibility, but the core issue is bias.

Full explanation →

73

MCQmedium

A data pipeline processes customer data from multiple sources. The data quality check reveals duplicate records. Which step should the pipeline include to handle this?

A.Data deduplication

B.Data encryption

C.Data transformation

D.Data validation

AnswerA

Deduplication specifically targets and eliminates duplicate records.

Why this answer

Deduplication is the process of identifying and removing duplicate records to ensure data quality. Data validation checks for schema or format errors. Data transformation changes data structure or values.

Data encryption ensures security but does not address duplicates.

Full explanation →

74

Multi-Selecteasy

A data engineer is preparing a dataset for a binary classification model. The dataset has 10,000 samples with 100 features. To improve model performance and reduce training time, the engineer decides to perform feature selection. Which two techniques are appropriate for this task? (Select TWO).

Select 2 answers

A.Normalization

B.Recursive Feature Elimination (RFE)

C.L1 Regularization

D.One-Hot Encoding

E.Principal Component Analysis (PCA)

AnswersB, C

RFE selects features by removing the least important ones iteratively.

Why this answer

Recursive Feature Elimination (RFE) is an appropriate feature selection technique because it iteratively removes the least important features based on a model's feature importance scores or coefficients, directly reducing the feature count from 100 to a smaller subset. This improves model performance by eliminating irrelevant or redundant features and reduces training time by decreasing dimensionality.

Exam trap

CompTIA often tests the distinction between feature selection (keeping original features) and dimensionality reduction (creating new features), so candidates mistakenly select PCA thinking it selects features, when it actually transforms them into principal components.

Full explanation →

75

MCQmedium

Based on the exhibit, what is the most likely cause of the pod failure and its solution?

A.The node has insufficient CPU; add more CPU.

B.The pod is configured with wrong GPU drivers; update drivers.

C.The model is too large; use a smaller model.

D.The container memory limit is too low; increase the memory limit in the pod spec.

AnswerD

OOMKilled specifically indicates memory exhaustion; raising the limit is the direct fix.

Why this answer

The pod failure is caused by an OOMKilled (Out of Memory) error, as indicated by the pod status in the exhibit. When a container exceeds its memory limit, Kubernetes terminates it with an OOMKilled exit code. Increasing the memory limit in the pod spec allows the container to allocate more memory, resolving the failure.

Exam trap

CompTIA often tests the distinction between resource exhaustion errors (OOMKilled vs. CPU throttling) and configuration errors (driver issues), leading candidates to incorrectly attribute a memory limit issue to a hardware or driver problem.

How to eliminate wrong answers

Option A is wrong because the exhibit shows no CPU-related errors or resource pressure; the failure is due to memory exhaustion, not insufficient CPU. Option B is wrong because GPU driver issues would manifest as device plugin errors or initialization failures, not an OOMKilled status. Option C is wrong because the model size is not directly indicated as the cause; the pod is failing due to memory limits, and using a smaller model might reduce memory usage but does not address the misconfigured resource limit.

Full explanation →

Page 1 of 7

All pages

Practice AI0-001 by domain

Target a specific domain to shore up weak areas.

AI Concepts and Foundations Machine Learning and Deep Learning AI Models and Data Engineering AI Implementation and Operations AI Security, Ethics and Governance

See all domains with question counts →