Knowledge + Practice

CompTIA AI+ AI0-001 (AI0-001) — Questions 301–375

500 questions total · 7pages · All types, answers revealed

Take a mock exam Exam hub

Page 5 of 7

301

MCQmedium

An AI model is being developed for medical diagnosis from X-ray images. The dataset contains only frontal chest X-rays. The model achieves high accuracy on test set but fails on lateral views. What is the most likely cause?

A.Dataset bias

B.Underfitting

C.Label noise

D.Overfitting

AnswerA

The training set lacks lateral views, causing bias; the model has not learned to recognize features specific to lateral X-rays.

Why this answer

The model was trained only on frontal views, so it did not learn features from lateral views, resulting in dataset bias and poor generalization to unseen perspectives.

Full explanation →

302

MCQeasy

In the AI lifecycle, which phase involves splitting data into training, validation, and test sets?

A.Model training

B.Data preprocessing

C.Data collection

D.Model evaluation

AnswerB

Correct; preprocessing includes cleaning, transforming, and splitting data.

Why this answer

Data preprocessing is the phase where raw data is cleaned, transformed, and prepared for modeling. Splitting the dataset into training, validation, and test sets is a critical step during this phase to ensure unbiased evaluation and prevent data leakage. This split occurs before any model training begins, making it part of preprocessing rather than training or evaluation.

Exam trap

CompTIA often tests the misconception that data splitting belongs to model training or evaluation, when in fact it is a preprocessing step that must occur before any model sees the data.

How to eliminate wrong answers

Option A is wrong because model training is the phase where the algorithm learns patterns from the training data, not where the data is split; splitting must happen beforehand to avoid contaminating the evaluation. Option C is wrong because data collection is the initial gathering of raw data from sources, which occurs before any splitting or preprocessing. Option D is wrong because model evaluation uses the already-split test set to assess performance, but the split itself is established during data preprocessing.

Full explanation →

303

MCQeasy

A model's training accuracy is 99% but validation accuracy drops to 60%. What is the most likely issue?

A.Data leakage

B.Overfitting

C.Multicollinearity

D.Underfitting

AnswerB

Overfitting leads to high training accuracy but low validation accuracy.

Why this answer

A training accuracy of 99% with a validation accuracy of only 60% is a classic symptom of overfitting. The model has memorized the training data, including noise and outliers, rather than learning generalizable patterns, causing it to perform poorly on unseen validation data.

Exam trap

CompTIA often tests the distinction between overfitting and data leakage by presenting a large accuracy gap, where candidates might mistakenly attribute the issue to data leakage instead of recognizing that leakage typically inflates both accuracies rather than creating a divergence.

How to eliminate wrong answers

Option A is wrong because data leakage typically causes both training and validation accuracy to be artificially high, not a large gap between them; it occurs when information from outside the training set inadvertently influences the model. Option C is wrong because multicollinearity refers to high correlation among input features in regression models, which affects coefficient stability and interpretability, not a drastic accuracy drop between training and validation sets. Option D is wrong because underfitting would result in low accuracy on both training and validation sets (e.g., both below 70%), not a high training accuracy with a low validation accuracy.

Full explanation →

304

MCQhard

A retail company uses a time-series model to forecast daily sales for inventory management. The model is a seasonal ARIMA trained on three years of daily data. It performed well during initial validation but after deployment, forecasts became inaccurate during holiday seasons, often underestimating demand by up to 40%. The data science team examined the features and found that the training data did not include any holiday indicators. They also discovered that the model's residuals show strong autocorrelation during holiday weeks. The company needs to improve the forecast for the upcoming holiday season. They have access to historical sales data with holiday dates and are considering several approaches. Which approach will BEST address the issue?

A.Engineer holiday-related features (e.g., holiday flag, days before/after) and retrain the ARIMA model with these features as exogenous variables.

B.Decrease the learning rate and increase the number of training epochs.

C.Create an ensemble of the ARIMA model and a seasonal naive model to average forecasts.

D.Replace the ARIMA model with a deep learning LSTM model trained on the same data.

AnswerA

Incorporating holiday information directly addresses the root cause of the forecast error.

Why this answer

The core issue is that the ARIMA model lacks holiday-related information, causing systematic underestimation during holiday periods. By engineering holiday features (e.g., binary flags, days-before/after indicators) and including them as exogenous variables in a SARIMAX model, the model can directly learn the demand spikes associated with holidays. This directly addresses the residual autocorrelation during holiday weeks and improves forecast accuracy without changing the underlying time-series framework.

Exam trap

CompTIA often tests the misconception that adding more data or switching to a more complex model (like LSTM) automatically fixes forecasting issues, when the real problem is missing relevant features that can be addressed with a simpler, interpretable approach like SARIMAX with exogenous variables.

How to eliminate wrong answers

Option B is wrong because decreasing the learning rate and increasing epochs is a hyperparameter tuning technique for gradient-based models (e.g., neural networks), not applicable to ARIMA which is trained via maximum likelihood estimation or least squares. Option C is wrong because averaging forecasts from an ARIMA and a seasonal naive model would dilute the ARIMA's signal and still fail to capture holiday effects, as the naive model also lacks holiday features. Option D is wrong because replacing ARIMA with an LSTM does not automatically solve the missing holiday indicator problem; the LSTM would also need engineered holiday features to learn the pattern, and the question asks for the best approach given available data, not a complete model swap.

Full explanation →

305

MCQmedium

A data engineer is reviewing an S3 bucket policy for a machine learning project. The policy is intended to allow access to training data only from the corporate network (10.0.0.0/16). However, users in the corporate network report access denied. Which issue is most likely causing the problem?

A.The policy is missing a Deny statement.

B.The resource ARN is incorrect.

C.The policy does not include s3:ListBucket action, which is needed to list objects.

D.The IP address condition is not matching because the corporate network uses a different CIDR.

AnswerC

Users need ListBucket to see objects before getting them.

Why this answer

Option C is correct. The condition key uses "aws:SourceIp" but the correct key for IP address condition is "aws:SourceIp". However, the exhibit shows "aws:SourceIp" which is correct? Wait, the exhibit shows "aws:SourceIp" — that is correct.

But the problem might be that the policy uses "GetObject" but the users might be trying to list objects (s3:ListBucket). Option C says the action does not include s3:ListBucket, which is often needed first. Option A is wrong because the resource is correct.

Option B is wrong because the policy allows access from 10.0.0.0/16. Option D is wrong because the effect is Allow.

Full explanation →

306

MCQmedium

A government agency is deploying an AI model to screen loan applications. The model uses features like income, credit score, employment history, and zip code. During fairness auditing, the model is found to deny a disproportionately high number of applicants from a particular demographic group, even when controlling for legitimate financial factors. The agency wants to mitigate this bias without significantly reducing overall accuracy. Which approach should the data scientist prioritize?

A.Adjust the decision threshold for the affected group

B.Remove the zip code feature from the model

C.Apply sample weighting to balance the demographic groups

D.Use adversarial debiasing during model training

AnswerD

Adversarial debiasing forces the model to learn representations that are invariant to sensitive attributes, reducing bias with minimal accuracy loss.

Why this answer

Adversarial debiasing trains the model to remove sensitive information from its internal representations, reducing bias while maintaining accuracy. Option A (remove sensitive features) is ineffective because correlated proxies remain. Option B (reweight training samples) can help but may distort the distribution.

Option D (post-hoc threshold adjustment) may reduce disparity but often at the cost of overall accuracy; adversarial debiasing is a more principled in-processing method.

Full explanation →

307

MCQeasy

Based on the exhibit, what does this indicate about the model?

A.The model has balanced performance

B.The model is underfitting

C.The model is overfitting

D.The model has high precision but low recall, missing many positives

AnswerD

Correct: Precision is high, recall is low, indicating the model is conservative in labeling positives.

Why this answer

Option B is correct because high precision (0.95) means few false positives, but low recall (0.60) means many false negatives, so the model misses many positive instances. Options A, C, and D are incorrect: overfitting cannot be determined from these metrics alone, underfitting would likely show both low precision and recall, and balanced performance would have similar values.

Full explanation →

308

MCQeasy

A data engineer needs to store training data in a format that supports columnar pruning during model training. Which storage format should they use?

A.Parquet

B.XML

C.JSON

D.CSV

AnswerA

Parquet is columnar, enabling compression and pruning, reducing I/O.

Why this answer

Parquet is the correct choice because it is a columnar storage format that enables column pruning, allowing the training process to read only the columns needed for model training rather than entire rows. This reduces I/O and speeds up data loading, which is critical for large-scale AI/ML workloads. Unlike row-oriented formats, Parquet stores data by columns, making it efficient for analytical queries and feature selection.

Exam trap

CompTIA often tests the misconception that JSON or CSV are acceptable for columnar pruning because they are common and human-readable, but the trap here is that only columnar formats like Parquet or ORC support efficient column-level access, while row-oriented formats require full record scans.

How to eliminate wrong answers

Option B (XML) is wrong because XML is a verbose, hierarchical text format that stores data row-wise and lacks columnar pruning capabilities, leading to high storage overhead and slow read performance for tabular data. Option C (JSON) is wrong because JSON is a row-oriented, self-describing format that requires parsing entire records even when only a subset of fields is needed, making it unsuitable for column pruning. Option D (CSV) is wrong because CSV is a flat, row-oriented text format that forces reading entire rows into memory, with no support for columnar storage or predicate pushdown, resulting in inefficient I/O for selective column access.

Full explanation →

309

MCQmedium

A healthcare organization wants to use patient data to predict disease risk. They are concerned about bias in the model. Which step is most critical during the data preparation phase to mitigate bias?

A.Applying SMOTE to oversample minority classes

B.Using a more complex algorithm

C.Removing all demographic features

D.Ensuring the training data is representative of the target population

AnswerD

Representative data prevents bias from skewed sampling.

Why this answer

Option A is correct because ensuring the training data is representative of the target population is fundamental to avoid bias. Option B is incorrect because SMOTE addresses class imbalance, not bias from non-representative sampling. Option C is incorrect because model complexity does not directly address bias.

Option D is incorrect because removing demographic features may not eliminate bias if proxy variables remain.

Full explanation →

310

MCQmedium

A company built a speech-to-text model using a recurrent neural network (RNN). During deployment, the model performs poorly on accented speech. Which action would most effectively improve model robustness?

A.Collect a small sample of accented speech and fine-tune the model on that sample only.

B.Add dropout and reduce the number of RNN layers to prevent overfitting to the current data.

C.Augment the training dataset with various accented audio samples and retrain the model.

D.Replace the RNN with a convolutional neural network (CNN) for feature extraction.

AnswerC

Data augmentation with accents directly addresses the performance gap.

Why this answer

Option C is correct because augmenting the training dataset with diverse accented audio samples directly addresses the root cause of poor performance—distribution shift between training and deployment data. Retraining the model on this enriched dataset allows the RNN to learn invariant features across accents, improving generalization without altering the model architecture or risking catastrophic forgetting from fine-tuning on a tiny sample.

Exam trap

CompTIA often tests the misconception that architectural changes (like switching to CNN or adding regularization) can fix data distribution mismatches, when the real solution is to address the missing data diversity through augmentation or retraining.

How to eliminate wrong answers

Option A is wrong because fine-tuning on a small sample of accented speech can cause catastrophic forgetting of the original training distribution and does not provide enough diversity to learn robust accent-invariant features. Option B is wrong because adding dropout and reducing layers addresses overfitting to the current data, but the core problem is underfitting to accented speech due to missing representative training examples, not overfitting. Option D is wrong because replacing the RNN with a CNN for feature extraction does not inherently solve the accent robustness issue; CNNs are effective for spatial patterns but less suited for sequential temporal dependencies in speech, and the fundamental problem remains the lack of accented training data.

Full explanation →

311

MCQmedium

Which AI governance framework is specifically designed by the U.S. National Institute of Standards and Technology (NIST) to help organizations manage AI risks?

A.ISO/IEC 27001

B.COBIT

C.NIST AI Risk Management Framework

D.GDPR

AnswerC

NIST AI RMF is the U.S. standard for AI risk management.

Why this answer

NIST AI Risk Management Framework (AI RMF) is a comprehensive framework for AI risk assessment and mitigation.

Full explanation →

312

Multi-Selecthard

Which TWO techniques are specifically designed to protect individual privacy when training AI models?

Select 2 answers

A.Dropout

B.Model pruning

C.Differential privacy

D.Regularization

E.Anonymization

AnswersC, E

Differential privacy adds noise to training to prevent data leakage.

Why this answer

Differential privacy and anonymization are directly aimed at preserving privacy by limiting information leakage.

Full explanation →

313

MCQmedium

A company wants to create an AI system that can identify objects in images. They have a large dataset of labeled images. Which type of neural network architecture is most suitable?

A.Transformer

B.Convolutional neural network (CNN)

C.Generative adversarial network (GAN)

D.Recurrent neural network (RNN)

AnswerB

Correct; CNNs excel at image recognition due to convolutional layers.

Why this answer

Convolutional neural networks (CNNs) are specifically designed to process grid-like data such as images. They use convolutional layers to automatically learn spatial hierarchies of features (edges, textures, objects) from pixel data, making them the most suitable architecture for image classification tasks with labeled datasets.

Exam trap

CompTIA often tests the misconception that any 'neural network' can handle images equally, but the trap is that RNNs and Transformers are sequence-based and not optimized for spatial feature extraction, while GANs are generative, not discriminative.

How to eliminate wrong answers

Option A is wrong because Transformers are primarily designed for sequential data (e.g., text) using self-attention mechanisms; while they can be adapted for vision (Vision Transformers), they require large datasets and are not the standard choice for traditional image classification. Option C is wrong because Generative adversarial networks (GANs) are used for generating new data (e.g., synthetic images) rather than classifying or identifying objects in existing images. Option D is wrong because Recurrent neural networks (RNNs) are designed for sequential or time-series data (e.g., text, speech) and struggle with spatial relationships in images due to vanishing gradients and lack of translation invariance.

Full explanation →

314

MCQhard

A retail company uses a machine learning model to predict daily sales. The model takes features like past sales, promotions, holidays, and weather data. Recently, the model's accuracy dropped significantly. The data engineer checks the data pipeline and finds that the weather data source changed from a free API to a new paid API that provides more detailed data. The new data includes additional attributes like humidity and wind speed, but the existing pipeline only ingests temperature and precipitation. Also, the time zone format changed from UTC to local time. The model was trained on the old format. Which action should the engineer take first to restore model performance?

A.Add a new step to merge old and new weather data before feeding to the model.

B.Transform the new data to match the old format (time zone and selected features) and retrain the model.

C.Revert to the old weather API.

D.Retrain the model with the new data including all new features.

AnswerB

This aligns the data with the training pipeline, resolving the immediate mismatch.

Why this answer

The immediate problem is the time zone change causing misalignment between training and inference data. Transforming the new data to match the old format ensures consistency. Retraining with all new features may introduce drift; reverting may not be possible; merging data without alignment causes inconsistency.

Full explanation →

315

Multi-Selecthard

A team is deploying a deep learning model that uses a convolutional neural network (CNN) for image recognition. The model achieves high accuracy but is very slow to infer on edge devices. Which THREE optimization techniques should the team consider to speed up inference without significant accuracy loss? (Select three.)

Select 3 answers

A.Use larger convolutional filters (e.g., 7x7 instead of 3x3) to capture more context.

B.Use weight pruning to remove unnecessary connections in the network.

C.Implement knowledge distillation by training a smaller model to mimic the larger one.

D.Increase the number of convolutional layers to improve feature extraction.

E.Apply model quantization to reduce weight precision.

AnswersB, C, E

Pruning reduces computation and memory footprint.

Why this answer

Weight pruning removes redundant or less important connections (weights) from the neural network, reducing the number of computations required during inference. This directly speeds up inference on edge devices while typically causing only a minor drop in accuracy if done carefully, making it a standard optimization technique for deploying CNNs on resource-constrained hardware.

Exam trap

CompTIA often tests the misconception that increasing model capacity (larger filters or more layers) improves performance without considering the trade-off in inference speed, leading candidates to select options that actually worsen latency on edge devices.

Full explanation →

316

MCQmedium

A data scientist submits a model training job to a cloud ML platform. The job fails with an error: "Out of memory: Killed process." The training code is proven to work on the developer's local machine with 16GB RAM. The cloud instance has 32GB RAM. What is the most likely cause?

A.The cloud instance does not have enough memory

B.The training code is loading the entire dataset into memory without batch processing

C.The training model is too large for the GPU memory

D.The cloud instance ran out of disk space

AnswerB

If the dataset is larger than memory, the process may exhaust RAM even if instances have more memory than local.

Why this answer

Option B is correct because the error 'Out of memory: Killed process' occurs when the operating system's OOM killer terminates a process that exceeds available RAM. Even though the cloud instance has 32GB RAM, if the training code loads the entire dataset into memory without batch processing (e.g., using pandas.read_csv() without chunking), it can consume far more memory than the dataset size due to Python object overhead, temporary copies, and intermediate tensors. The local machine with 16GB RAM may have worked due to a smaller dataset or different memory pressure, but the cloud instance's 32GB is insufficient if the code is not memory-efficient.

Exam trap

CompTIA often tests the misconception that 'more RAM always solves memory errors', but the trap here is that the error is caused by inefficient data loading (no batching) rather than absolute memory size, so candidates must recognize the OOM killer message as a symptom of memory exhaustion from unbounded data consumption.

How to eliminate wrong answers

Option A is wrong because the cloud instance has 32GB RAM, which is double the local machine's 16GB, so the error is not simply due to insufficient memory—it's a code inefficiency. Option C is wrong because the error message 'Out of memory: Killed process' is a Linux kernel OOM killer message related to system RAM, not GPU memory; GPU out-of-memory errors typically appear as CUDA errors (e.g., 'CUDA out of memory'). Option D is wrong because disk space exhaustion would produce errors like 'No space left on device' or 'Disk quota exceeded', not an OOM killer process termination.

Full explanation →

317

MCQhard

A financial institution needs to integrate an AI-based credit scoring model into an existing mainframe system that processes transactions in COBOL. The model is deployed as a REST API. What is the best strategy to ensure minimal disruption and maintain data integrity?

A.Copy all transaction data to a cloud database for the model to access.

B.Use an API gateway with versioning and circuit breaker patterns.

C.Rewrite the mainframe system in Java to directly call the model.

D.Install a GPU on the mainframe to run the model natively.

AnswerB

This pattern enables loose coupling, resilience, and gradual integration without major changes to the mainframe.

Why this answer

Option B is correct because an API gateway with versioning and circuit breaker patterns allows the COBOL mainframe to call the REST API without modifying its core transaction logic. The gateway handles protocol translation, rate limiting, and failover, ensuring minimal disruption to the existing mainframe system while maintaining data integrity through transactional consistency and graceful degradation.

Exam trap

CompTIA often tests the misconception that legacy systems must be replaced or heavily modified to integrate with modern AI services, when in fact an API gateway provides a non-invasive integration layer that preserves the existing infrastructure.

How to eliminate wrong answers

Option A is wrong because copying all transaction data to a cloud database introduces latency, potential data inconsistency, and security risks, and does not address the integration between COBOL and the REST API. Option C is wrong because rewriting the mainframe system in Java is a massive, high-risk, and costly undertaking that would cause significant disruption and is unnecessary for integrating a REST API. Option D is wrong because installing a GPU on the mainframe does not enable native execution of the AI model, as mainframes lack the necessary software stack and the model is deployed as a REST API, not as a local executable.

Full explanation →

318

Multi-Selectmedium

Which TWO of the following are effective defenses against adversarial examples in AI systems?

Select 2 answers

A.Train the model with adversarial examples (adversarial training)

B.Use an ensemble of models and majority voting

C.Increase the model's sensitivity to input changes

D.Implement input sanitization and feature squeezing

E.Reduce model complexity through pruning

AnswersA, D

Adversarial training hardens the model against perturbations.

Why this answer

Options B and D are correct. Adversarial training and input sanitization/denoising are defenses. Option A is wrong because model compression may remove defenses.

Option C is wrong because increasing model sensitivity can make it more vulnerable. Option E is wrong because ensemble voting is not a direct defense against adversarial examples.

Full explanation →

319

Multi-Selecthard

Which TWO of the following are key characteristics of unsupervised learning?

Select 2 answers

A.It uses data without labeled responses

B.It predicts a target variable based on input features

C.It discovers hidden patterns or groupings in data

D.It requires a reward signal to learn optimal actions

E.It typically requires a separate validation set for tuning

AnswersA, C

Unsupervised learning works with unlabeled data.

Why this answer

Option A is correct because unsupervised learning algorithms, such as k-means clustering or hierarchical clustering, operate exclusively on input data that has no labeled responses. The model must infer the underlying structure directly from the features without any ground-truth outputs to guide it, which is the defining characteristic of unsupervised learning.

Exam trap

CompTIA often tests the distinction between supervised, unsupervised, and reinforcement learning by presenting a characteristic that is true for one paradigm but not the other, and the trap here is that candidates may confuse 'predicting a target variable' (supervised) with 'discovering hidden patterns' (unsupervised) because both involve analyzing input features.

Full explanation →

320

Multi-Selectmedium

Which TWO techniques should be considered when optimizing a deep learning model for deployment on edge devices with limited computational resources?

Select 2 answers

A.Apply adversarial training

B.Model quantization

C.Use a GPU for inference

D.Knowledge distillation

E.Increase the number of layers

AnswersB, D

Quantization reduces memory and computation requirements.

Why this answer

Model quantization reduces the precision of the model's weights and activations (e.g., from 32-bit floating point to 8-bit integer), which significantly decreases memory footprint and computational latency. This makes it a primary technique for deploying deep learning models on edge devices with limited resources.

Exam trap

CompTIA often tests the distinction between training-phase techniques (like adversarial training) and deployment-phase optimization techniques (like quantization and knowledge distillation), leading candidates to select options that improve model quality rather than reduce resource consumption.

Full explanation →

321

MCQeasy

A hospital's radiology department uses an AI model to detect lung nodules in CT scans. The model was trained on data from a specific brand of scanners and patient demographics common in Europe. Recently, the hospital acquired new scanners from a different manufacturer and started serving a more diverse patient population. Over the past month, the model's false-positive rate has increased by 15% and false-negative rate by 8%. The radiologists are losing confidence and are considering abandoning the AI tool altogether. The IT team has verified that the model inference is running correctly and the hardware is performing as expected. The data science team suspects the problem is related to the change in input data distribution. The hospital's AI operations policy requires that any model update must be validated on at least 500 recent cases before deployment. What is the BEST course of action for the AI operations team?

A.Roll back to the previous model version and restrict use of the AI tool to only European patients.

B.Collect 500 recent CT scans from the new scanners, retrain the model on a combined old and new dataset, and validate before deployment.

C.Retrain the model using the original training data but with increased regularization to avoid overfitting.

D.Adjust the model's decision threshold to reduce false positives and then monitor for two weeks.

AnswerB

Retraining with a representative sample addresses the data drift and meets the policy requirement of 500 validation cases.

Why this answer

Option C is correct because collecting 500 recent CT scans that represent the new scanner and patient distribution allows for validation of a retrained model that adapts to the new data. Option A is wrong because adjusting thresholds only addresses false-positive rate but not false-negative rate, and does not tackle the root cause. Option B is wrong because rolling back to the previous model where performance was good on old data would not improve performance on the new data.

Option D is wrong because retraining on the original training data would not include the new distribution and likely yield the same issues.

Full explanation →

322

MCQmedium

A machine learning engineer is tuning a neural network for image classification. The training loss decreases steadily, but the validation loss starts increasing after 50 epochs. Which action best addresses this issue?

A.Increase the number of hidden layers

B.Add more training data

C.Apply early stopping with a patience of 10 epochs

D.Increase the batch size

AnswerC

Early stopping monitors validation loss and stops training when it starts increasing, directly addressing overfitting.

Why this answer

Early stopping halts training when validation performance degrades, preventing overfitting.

Full explanation →

323

Multi-Selecthard

Which THREE data quality dimensions are critical for ensuring model reliability?

Select 3 answers

A.Timeliness.

B.Consistency.

C.Completeness.

D.Volume.

E.Accuracy.

AnswersA, B, E

Outdated data can cause predictions to be irrelevant.

Why this answer

Timeliness is critical because stale data can lead to incorrect model predictions, especially in dynamic environments like network traffic analysis or fraud detection. For AI models, data must reflect the current state of the system to ensure relevance and reliability. Without timely data, the model may act on outdated patterns, reducing its effectiveness.

Exam trap

CompTIA often tests the distinction between data quality dimensions and data characteristics, so the trap here is that candidates confuse completeness or volume with the three critical dimensions (timeliness, consistency, accuracy) that directly impact model reliability.

Full explanation →

324

MCQeasy

A small business launched a customer support chatbot powered by a pre-trained language model. The chatbot was fine-tuned on a dataset of past support tickets. For the first week, it performed well, accurately answering 85% of queries. After a routine software update that included a new version of the underlying language model library, the chatbot's accuracy dropped to 60% and it began giving nonsensical responses to some questions. The update did not change any code or configuration specific to the chatbot. The business has a backup of the previous environment. What is the MOST appropriate immediate action?

A.Retrain the chatbot on the original dataset using the new library version.

B.Add more intents to the chatbot's configuration to cover the errors.

C.Increase the model's temperature parameter to 1.5 to encourage more varied responses.

D.Roll back the software update to the previous version of the language model library.

AnswerD

The update likely introduced a regression; rolling back restores functionality immediately.

Why this answer

The most appropriate immediate action is to roll back the software update to the previous version of the language model library (Option D). The accuracy drop and nonsensical responses are directly caused by the library update, which likely changed internal model behavior (e.g., tokenization, attention mechanisms, or default hyperparameters) without any code or configuration changes. Restoring the previous environment immediately resolves the issue and allows the business to investigate the library changes in a controlled manner.

Exam trap

CompTIA often tests the misconception that retraining or adjusting hyperparameters can fix a regression caused by an underlying library change, when the immediate and correct action is to roll back to the known-good environment.

How to eliminate wrong answers

Option A is wrong because retraining the chatbot on the original dataset using the new library version does not address the root cause—the library update itself may have altered model internals (e.g., tokenizer version, default parameters) that cannot be fixed by retraining alone, and retraining is time-consuming and not an immediate fix. Option B is wrong because adding more intents does not resolve the underlying model behavior change; the chatbot is producing nonsensical responses, not missing intents, so intent expansion is irrelevant to the core issue. Option C is wrong because increasing the temperature parameter to 1.5 would make responses more random and less coherent, worsening the nonsensical outputs; temperature controls output randomness, not model correctness or library compatibility.

Full explanation →

325

Multi-Selectmedium

A DevOps team is deploying a machine learning model using a CI/CD pipeline. They want to ensure the model is reproducible and traceable. Which TWO practices should they implement?

Select 2 answers

A.Version the training dataset and code using Git and DVC.

B.Manually deploy the model to production after approval.

C.Store only the final model artifact in a shared drive.

D.Use a spreadsheet to record model version numbers.

E.Package the model in a Docker container with a fixed base image.

AnswersA, E

Version control for data and code ensures exact reproduction of training.

Why this answer

Option A and D are correct because versioning data and code, and using containers ensure reproducibility. Option B is wrong because manual deployment is error-prone and not traceable. Option C is wrong as it does not track the model.

Option E is wrong because manual tagging is inconsistent.

Full explanation →

326

MCQeasy

A data scientist is preparing a dataset for a supervised learning model. The dataset contains missing values in 15% of the rows for a numeric feature. Which preprocessing technique should be applied to minimize bias?

A.Remove all rows with missing values.

B.Impute missing values with the mean of the feature.

C.Encode the missing values as a separate category.

D.Use a model that handles missing values natively.

AnswerB

Mean imputation preserves the dataset size and is appropriate for numeric features when missingness is random.

Why this answer

Imputing missing values with the mean preserves data size and is standard for numeric features when missingness is random. Removing rows can introduce bias if missingness is not random. Encoding as a separate category is for categorical features.

Using a model that handles missing values natively is not always available and may still require imputation.

Full explanation →

327

Multi-Selecteasy

Which TWO are characteristics of supervised learning?

Select 2 answers

A.Does not require target variable

B.Requires labeled data

C.Uses reinforcement signals

D.Learns to cluster data

E.Predicts continuous or categorical output

AnswersB, E

Supervised learning uses input-output pairs for training.

Why this answer

Supervised learning requires labeled data and predicts either continuous (regression) or categorical (classification) outputs.

Full explanation →

328

MCQeasy

A data scientist is preparing a dataset for a classification model. The dataset contains a column "Age" with 10% missing values and a column "Income" with 30% missing values. Which imputation strategy is MOST appropriate to minimize bias?

A.Replace missing Age with the mean and missing Income with the median.

B.Delete all rows with missing values.

C.Replace missing Age with the mode and missing Income with a constant value.

D.Replace missing values with zeros.

AnswerA

Mean for symmetric Age, median for skewed Income minimizes bias.

Why this answer

Option A is correct because using mean imputation for Age (10% missing) and median imputation for Income (30% missing) minimizes bias. Mean is suitable for roughly symmetric distributions with low missingness, while median is robust to outliers and skewness, which is common in income data. This combination reduces distortion of central tendency and preserves data integrity better than uniform methods.

Exam trap

CompTIA often tests the misconception that a single imputation method (e.g., mean for all columns) is universally appropriate, when in fact the choice must consider the missingness rate and the distribution of each feature to minimize bias.

How to eliminate wrong answers

Option B is wrong because deleting all rows with missing values (listwise deletion) reduces sample size and can introduce selection bias, especially when missingness is not completely at random (MCAR). Option C is wrong because replacing Age with the mode is inappropriate for continuous variables like age, as it discards variance and can create artificial clusters; replacing Income with a constant value (e.g., 0) introduces systematic bias and distorts the distribution. Option D is wrong because replacing missing values with zeros for both Age and Income is arbitrary and unrealistic, leading to severe underestimation of central tendency and inflated model error.

Full explanation →

329

Multi-Selecthard

Which TWO of the following are effective defenses against adversarial evasion attacks on image classifiers?

Select 2 answers

A.Data augmentation

B.Gradient masking

C.Adversarial training

D.Input validation

E.Feature squeezing

AnswersB, C

Gradient masking obscures gradient information to prevent crafting adversarial examples.

Why this answer

Adversarial training and gradient masking (e.g., defensive distillation) are common defenses. Data augmentation helps generalization but not specifically against adversarial perturbations; feature squeezing reduces input complexity; input validation is generic.

Full explanation →

330

MCQhard

The exhibit shows a model configuration for a classification task with 10 classes. What is wrong with this setup?

A.The loss function should be categorical crossentropy, not mean squared error

B.The metric should be precision, not accuracy

C.The activation should be sigmoid in hidden layers

D.The optimizer should be SGD, not Adam

AnswerA

Correct: MSE is for regression; classification requires crossentropy loss.

Why this answer

Option A is correct because the loss function should be categorical crossentropy for multi-class classification with softmax output. Options B, C, and D are incorrect: Adam optimizer is appropriate, ReLU activations are fine for hidden layers, and accuracy metric is appropriate.

Full explanation →

331

MCQmedium

An AI engineer is training a deep neural network for image recognition. The training loss decreases steadily for the first few epochs but then plateaus and starts to oscillate. Which adjustment is most likely to improve convergence?

A.Add more layers

B.Increase the learning rate

C.Increase the batch size

D.Reduce the learning rate

AnswerD

A lower learning rate can smooth convergence and reduce oscillation.

Why this answer

Option B is correct because oscillating loss often indicates a learning rate that is too high; reducing it stabilizes training. Option A is incorrect because increasing the learning rate would worsen oscillation. Option C is incorrect because increasing batch size can help but not primarily address oscillation.

Option D is incorrect because adding more layers could increase complexity and overfitting.

Full explanation →

332

MCQhard

A large e-commerce company uses a recommendation system based on collaborative filtering. The system uses a matrix factorization model that is trained nightly on the entire user-item interaction history. Recently, the company launched a flash sale with thousands of new products. Users are reporting that the recommendations are not showing the new products, even for users who have purchased them during the sale. The data engineering team notices that the new products have very few interactions in the training data. The model's loss on the validation set has increased, and the recall@10 metric has dropped from 0.45 to 0.32. The team needs to improve the recommendation of new items without retraining the entire model from scratch every hour. Which approach should the team take?

A.Use a hybrid model that combines collaborative filtering with content-based features from product metadata

B.Retrain the model every hour to incorporate new interactions quickly

C.Remove the new products from the recommendation pool until they accumulate enough interactions

D.Increase the number of latent factors in the matrix factorization model

AnswerA

Content-based features allow the model to recommend new items based on their attributes, overcoming the cold-start problem.

Why this answer

Option A is correct because a hybrid model that combines collaborative filtering with content-based features (e.g., product metadata like category, price, or description) can recommend new products even with zero or very few user interactions. The content-based component leverages item attributes to compute similarity between new and existing items, enabling the system to surface new products without requiring extensive interaction history. This approach addresses the cold-start problem for new items while preserving the collaborative filtering signal for established items, and it does not require retraining the entire model from scratch every hour.

Exam trap

CompTIA often tests the misconception that simply retraining more frequently or increasing model complexity (e.g., more latent factors) can solve the cold-start problem, but the core issue is the lack of interaction data for new items, which requires a content-based or hybrid approach to leverage item metadata.

How to eliminate wrong answers

Option B is wrong because retraining the model every hour would be computationally expensive and operationally impractical for a large-scale system with thousands of new products; it also does not solve the fundamental cold-start issue since new items still have very few interactions in each hourly training window. Option C is wrong because removing new products from the recommendation pool defeats the business purpose of the flash sale, which is to promote and surface new items to users, and it would lead to a poor user experience and lost revenue. Option D is wrong because increasing the number of latent factors in matrix factorization does not address the lack of interaction data for new items; it may even exacerbate overfitting to sparse data and increase computational cost without improving cold-start recommendations.

Full explanation →

333

MCQeasy

A company implements a chatbot using a rule-based system. Users complain the chatbot cannot handle new queries. Which AI approach should be considered to improve flexibility?

A.Expert system

B.Natural language processing (NLP)

C.Robotic process automation

D.Machine learning

AnswerD

ML enables the system to learn patterns from data.

Why this answer

Machine learning (ML) enables a chatbot to learn from new data and adapt to unseen queries, unlike a static rule-based system. By training on historical conversations, an ML model can generalize patterns and handle novel inputs without requiring explicit rules for every scenario.

Exam trap

CompTIA often tests the misconception that NLP alone is sufficient for adaptive chatbots, but NLP is a component of understanding language, not a learning mechanism—machine learning is required for flexibility.

How to eliminate wrong answers

Option A is wrong because an expert system is also rule-based, relying on a fixed knowledge base and inference engine, which cannot adapt to new queries without manual rule updates. Option B is wrong because natural language processing (NLP) alone provides text understanding (e.g., tokenization, parsing) but does not inherently learn from new data; it must be combined with ML for adaptive behavior. Option C is wrong because robotic process automation (RPA) automates repetitive, rule-based tasks in structured environments and cannot handle the variability of new, unseen queries.

Full explanation →

334

MCQhard

A company uses a machine learning model to recommend products to customers. The marketing team notices that the model is recommending high-profit items more frequently than low-profit items, even when customers are likely to prefer the latter. This behavior is causing customer dissatisfaction. Which approach would best align the model with customer preferences while maintaining profitability?

A.Train the model with a loss function that weights profit more heavily than customer satisfaction.

B.Use a multi-objective optimization framework to balance profit and customer satisfaction.

C.Adjust the model's hyperparameters to reduce the influence of profit features.

D.Remove profit data from the training set and only use customer preference data.

AnswerB

Multi-objective optimization explicitly considers trade-offs between competing objectives, leading to more balanced recommendations.

Why this answer

Option D is correct because multi-objective optimization allows the model to balance multiple goals (e.g., profit and customer satisfaction) explicitly. Option A is incorrect because it still prioritizes profit, which may not address satisfaction. Option B is incorrect because it completely removes profit, which may harm business goals.

Option C is incorrect because it only adjusts profit weights without true multi-objective framework.

Full explanation →

335

MCQhard

An AI developer observes that the training accuracy of a neural network is high, but the test accuracy is low. The model uses a ReLU activation function and Adam optimizer. Which approach is most likely to improve test accuracy?

A.Increase the learning rate

B.Add L2 regularization to the loss function

C.Switch to a stochastic gradient descent optimizer

D.Increase the number of epochs

AnswerB

L2 regularization penalizes large weights, preventing overfitting.

Why this answer

L2 regularization adds a penalty on large weights, reducing overfitting and improving test accuracy.

Full explanation →

336

Multi-Selectmedium

Which THREE techniques can help reduce overfitting in neural networks?

Select 3 answers

A.Increasing training data size

B.L2 regularization

C.Using a larger learning rate

D.Dropout

E.Increasing number of layers

AnswersA, B, D

More data helps the model generalize better.

Why this answer

Dropout randomly drops neurons, L2 regularization penalizes large weights, and increasing data size reduces overfitting by providing more examples.

Full explanation →

337

MCQhard

An organization needs to store sensitive customer data for training a machine learning model. The data must be encrypted at rest and in transit, and access must be audited. Which combination of practices should be implemented?

A.Use TLS for transfer, AES-256 for storage, and AWS CloudTrail for auditing

B.Use FTP for transfer, AES-128 for storage, and manual log review

C.Use SSH for transfer, store data in a database, and enable access logs

D.Use MD5 for hashing, store data in plaintext, and enable server logs

AnswerA

These provide encryption and auditing.

Why this answer

Option A is correct because it combines TLS (Transport Layer Security) for encrypting data in transit, AES-256 for strong encryption at rest, and AWS CloudTrail for auditing API-level access. TLS ensures confidentiality and integrity during transmission, AES-256 provides robust symmetric encryption for stored data, and CloudTrail logs all AWS API calls for compliance and audit trails. This triad satisfies the requirements of encryption in transit, at rest, and audited access.

Exam trap

CompTIA often tests the distinction between encryption (AES) and hashing (MD5), and the requirement for both in-transit and at-rest encryption, leading candidates to confuse SSH or FTP with proper TLS-based encryption.

How to eliminate wrong answers

Option B is wrong because FTP transfers data in plaintext, offering no encryption in transit, and AES-128 is weaker than AES-256, while manual log review is not scalable or auditable. Option C is wrong because SSH encrypts only the session, not the data at rest, and storing data in a database without specifying encryption at rest leaves it vulnerable; access logs alone do not provide the same audit trail as a dedicated service like CloudTrail. Option D is wrong because MD5 is a hash function, not encryption, and storing data in plaintext violates the encryption-at-rest requirement; server logs are insufficient for comprehensive auditing.

Full explanation →

338

MCQeasy

A data scientist wants to group customers into segments based on purchasing behavior without predefined labels. Which type of machine learning is most appropriate?

A.Reinforcement learning

B.Supervised learning

C.Unsupervised learning

D.Semi-supervised learning

AnswerC

Correct; unsupervised learning identifies patterns without labels.

Why this answer

Unsupervised learning is the correct choice because the data scientist has no predefined labels and wants to discover natural groupings in customer purchasing behavior. Clustering algorithms, such as K-means or DBSCAN, are used in unsupervised learning to segment data based on inherent patterns without any target variable.

Exam trap

CompTIA often tests the distinction between supervised and unsupervised learning by presenting a scenario with no labels, and the trap is that candidates may confuse clustering (unsupervised) with classification (supervised) or think semi-supervised applies when no labels exist at all.

How to eliminate wrong answers

Option A is wrong because reinforcement learning involves an agent learning from rewards and penalties by interacting with an environment, not grouping unlabeled data. Option B is wrong because supervised learning requires labeled training data with known outcomes, which is not available in this scenario. Option D is wrong because semi-supervised learning uses a small amount of labeled data alongside a larger unlabeled dataset, but the question explicitly states there are no predefined labels.

Full explanation →

339

MCQeasy

A company deploys an AI model to predict equipment failure. The model performs well on historical data but fails to generalize to new data from a different factory. Which concept best describes this issue?

A.Transfer learning

B.Underfitting

C.Overfitting

D.Bias-variance tradeoff

AnswerC

The model fits training data too closely and fails on new data.

Why this answer

Option C (Overfitting) is correct because the model learned patterns specific to the historical data from the original factory, including noise and factory-specific nuances, rather than generalizable features. When applied to new data from a different factory, those learned patterns do not hold, causing poor performance. This is the classic symptom of overfitting: high accuracy on training data but low accuracy on unseen data.

Exam trap

CompTIA often tests the distinction between overfitting and underfitting by describing a model that performs well on training data but poorly on new data, which candidates may mistakenly attribute to underfitting if they focus only on the poor generalization without noting the strong training performance.

How to eliminate wrong answers

Option A is wrong because transfer learning refers to leveraging knowledge from one task to improve learning on a related task, which is not the issue here—the model fails to generalize, not that it fails to transfer knowledge. Option B is wrong because underfitting occurs when the model is too simple to capture underlying patterns, resulting in poor performance on both training and new data, whereas here the model performs well on historical data. Option D is wrong because bias-variance tradeoff is a broader concept describing the balance between underfitting (high bias) and overfitting (high variance); while overfitting is a manifestation of high variance, the specific issue described is overfitting itself, not the tradeoff.

Full explanation →

340

MCQhard

Refer to the exhibit. An AI specialist reviews the model evaluation report for a binary classifier. The specialist wants to improve recall. Which action is most likely effective?

A.Decrease the classification threshold

B.Collect more training data for the minority class

C.Increase the classification threshold

D.Add more features

AnswerB

More minority data provides the model with more patterns, often improving recall.

Why this answer

Collecting more data for the minority class (the class with lower recall) helps the model learn better representations, often improving recall.

Full explanation →

341

MCQeasy

An AI system in a self-driving car misinterprets a stop sign due to a small sticker placed on it. This is an example of which security vulnerability?

A.Supply chain attack

B.Model inversion attack

C.Adversarial example attack

D.Data poisoning attack

AnswerC

Small manipulation of input causes incorrect output.

Why this answer

Option A is correct because adversarial perturbations (stickers) cause misclassification. Option B is wrong because data poisoning corrupts training data, not inference. Option C is wrong because model stealing extracts model parameters.

Option D is wrong because supply chain attacks compromise components earlier.

Full explanation →

342

MCQhard

Refer to the exhibit. A team deploys a sentiment analysis model with this policy. After one month, the monitoring system triggers an alert for feature drift. Which action should the team take first?

A.Review the fairness check settings to ensure protected attributes are still relevant.

B.Immediately retrain the model on recent data to adapt to the drift.

C.Compare the current feature distributions with the training set to identify which features drifted.

D.Reduce the classification threshold to 0.5 to increase sensitivity.

AnswerC

Drift analysis should first characterize the drift to decide corrective action.

Why this answer

Option C is correct because when a monitoring system triggers an alert for feature drift, the first step is to diagnose which features have changed. Comparing current feature distributions with the training set identifies the specific features that drifted, enabling targeted remediation such as retraining with recent data or feature engineering. This aligns with the standard MLOps workflow for drift detection and response.

Exam trap

CompTIA often tests the misconception that any model alert should trigger immediate retraining, but the correct first step is always to diagnose the drift type and affected features before taking action.

How to eliminate wrong answers

Option A is wrong because fairness check settings and protected attributes are unrelated to feature drift; they address bias, not distribution shifts in input features. Option B is wrong because immediately retraining the model without first identifying which features drifted is premature and may waste resources or fail to address the root cause. Option D is wrong because reducing the classification threshold to 0.5 adjusts the decision boundary for sensitivity but does not correct feature distribution changes; it could degrade model performance further.

Full explanation →

343

MCQmedium

A company wants to forecast monthly sales for the next year using historical sales data over three years. The data shows strong seasonality and a slight upward trend. Which model type is best suited for this task?

A.Simple moving average of the last 12 months

B.ARIMA model without seasonal terms

C.SARIMA model with seasonal order (1,1,1)[12]

D.Linear regression with time as the independent variable

AnswerC

SARIMA explicitly handles both trend and seasonality.

Why this answer

Option C is correct because SARIMA explicitly models seasonality and trend, making it ideal for this scenario. Option A is wrong because linear regression does not capture seasonality well without manual feature engineering. Option B is wrong because ARIMA can handle trend but not seasonality without differencing at seasonal lag; SARIMA is more appropriate.

Option D is wrong because a simple moving average ignores trend and seasonality.

Full explanation →

344

MCQmedium

A team trains a deep learning model for image classification with 1000 classes. The training loss decreases but validation loss starts increasing after 10 epochs. What should they do first?

A.Use data augmentation

B.Increase batch size

C.Reduce learning rate

D.Add dropout layers

AnswerD

Correct: Dropout is a regularization technique that helps prevent overfitting.

Why this answer

Option C is correct because adding dropout layers can help reduce overfitting by randomly dropping neurons during training. Options A, B, and D are incorrect: reducing learning rate may help but not as directly for overfitting, increasing batch size may improve stability but not necessarily overfitting, and data augmentation helps if the dataset is small but the symptom here is overfitting.

Full explanation →

345

Multi-Selecteasy

An AI system is being implemented in a healthcare setting. Which TWO ethical considerations should be prioritized?

Select 2 answers

A.Ensuring the model does not exhibit racial or gender bias

B.Maximizing cost reduction for the hospital

C.Providing explainable predictions to doctors

D.Replacing human judgment entirely with AI

E.Using open-source models to reduce licensing costs

AnswersA, C

Fairness is a core ethical concern.

Why this answer

Option A is correct because AI systems in healthcare must avoid racial or gender bias to ensure equitable treatment outcomes. Biased models can lead to misdiagnosis or unequal care, violating ethical standards and regulatory requirements like HIPAA. Prioritizing fairness aligns with responsible AI implementation in clinical settings.

Exam trap

CompTIA often tests the distinction between ethical priorities and operational or financial goals, tricking candidates into selecting cost-saving or efficiency options instead of fairness and explainability.

Full explanation →

346

MCQhard

A team trained a ResNet-50 model with the configuration shown. The high training accuracy and lower validation accuracy suggest overfitting. Which change to the training configuration is MOST likely to reduce overfitting?

A.Reduce number of epochs to 5.

B.Increase batch size to 64.

C.Increase learning rate to 0.01.

D.Add dropout layers after convolutional layers.

AnswerD

Dropout randomly drops neurons, reducing co-adaptation.

Why this answer

Adding dropout layers after convolutional layers is a regularization technique that randomly drops a fraction of neurons during training, which forces the network to learn more robust features and reduces overfitting. This directly addresses the symptom of high training accuracy with lower validation accuracy by preventing the model from relying too heavily on specific neurons.

Exam trap

CompTIA often tests the misconception that increasing batch size or reducing epochs directly fixes overfitting, when in fact these changes can harm convergence or underfit, while regularization techniques like dropout are the correct solution.

How to eliminate wrong answers

Option A is wrong because reducing the number of epochs to 5 would likely lead to underfitting, as the model would not have enough training iterations to converge, and it does not address the root cause of overfitting. Option B is wrong because increasing batch size to 64 can actually reduce the stochasticity of gradient updates, potentially leading to sharper minima and worse generalization, which may exacerbate overfitting. Option C is wrong because increasing the learning rate to 0.01 can cause the optimizer to overshoot minima and destabilize training, and it does not provide regularization to combat overfitting.

Full explanation →

347

Multi-Selecthard

Which TWO are best practices for versioning machine learning models? (Choose 2)

Select 2 answers

A.Use the same model version for all deployments

B.Tag each model with training date, hyperparameters, and performance metrics

C.Use a version control system (e.g., Git) for model code and configuration

D.Store only the final model binary without metadata

E.Manually rename model files with version numbers

AnswersB, C

Metadata enables comparison and audit.

Why this answer

Option B is correct because tagging each model with training date, hyperparameters, and performance metrics creates a reproducible audit trail. This practice aligns with MLOps principles, enabling teams to trace model behavior back to specific training runs and compare versions objectively.

Exam trap

CompTIA often tests the misconception that versioning is only about file naming or storing the binary, when in fact it requires a comprehensive metadata and code tracking system to ensure reproducibility and traceability.

Full explanation →

348

MCQmedium

During training of a neural network, the loss oscillates and does not converge smoothly. The learning rate is set to 0.1. What is the most likely cause and what adjustment should be made?

A.Learning rate too low; increase it

B.Batch size too small; increase it

C.Learning rate too high; decrease it

D.Too many epochs; stop early

AnswerC

High learning rate causes divergence and oscillations.

Why this answer

A learning rate that is too high causes the optimizer to overshoot minima, leading to oscillations. Reducing the learning rate stabilizes training.

Full explanation →

349

Multi-Selecteasy

An organization wants to implement a robust MLOps pipeline. Which THREE components are essential for a complete MLOps framework? (Choose three.)

Select 3 answers

A.Continuous integration and continuous deployment (CI/CD) pipeline

B.Automated testing and validation

C.Automated code review and approval gates

D.Real-time model monitoring dashboard

E.Version control for data and model code

AnswersA, B, E

CI/CD automates model building and deployment.

Why this answer

A is correct because a CI/CD pipeline automates the integration of code changes and deployment of models into production, ensuring consistent and reliable releases. In MLOps, this includes building, testing, and deploying both application code and ML model artifacts, which is fundamental for operationalizing machine learning at scale.

Exam trap

CompTIA often tests the distinction between 'essential framework components' and 'optional operational tools,' leading candidates to mistakenly select monitoring dashboards or code review gates as core MLOps requirements instead of the foundational pillars of CI/CD, automated testing, and version control.

Full explanation →

350

MCQmedium

A model serving pod is failing with OOMKilled. What is the most likely cause?

A.The container image is corrupted

B.The model version is outdated

C.The model requires more memory than the 2Gi limit

D.The Kubernetes cluster has run out of disk space

AnswerC

The pod was killed because it used more memory than allowed.

Why this answer

Option C is correct because an OOMKilled error in Kubernetes indicates that a container exceeded its memory limit and was terminated by the Out Of Memory (OOM) killer. The most common cause is that the model's inference or training workload requires more memory than the configured resource limit (e.g., 2Gi), forcing the kernel to kill the process. This is a direct result of the container's memory request/limit mismatch with the actual consumption.

Exam trap

CompTIA often tests the distinction between OOMKilled (memory limit exceeded) and other pod failure reasons like CrashLoopBackOff (application crash) or ImagePullBackOff (image issues), so candidates must associate OOMKilled specifically with memory resource constraints, not general pod failures.

How to eliminate wrong answers

Option A is wrong because a corrupted container image would typically cause an ImagePullBackOff or CrashLoopBackOff error, not an OOMKilled termination, which is specifically a memory-related kernel action. Option B is wrong because an outdated model version might cause performance or accuracy issues, but it does not directly trigger the OOM killer; memory exhaustion is a resource constraint, not a version compatibility problem. Option D is wrong because running out of disk space on the Kubernetes cluster would result in Evicted pods or ImagePullBackOff errors due to node pressure, not an OOMKilled status, which is tied to memory limits enforced by cgroups.

Full explanation →

351

MCQeasy

A company deploys an AI model via a REST API that handles sensitive customer data. To secure the endpoint, the security team requires that only authenticated and authorized applications can invoke the API. Which mechanism should be implemented?

A.API key or bearer token in the HTTP header

B.TLS encryption for the connection

C.Input sanitization to prevent injection

D.IP whitelisting

AnswerA

API keys/tokens authenticate the caller and are standard for API security.

Why this answer

Option A is correct because API keys or bearer tokens (e.g., OAuth 2.0 access tokens) are the standard mechanism for authenticating and authorizing client applications when invoking a REST API. These tokens are passed in the HTTP Authorization header, allowing the server to verify the client's identity and permissions before processing requests containing sensitive customer data.

Exam trap

CompTIA often tests the distinction between transport-layer security (TLS) and application-layer authentication, so candidates mistakenly choose TLS because it 'secures' the endpoint, but it does not verify who is calling the API.

How to eliminate wrong answers

Option B is wrong because TLS encryption secures data in transit but does not authenticate or authorize the calling application; it only prevents eavesdropping and tampering. Option C is wrong because input sanitization protects against injection attacks (e.g., SQL injection) but does not verify the identity or authorization of the API caller. Option D is wrong because IP whitelisting restricts access based on source IP addresses, which can be spoofed or shared, and does not provide per-application authentication or authorization; it is a network-layer control, not an application-layer identity mechanism.

Full explanation →

352

MCQmedium

You are an AI engineer at a financial services firm. The company has deployed a gradient boosting model to predict loan default risk. The model takes features such as credit score, debt-to-income ratio, loan amount, and employment length. In production, the model processes about 10,000 predictions per day with an average latency of 50ms. Recently, the accuracy has dropped from 92% to 85%. You also notice that the average credit score of applicants has increased significantly because the marketing team launched a campaign targeting prime borrowers. The model was originally trained on data from the past three years, which included a mix of prime and subprime borrowers. You need to restore model performance while minimizing downtime and retraining cost. Which action should you take first?

A.Add a regularization term to penalize high credit scores.

B.Deploy an ensemble of the original model and a neural network.

C.Reject all predictions where the confidence score is below 0.9.

D.Retrain the model using the last three months of production data with labels.

AnswerD

Retraining with recent data realigns the model with the current applicant pool, directly addressing the covariate shift.

Why this answer

The drop in accuracy is due to data drift—the production data now has a different distribution (higher credit scores) than the training data. Retraining on the most recent three months of production data with labels directly addresses this shift by adapting the model to the new population, and it minimizes downtime because it uses existing infrastructure and avoids complex architectural changes.

Exam trap

CompTIA often tests the misconception that model performance degradation is always due to model architecture or hyperparameters, rather than recognizing data drift as the primary cause, leading candidates to choose complex solutions like ensembles or threshold adjustments instead of retraining on recent data.

How to eliminate wrong answers

Option A is wrong because adding a regularization term to penalize high credit scores would artificially bias the model against a legitimate feature value, reducing accuracy rather than correcting for distribution shift. Option B is wrong because deploying an ensemble with a neural network adds complexity, latency, and retraining cost without addressing the root cause of data drift, and it may not be feasible with the current 50ms latency requirement. Option C is wrong because rejecting predictions with confidence below 0.9 would discard many valid predictions (especially if the model is miscalibrated due to drift), reducing throughput and not fixing the underlying accuracy issue.

Full explanation →

353

MCQeasy

A chatbot developer uses a transformer-based model for customer service. Users complain that the chatbot sometimes gives offensive responses. Which technique should be applied first to mitigate this issue?

A.Increase the model size to improve its understanding of context.

B.Decrease the temperature parameter to make outputs more deterministic.

C.Train a separate classifier to detect offensive outputs in real time.

D.Review and filter the training dataset for offensive or biased language, then fine-tune the model.

AnswerD

Cleaning training data addresses the root cause.

Why this answer

Option D is correct because the root cause of offensive responses in transformer-based models is typically biased or toxic language present in the training data. Reviewing and filtering the dataset to remove such content, followed by fine-tuning the model, directly addresses the source of the problem. This approach aligns with the principle of data-centric AI, where improving data quality is the first step before modifying model architecture or inference parameters.

Exam trap

CompTIA often tests the misconception that modifying inference parameters (like temperature) or adding post-processing classifiers can fix fundamental data quality issues, when in fact the first and most effective mitigation is to address the training data itself.

How to eliminate wrong answers

Option A is wrong because increasing model size does not inherently fix biased or offensive outputs; larger models can actually amplify existing biases in the training data due to increased capacity to memorize patterns. Option B is wrong because decreasing the temperature parameter makes outputs more deterministic (lower randomness) but does not prevent the model from generating offensive content that it has learned from the data; it only reduces creative variation, not toxicity. Option C is wrong because training a separate classifier to detect offensive outputs in real time is a reactive measure that adds latency and complexity, whereas the proactive first step should be to clean the training data; a classifier also cannot prevent the model from generating offensive content in the first place.

Full explanation →

354

MCQeasy

A data scientist is preparing a dataset for training a classification model. The dataset contains 10,000 records with a binary target variable where 9,500 belong to class A and 500 belong to class B. Which technique should the scientist use to address the class imbalance?

A.SMOTE (Synthetic Minority Oversampling Technique)

B.Random undersampling of class A

C.Adding Gaussian noise to class B

D.Principal Component Analysis (PCA)

AnswerA

SMOTE creates synthetic minority samples to balance classes.

Why this answer

SMOTE is the correct technique because it generates synthetic samples for the minority class (class B) by interpolating between existing minority instances, effectively balancing the dataset without losing information. This approach avoids the overfitting risk of simple oversampling and the information loss of undersampling, making it ideal for a 19:1 imbalance ratio.

Exam trap

CompTIA often tests the misconception that any data augmentation (like adding noise) or dimensionality reduction (like PCA) can solve class imbalance, when in fact only resampling techniques like SMOTE directly address the skewed distribution of the target variable.

How to eliminate wrong answers

Option B is wrong because random undersampling of class A discards 9,000 majority class records, leading to significant information loss and potential bias in the model. Option C is wrong because adding Gaussian noise to class B does not create meaningful synthetic samples; it merely corrupts existing minority data, which can reduce model performance and introduce unrealistic variance. Option D is wrong because PCA is a dimensionality reduction technique used for feature extraction or noise reduction, not for addressing class imbalance in the target variable.

Full explanation →

355

MCQhard

An AI team notices that their model's performance degrades over time because the statistical relationship between input features and the target variable changes. This issue is called:

A.Data drift

B.Overfitting

C.Concept drift

D.Model drift

AnswerC

Correct; concept drift describes changes in the mapping from inputs to outputs.

Why this answer

Concept drift occurs when the statistical relationship between input features and the target variable changes over time, causing model performance to degrade. This is distinct from data drift, which involves changes in the input data distribution alone. In the AI0-001 context, concept drift directly addresses the shift in the underlying mapping from features to labels.

Exam trap

CompTIA often tests the distinction between data drift and concept drift, where candidates mistakenly choose data drift because they focus on the input features changing, rather than the relationship between features and the target.

How to eliminate wrong answers

Option A is wrong because data drift refers to changes in the distribution of input features, not the relationship between features and the target. Option B is wrong because overfitting is a model that memorizes training data noise and fails to generalize, not a temporal degradation due to shifting relationships. Option D is wrong because 'model drift' is not a standard term in machine learning; the correct term for the described phenomenon is concept drift.

Full explanation →

356

MCQmedium

A healthcare company must deploy a diagnostic AI model that uses protected health information (PHI). To comply with HIPAA, the operations team needs to ensure data privacy during model inference. Which practice should be implemented?

A.Run the model on-premises to avoid cloud data transmission

B.Encrypt all PHI at rest and in transit within the inference pipeline

C.Mask sensitive fields in the input data before inference

D.Apply differential privacy during model training only

AnswerB

Encryption ensures confidentiality of PHI.

Why this answer

Option B is correct because HIPAA mandates encryption of protected health information (PHI) both at rest and in transit to safeguard data confidentiality during model inference. Encrypting the entire inference pipeline ensures that even if data is intercepted or accessed without authorization, it remains unreadable. This practice directly addresses the compliance requirement for data privacy without relying on network location or partial obfuscation.

Exam trap

CompTIA often tests the misconception that on-premises deployment or data masking alone satisfies HIPAA, when in fact encryption of PHI at rest and in transit is the mandatory technical safeguard under the HIPAA Security Rule.

How to eliminate wrong answers

Option A is wrong because running the model on-premises does not inherently ensure data privacy; PHI could still be exposed through insecure storage, unencrypted logs, or internal network breaches, and HIPAA requires encryption regardless of deployment location. Option C is wrong because masking sensitive fields before inference only obscures data at the input stage, but the model may still process and output PHI in intermediate layers or results, leaving the pipeline vulnerable. Option D is wrong because differential privacy applied only during training does not protect PHI during inference; inference-time data must be protected with encryption and access controls to comply with HIPAA's operational requirements.

Full explanation →

357

Multi-Selecteasy

Which TWO are common attack vectors against AI systems? (Choose two.)

Select 2 answers

A.SQL injection

B.Cross-site scripting

C.Buffer overflow

D.Data poisoning

E.Adversarial examples

AnswersD, E

Corrupts training data.

Why this answer

Options B (Adversarial examples) and D (Data poisoning) are correct. Adversarial examples manipulate inputs to cause misclassification; data poisoning corrupts training data. Option A (SQL injection) is a traditional attack.

Option C (Buffer overflow) is a software vulnerability. Option E (Cross-site scripting) is a web attack.

Full explanation →

358

MCQhard

A data scientist is training a multi-class classifier with 10 classes. The training log shows the above output for the first two epochs. What is the most likely cause?

A.Batch normalization is disabled

B.The learning rate is set to zero

C.The dataset is imbalanced

D.The model is overfitting

AnswerB

A zero learning rate prevents any weight updates, so the model outputs remain at initial random values.

Why this answer

When the learning rate is set to zero, the optimizer makes no updates to the model weights regardless of the computed gradients. The training loss remains constant across epochs because the parameters never change, which matches the log showing identical loss values for both epochs. This is a common debugging scenario where a misconfigured learning rate prevents any learning from occurring.

Exam trap

CompTIA often tests the misconception that a flat loss curve is always due to data issues or model capacity, when in fact it is a classic symptom of a zero or extremely small learning rate that prevents any weight updates.

How to eliminate wrong answers

Option A is wrong because disabling batch normalization would cause training instability and fluctuating loss values, not a perfectly flat loss across epochs. Option C is wrong because an imbalanced dataset affects final accuracy and per-class performance, but the loss would still decrease (or oscillate) as the model learns the majority classes. Option D is wrong because overfitting is characterized by decreasing training loss with increasing validation loss, not a completely static training loss.

Full explanation →

359

MCQhard

A fraud detection model is trained on a dataset where only 0.1% of transactions are fraudulent. The model achieves 99.9% accuracy but fails to catch most frauds. Which metric should the team prioritize, and which technique could help?

A.Mean Squared Error; use L2 regularization

B.F1 score; use principal component analysis

C.Accuracy; collect more data

D.Precision-Recall AUC; use oversampling like SMOTE

AnswerD

Precision-Recall AUC evaluates minority class well; SMOTE generates synthetic samples.

Why this answer

With severe class imbalance, accuracy is misleading. Precision-Recall AUC focuses on minority class, and SMOTE oversamples it.

Full explanation →

360

MCQmedium

A company deployed an AI chatbot that started generating offensive responses after a data update. The security team needs to quickly mitigate the issue. What should they do first?

A.Delete the training data

B.Disable the chatbot and investigate

C.Roll back to previous model version

D.Add a content filter

AnswerB

Immediate containment and then investigation.

Why this answer

Option A (Disable the chatbot and investigate) is correct because immediate action to stop harm is critical, then root cause analysis. Option B (Roll back to previous model version) might work if the update caused it, but disabling first is safer. Option C (Add a content filter) may not catch all offensive responses.

Option D (Delete the training data) is irreversible and extreme.

Full explanation →

361

Multi-Selecteasy

Which TWO of the following are key stages in the AI lifecycle?

Select 2 answers

A.Human annotation of all data

B.Model retraining

C.Model deployment

D.Data collection

E.Manual feature extraction

AnswersC, D

Deploying the model into a production environment is a critical phase.

Why this answer

Data collection and model deployment are essential stages. Data collection provides the raw material for training, and model deployment puts the trained model into production. Manual feature extraction is becoming automated, human annotation is not always required, and model retraining should be continuous.

Full explanation →

362

Multi-Selectmedium

A computer vision team is building an image classifier for rare wildlife species. The dataset has only 500 images per class, and the model overfits. Which THREE data augmentation techniques are most likely to reduce overfitting? (Choose three.)

Select 3 answers

A.Horizontal flip

B.Adding Gaussian noise

C.Random cropping

D.Color jitter (brightness, contrast, saturation)

E.Random rotation by ±10 degrees

AnswersA, D, E

Flipping is a standard augmentation that doubles the dataset size.

Why this answer

Horizontal flip is a simple and effective data augmentation technique that doubles the training data by mirroring images, which helps the model generalize better to variations in orientation. This is particularly useful for wildlife images where the animal may appear facing left or right, reducing overfitting by exposing the model to more diverse examples without collecting new data.

Exam trap

CompTIA often tests the distinction between augmentations that preserve class labels (like flips and rotations) versus those that may alter semantic content (like extreme cropping or noise), leading candidates to overestimate the effectiveness of Gaussian noise for overfitting reduction.

Full explanation →

363

MCQmedium

A logistics company uses a machine learning model to predict delivery times based on historical data including distance, traffic, weather, and driver performance. The model is deployed as a REST API using Flask and run on a single server. Recently, the model has been returning predictions with high latency (over 2 seconds) during peak hours when the API receives 500 requests per second. The server has 8 CPU cores and 32 GB RAM. The model is a gradient boosting model (XGBoost) with 500 trees. The engineer wants to reduce inference latency to under 500ms without retraining the model. Which action is most effective?

A.Prune the model by reducing the number of trees to 100 and limit tree depth

B.Replace XGBoost with a linear regression model

C.Scale horizontally by deploying additional servers behind a load balancer

D.Increase server RAM to 128 GB

AnswerA

Pruning reduces computational load and latency while often maintaining adequate accuracy.

Why this answer

Option B is correct. Model pruning reduces the number of trees in the ensemble, directly lowering inference time. Option A is wrong because adding more servers (horizontal scaling) addresses throughput but not per-request latency; it may help if the bottleneck is CPU, but pruning is more efficient.

Option C is wrong because using a simpler model (linear regression) would require retraining and likely lose accuracy. Option D is wrong because increasing server memory does not speed up CPU-bound tree inference.

Full explanation →

364

Multi-Selectmedium

Which THREE of the following are techniques for handling missing data in machine learning?

Select 3 answers

A.Deletion of rows with missing values

B.Autoencoder reconstruction

C.Mean imputation

D.Principal Component Analysis

E.Using a separate category for missing values

AnswersA, C, E

Listwise deletion removes incomplete records; a basic approach.

Why this answer

Mean imputation replaces missing with the mean, deletion removes rows with missing, and flagging missing adds an indicator. Autoencoder imputation is more advanced but not a standard technique. PCA is for dimensionality reduction, not missing data handling.

Full explanation →

365

MCQmedium

An AIOps platform monitors server metrics and triggers alerts. The team notices too many false positives. Which adjustment should be made to the anomaly detection model?

A.Use a more complex model to better fit the data.

B.Shorten the observation window to detect anomalies faster.

C.Increase the training data to include more normal patterns.

D.Raise the anomaly score threshold for triggering alerts.

AnswerD

A higher threshold means only more extreme deviations trigger alerts.

Why this answer

Raising the anomaly score threshold (Option D) directly reduces false positives by requiring a higher deviation from normal behavior before an alert is triggered. In AIOps platforms, the anomaly score is a numeric value (e.g., 0–100) that quantifies how unusual a metric is; a higher threshold means only more extreme deviations generate alerts, filtering out minor fluctuations that were incorrectly flagged.

Exam trap

CompTIA often tests the misconception that adding more data or using a more complex model inherently improves accuracy, when in fact the threshold tuning is the direct lever for controlling false positive rates in operational AIOps systems.

How to eliminate wrong answers

Option A is wrong because using a more complex model increases the risk of overfitting to noise in the training data, which can actually increase false positives by treating random variations as anomalies. Option B is wrong because shortening the observation window makes the model more sensitive to short-term spikes and noise, which typically increases false positives rather than reducing them. Option C is wrong because increasing training data with more normal patterns can improve baseline accuracy, but it does not directly control the alerting sensitivity; false positives are primarily managed by the threshold, not by adding more normal data.

Full explanation →

366

MCQmedium

A healthcare AI system used for diagnosis shows a significant accuracy difference between demographic groups. Which technique should be applied to directly reduce this bias during model training?

A.Ignore the disparity as long as overall accuracy is acceptable

B.Retrain the model with more data from the underperforming group

C.Apply adversarial debiasing during training

D.Remove demographic attributes from the training data

AnswerC

Adversarial debiasing explicitly penalizes the model for encoding demographic information, reducing bias.

Why this answer

Adversarial debiasing is a training-time technique that forces the model to learn features uncorrelated with demographic attributes, directly reducing bias.

Full explanation →

367

MCQmedium

A financial services company is developing an AI model to detect fraudulent transactions. The dataset contains 99.9% legitimate transactions and 0.1% fraudulent ones. Which technique should the data scientist use to address the class imbalance problem?

A.Apply Synthetic Minority Oversampling Technique (SMOTE)

B.Use a bagging ensemble method

C.Undersample the legitimate transactions

D.Use cost-sensitive learning with higher weight on fraudulent class

AnswerA

SMOTE creates synthetic examples of the minority class, balancing the dataset without losing information.

Why this answer

SMOTE (Synthetic Minority Oversampling Technique) is the correct choice because it generates synthetic examples of the minority class (fraudulent transactions) by interpolating between existing minority instances, rather than duplicating them. This addresses the extreme 0.1% fraud rate without introducing overfitting or losing data, making it a standard technique for imbalanced classification problems in financial fraud detection.

Exam trap

CompTIA often tests the distinction between resampling techniques (SMOTE, undersampling) and algorithmic adjustments (cost-sensitive learning, ensemble methods), so candidates may incorrectly choose cost-sensitive learning because it 'handles imbalance' without recognizing that SMOTE is the specific data-level technique asked for.

How to eliminate wrong answers

Option B is wrong because bagging (bootstrap aggregating) is an ensemble method that reduces variance but does not directly address class imbalance; it would still train on the skewed distribution unless combined with resampling. Option C is wrong because undersampling the legitimate (majority) class would discard 99.9% of the data, causing severe information loss and potentially degrading model performance on legitimate transactions. Option D is wrong because cost-sensitive learning assigns higher misclassification costs to the minority class, which can help but is not a resampling technique; the question specifically asks for a technique to 'address the class imbalance problem' via data manipulation, and SMOTE is the direct resampling approach.

Full explanation →

368

MCQmedium

Refer to the exhibit. A data scientist reviews the pipeline and notes that the model performance degraded. Which change to the pipeline would most likely improve model performance?

A.Change the impute strategy from mean to median for the 'income' column.

B.Remove the normalization step entirely.

C.Drop the 'product_category' column instead of one-hot encoding.

D.Change the encoding method from onehot to label encoding.

AnswerA

Income often has outliers; median is less affected by extremes.

Why this answer

Imputing missing values with the mean is sensitive to outliers; using the median is more robust. Removing normalization or dropping product_category would lose information or harm scaling. Changing to label encoding on a nominal category could introduce false ordinal relationships.

Full explanation →

369

MCQhard

A company deploys a machine learning model that makes predictions on streaming data. Over time, the data distribution shifts, causing model performance to degrade. Which monitoring strategy is most appropriate to detect this drift?

A.Compare the distribution of predictions to the training set

B.Monitor the model's training loss

C.Retrain the model daily on new data

D.Track the model's accuracy on a fixed validation set over time

AnswerD

Accuracy drop on a static validation set indicates concept drift.

Why this answer

Option A is correct because tracking accuracy on a fixed validation set over time directly reveals performance degradation due to distribution shift. Option B is incorrect because training loss may remain low even with drift. Option C is incorrect because retraining daily is a response, not a detection method.

Option D is incorrect because comparing prediction distributions is less direct than performance metrics.

Full explanation →

370

MCQeasy

A company deployed a machine learning model on a cloud inference service. Users report high latency during peak hours. The model is deployed on a single instance. Which action should the team take to reduce latency without significant architectural changes?

A.Increase the model size to improve accuracy

B.Switch to a batch inference pipeline

C.Enable autoscaling for the inference instances

D.Add an API gateway to route requests

AnswerC

Autoscaling adds capacity during peak demand, reducing latency.

Why this answer

Enabling autoscaling allows the inference service to automatically add instances during high demand, distributing the load and reducing latency. Increasing the model size would worsen latency. Switching to a batch inference pipeline would increase latency for real-time requests.

Adding an API gateway does not address compute capacity.

Full explanation →

371

MCQeasy

An organization wants to ensure its AI systems comply with new regulations requiring explanations for automated decisions. Which governance practice is most directly relevant?

A.Implementing differential privacy

B.Deploying explainability tools

C.Conducting bias audits

D.Establishing an AI ethics board

AnswerB

Explainability tools generate explanations for individual decisions, meeting regulatory requirements.

Why this answer

Explainability tools (e.g., SHAP, LIME) provide post-hoc explanations for model decisions, directly addressing the requirement for explanations. The other options are important but not as directly relevant to providing explanations.

Full explanation →

372

MCQeasy

A data engineer discovers that a dataset contains duplicate rows. Which data cleaning step is MOST appropriate?

A.Keep only the first occurrence.

B.Fill duplicates with the mean.

C.Remove duplicate rows.

D.Convert duplicates to categorical.

AnswerC

Removing duplicates ensures each observation is unique.

Why this answer

Removing duplicate rows is the most appropriate data cleaning step because duplicate rows can bias statistical analyses and machine learning models by overrepresenting certain observations. In data engineering, deduplication is a standard preprocessing step to ensure data integrity and avoid skewed results. Option C directly addresses this by eliminating redundant entries without introducing artificial values or altering the data distribution.

Exam trap

CompTIA often tests the misconception that 'keeping the first occurrence' is a valid deduplication strategy, but in data engineering, this is arbitrary and can lead to data loss or bias, whereas explicit removal is the standard practice.

How to eliminate wrong answers

Option A is wrong because keeping only the first occurrence arbitrarily discards potentially valid later occurrences without considering context, which can introduce bias if duplicates are not truly identical. Option B is wrong because filling duplicates with the mean is nonsensical—duplicates are entire rows, not missing values, and imputing a mean would corrupt the dataset by replacing valid data with an aggregate. Option D is wrong because converting duplicates to categorical does not resolve the issue of overrepresentation; it merely relabels the problem without removing the redundant rows.

Full explanation →

373

MCQeasy

A data scientist is building a binary classification model to predict customer churn. The dataset has 10,000 samples with 80% non-churn and 20% churn. The model achieves 95% accuracy but fails to identify churners correctly. Which metric should the scientist focus on to evaluate model performance properly?

A.Precision

B.F1-score

C.Recall (TPR)

D.Specificity

AnswerC

Recall focuses on identifying positive cases, which is the main objective.

Why this answer

Option A is correct because recall (true positive rate) measures the ability to find positive (churn) cases, which is the goal in an imbalanced dataset. Option B, precision, is important but less critical when the cost of missing churners is high. Option C, F1-score, balances precision and recall but recall is more directly needed.

Option D, specificity, measures true negative rate, not relevant for catching churners.

Full explanation →

374

MCQeasy

A team deploys a real-time fraud detection model on a streaming platform. The model must produce predictions within 100 milliseconds per event. Initial latency is 150 ms. Which optimization is most likely to meet the latency requirement?

A.Apply model quantization to reduce precision from FP32 to INT8.

B.Increase the batch size to process more events simultaneously.

C.Add more feature engineering steps to improve model accuracy.

D.Migrate from a decision tree ensemble to a deep neural network.

AnswerA

Quantization reduces model size and speeds up computation, lowering latency.

Why this answer

Model quantization reduces the numerical precision of the model's weights and activations from FP32 to INT8, which decreases memory footprint and speeds up inference. This optimization directly addresses the 150 ms latency by enabling faster arithmetic operations on modern hardware, often cutting inference time by 2-4x, which can bring latency below the 100 ms requirement.

Exam trap

CompTIA often tests the misconception that increasing batch size or model complexity improves throughput for real-time systems, but candidates must recognize that real-time streaming requires low per-event latency, not high aggregate throughput.

How to eliminate wrong answers

Option B is wrong because increasing batch size processes more events simultaneously, which increases per-batch latency and is unsuitable for real-time streaming where each event must be handled individually within 100 ms. Option C is wrong because adding more feature engineering steps increases preprocessing time, worsening latency without guaranteeing a reduction in model inference time. Option D is wrong because migrating from a decision tree ensemble to a deep neural network typically increases model complexity and computational cost, raising latency rather than reducing it.

Full explanation →

375

MCQhard

Refer to the exhibit. A system administrator reviews the deployment. Which action should be taken to meet the SLA?

A.Retrain the model

B.Implement caching

C.Reduce model input size

D.Scale up the compute resources

AnswerD

Correct; more compute power can speed up inference.

Why this answer

The exhibit shows a deployment where inference latency exceeds the SLA requirement. Scaling up compute resources (e.g., adding more CPU cores, GPU memory, or increasing instance size) directly reduces per-request processing time by providing more parallel processing capacity, which is the most straightforward way to meet latency SLAs when the model is already optimized.

Exam trap

CompTIA often tests the misconception that retraining or caching are universal performance fixes, when in fact they address accuracy and request repetition respectively, not raw compute throughput.

How to eliminate wrong answers

Option A is wrong because retraining the model improves accuracy or adapts to new data, but does not inherently reduce inference latency unless the model architecture is changed to a smaller or more efficient one, which is not indicated. Option B is wrong because caching can reduce latency for repeated identical requests, but the exhibit does not suggest that requests are repetitive; caching does not help with unique or dynamic inputs. Option C is wrong because reducing model input size (e.g., downsampling images or truncating text) may lower latency but at the cost of accuracy or completeness, and the SLA likely requires maintaining output quality; scaling compute resources preserves model fidelity.

Full explanation →

Page 5 of 7

All pages

Practice AI0-001 by domain

Target a specific domain to shore up weak areas.

AI Concepts and Foundations Machine Learning and Deep Learning AI Models and Data Engineering AI Implementation and Operations AI Security, Ethics and Governance

See all domains with question counts →