Knowledge + Practice

CompTIA AI+ AI0-001 (AI0-001) — Questions 376–450

500 questions total · 7pages · All types, answers revealed

Take a mock exam Exam hub

Page 6 of 7

376

Multi-Selecteasy

Which TWO of the following are common techniques to reduce overfitting in a neural network?

Select 2 answers

A.Increasing the number of hidden layers

B.Using a larger learning rate

C.L2 regularization

D.Training for more epochs

E.Dropout

AnswersC, E

Correct; L2 regularization adds a penalty on squared weights.

Why this answer

L2 regularization (option C) reduces overfitting by adding a penalty term proportional to the squared magnitude of the weights to the loss function. This forces the network to keep weights small, preventing it from fitting noise in the training data and improving generalization.

Exam trap

CompTIA often tests the misconception that adding more layers or training longer always improves accuracy, when in fact these actions typically increase overfitting without proper regularization or validation monitoring.

Full explanation →

377

MCQeasy

An e-commerce company deploys a model to recommend products to users. The recommendation system uses collaborative filtering based on user-item interaction history. After deployment, the model shows decreasing click-through rates (CTR) over time. The data engineer notices that the model was trained on data from the past six months and is retrained daily. However, the trend suggests that user preferences are shifting more rapidly than expected. The engineer suspects that the model is suffering from distribution drift. Which approach should the engineer implement to adapt the model more quickly to changing user behavior?

A.Increase the retraining period to once per week to reduce computational cost

B.Switch to an online learning algorithm that updates the model after each user click

C.Increase the model complexity by adding more features and layers

D.Use only the last week of data for training to focus on recent trends

AnswerB

Online learning continuously adapts to new data, capturing shifts in user preferences promptly.

Why this answer

Option A is correct. Online learning allows the model to update incrementally with each new interaction, adapting quickly to changes. Option B is wrong because batch retraining weekly is slower than daily.

Option C is wrong because using only last week's data may not provide enough data and could be noisy. Option D is wrong because increasing model complexity may cause overfitting and is not a direct solution to drift.

Full explanation →

378

Multi-Selectmedium

Which TWO techniques are commonly used for feature selection in machine learning? (Choose 2)

Select 2 answers

A.Principal Component Analysis (PCA)

B.SMOTE

C.L1 regularization (Lasso)

D.Dropout

E.Recursive Feature Elimination (RFE)

AnswersC, E

Lasso can zero out coefficients.

Why this answer

L1 regularization (Lasso) is correct because it adds a penalty equal to the absolute value of the magnitude of coefficients, which can shrink some coefficients exactly to zero, effectively performing feature selection by removing irrelevant features from the model. This makes it a built-in feature selection technique within the training process.

Exam trap

CompTIA often tests the distinction between dimensionality reduction (PCA) and feature selection, where candidates mistakenly think PCA selects original features rather than creating new ones.

Full explanation →

379

MCQhard

An organization uses an AI-based hiring tool. To prevent bias, they want to ensure the model's decisions are explainable. Which approach is most suitable?

A.Use reinforcement learning with fairness constraints

B.Use a simpler interpretable model like logistic regression

C.Use a black-box deep learning model with SHAP explanations

D.Use ensemble methods with feature importance

AnswerB

Simple models are inherently explainable.

Why this answer

Option D (Use a simpler interpretable model like logistic regression) is correct because simple models are inherently interpretable. Option A (Use a black-box deep learning model with SHAP) still lacks full interpretability. Option B (Use ensemble methods with feature importance) is complex.

Option C (Use reinforcement learning with fairness constraints) is hard to explain.

Full explanation →

380

MCQhard

A deep learning model for image classification achieves 99% training accuracy but only 85% validation accuracy. The model has millions of parameters. Which technique is most likely to reduce overfitting while maintaining high accuracy?

A.Reduce batch size from 32 to 8

B.Decrease the learning rate by a factor of 10

C.Add dropout layers with a rate of 0.5 after each convolutional block

D.Increase the number of training epochs to 500

AnswerC

Dropout is a standard regularization technique for deep networks.

Why this answer

Option A is correct because dropout randomly deactivates neurons during training, acting as regularization and reducing overfitting. Option B is wrong because increasing epochs further will likely worsen overfitting. Option C is wrong because reducing batch size can increase training noise but is not a primary anti-overfitting technique.

Option D is wrong because reducing learning rate may help convergence but does not directly combat overfitting caused by model capacity.

Full explanation →

381

MCQhard

A data scientist is using a gradient boosting model (XGBoost) for a regression task and observes that the model's performance on the training set is much better than on the test set. Which hyperparameter tuning strategy would most effectively reduce overfitting?

A.Increase the number of boosting rounds

B.Increase the learning rate

C.Reduce the maximum depth of trees

D.Subsample less than 1.0

AnswerC

Shallow trees are less complex and generalize better, reducing overfitting.

Why this answer

Option A (Increase learning rate) makes each tree more influential, increasing overfitting. Option C (Increase boosting rounds) increases model complexity. Option D (Subsample less than 1.0) introduces randomness but is less direct than tree depth.

Option B (Reduce max depth) limits tree complexity, reducing overfitting.

Full explanation →

382

Multi-Selecthard

Which THREE of the following are key considerations when deploying an AI model in a production environment?

Select 3 answers

A.Basing acceptance solely on training accuracy

B.Maximizing model complexity to achieve the best accuracy

C.Monitoring model performance for data drift

D.Ensuring inference latency meets service-level agreements

E.Providing explainability for model decisions

AnswersC, D, E

Correct; models degrade over time if data changes.

Why this answer

Option C is correct because data drift refers to the change in the statistical properties of the input data over time, which can degrade model accuracy. Continuous monitoring for data drift is essential in production to detect when the model's assumptions about the data distribution are no longer valid, triggering retraining or alerts.

Exam trap

CompTIA often tests the misconception that high training accuracy is the primary goal for production deployment, when in reality operational concerns like latency, explainability, and drift monitoring are prioritized over raw accuracy.

Full explanation →

383

MCQmedium

A hospital uses an AI system to predict patient deterioration from vital signs. The system currently uses a logistic regression model trained on data from the past year. Recently, the hospital adopted a new patient monitoring device that provides more accurate readings. The model's performance has dropped significantly. The data science team has access to the new device's data for the past month and wants to improve the model with minimal disruption. The team also wants to ensure the model remains interpretable for regulatory compliance. Which approach should they take?

A.Retrain the logistic regression model on a combined dataset of old and new device data

B.Continue using the current model and manually adjust predictions based on device differences

C.Build an ensemble of logistic regression and a neural network using new data only

D.Replace the logistic regression model with a gradient boosting model using only new device data

AnswerA

This incorporates the new device's accuracy while maintaining interpretability and using all available data.

Why this answer

Retraining the logistic regression model on a combined dataset of old and new device data is the best approach because it leverages all available data to adapt the model to the new device's measurement distribution while preserving the model's inherent interpretability. Logistic regression is a linear model that remains fully transparent for regulatory compliance, and combining both datasets helps the model learn the systematic shift in vital sign readings without discarding valuable historical patterns. This minimizes disruption by avoiding a complete overhaul and directly addresses the performance drop caused by the change in input data distribution.

Exam trap

CompTIA often tests the trade-off between model performance and interpretability, and the trap here is that candidates may prioritize performance gains from complex models (like gradient boosting or neural networks) without recognizing that regulatory compliance mandates interpretability, making logistic regression the only viable choice despite its simplicity.

How to eliminate wrong answers

Option B is wrong because manually adjusting predictions based on device differences is ad-hoc, non-scalable, and introduces subjective bias, which undermines both reliability and regulatory compliance. Option C is wrong because building an ensemble with a neural network reduces interpretability, violating the regulatory requirement, and using only new data ignores valuable historical patterns, leading to overfitting and poor generalization. Option D is wrong because replacing logistic regression with a gradient boosting model sacrifices interpretability, as gradient boosting is a black-box model, and training only on one month of new data risks overfitting and fails to capture long-term trends.

Full explanation →

384

Multi-Selectmedium

Which THREE of the following are key principles of AI ethics as defined by major frameworks?

Select 3 answers

A.Transparency

B.Scalability

C.Accountability

D.Latency

E.Fairness

AnswersA, C, E

Transparency is about openness in AI systems' workings and decisions.

Why this answer

Transparency, fairness, and accountability are core principles across many AI ethics frameworks. Scalability and latency are technical performance metrics, not ethics.

Full explanation →

385

MCQeasy

Which practice best ensures AI systems comply with regulations like GDPR?

A.Using open-source models only

B.Regular vulnerability scans

C.Data minimization and anonymization

D.Hiring more data scientists

AnswerC

Directly supports GDPR requirements.

Why this answer

Option C (Data minimization and anonymization) is correct because GDPR requires collecting only necessary data and protecting privacy. Option A (Regular vulnerability scans) is for security. Option B (Using open-source models only) does not guarantee compliance.

Option D (Hiring more data scientists) does not directly address compliance.

Full explanation →

386

MCQmedium

A real-time recommendation system requires low latency. Which data storage strategy is best for serving user profiles and item embeddings?

A.Time-series database (e.g., InfluxDB)

B.In-memory key-value store (e.g., Redis)

C.Relational database with joins

D.Data lake in object storage (e.g., Amazon S3)

AnswerB

In-memory key-value stores offer sub-millisecond reads, suitable for real-time serving.

Why this answer

An in-memory key-value store like Redis is ideal for serving user profiles and item embeddings in a real-time recommendation system because it provides sub-millisecond read/write latency by keeping data in RAM. This directly meets the low-latency requirement for fetching embeddings and profiles on each request, without the overhead of disk I/O or complex query processing.

Exam trap

CompTIA often tests the misconception that any database with 'fast' in its name (like InfluxDB) is suitable for real-time serving, but the trap is that time-series databases prioritize write throughput and range queries over low-latency point lookups, which is the actual requirement here.

How to eliminate wrong answers

Option A is wrong because time-series databases like InfluxDB are optimized for timestamped metrics (e.g., CPU usage, sensor data) and not for low-latency key-based lookups of user profiles or embeddings; they introduce unnecessary indexing and query overhead. Option C is wrong because relational databases with joins require disk-based storage and SQL parsing, which adds significant latency (often tens to hundreds of milliseconds) compared to in-memory key-value access, making them unsuitable for real-time serving. Option D is wrong because a data lake in object storage like Amazon S3 has high latency (typically 100-200 ms per request) due to HTTP-based API calls and is designed for batch analytics, not for serving individual records in real-time.

Full explanation →

387

MCQeasy

Which ethical concern is most directly associated with AI systems that fully automate decision-making without human oversight?

A.Loss of human autonomy

B.Data storage costs

C.Job displacement

D.Environmental impact

AnswerA

Automated decisions remove human judgment, reducing autonomy and accountability.

Why this answer

Lack of human oversight can lead to ethical issues like biased decisions or unfair outcomes without recourse.

Full explanation →

388

Multi-Selecteasy

Which THREE components are essential in an MLOps pipeline?

Select 3 answers

A.Data versioning

B.Manual code review

C.Deployment automation

D.Hardware procurement

E.Automated model testing

AnswersA, C, E

Versioning data is crucial for reproducibility and tracking.

Why this answer

Data versioning (A) is essential in an MLOps pipeline because it ensures reproducibility and traceability of datasets used for training, validation, and testing. Without versioning, changes to data cannot be tracked, leading to inconsistent model behavior and difficulty in debugging. Tools like DVC or Git LFS enable precise snapshotting of data, which is critical for auditing and rollback in production AI systems.

Exam trap

CompTIA often tests the distinction between operational pipeline components (automation, testing, versioning) and peripheral activities (procurement, manual reviews) to see if candidates understand that MLOps is about automating the ML lifecycle, not general IT operations.

Full explanation →

389

MCQmedium

A team deploying an AI model for real-time fraud detection notices that inference latency is too high. The model is a deep neural network with 50 layers, deployed on a cloud GPU. Which of the following is the BEST approach to reduce latency while maintaining acceptable accuracy?

A.Deploy the model on a more powerful GPU.

B.Reduce the batch size for inference.

C.Replace the DNN with a logistic regression model.

D.Apply knowledge distillation to create a smaller model.

AnswerD

Correct; distillation compresses the model while preserving performance.

Why this answer

Knowledge distillation trains a smaller 'student' model to mimic the behavior of a larger 'teacher' model, significantly reducing the number of parameters and layers while preserving most of the original accuracy. This directly addresses the high inference latency caused by the 50-layer DNN by producing a compact model that runs faster on the same GPU hardware.

Exam trap

CompTIA often tests the misconception that simply upgrading hardware or reducing batch size is the best latency fix, when in fact architectural compression techniques like knowledge distillation are the most effective for deep models with strict latency budgets.

How to eliminate wrong answers

Option A is wrong because upgrading to a more powerful GPU only provides a linear speedup and does not address the fundamental architectural overhead of a 50-layer network; it also increases cost without guaranteeing latency targets. Option B is wrong because reducing batch size actually increases the number of inference passes per transaction, which can increase per-request latency due to underutilized GPU parallelism and higher overhead from frequent kernel launches. Option C is wrong because replacing the DNN with logistic regression would cause a catastrophic drop in accuracy for complex fraud patterns, as logistic regression cannot model non-linear interactions and high-dimensional feature spaces that the DNN captures.

Full explanation →

390

MCQhard

A social media company uses an AI content moderation system to filter hate speech. The system uses a natural language processing model trained on user reports. Recently, the model's false positive rate has increased, blocking legitimate posts. An internal audit reveals that a coordinated group of users has been falsely reporting harmless posts, causing the model to learn incorrect patterns. The company needs to address the attack and restore accuracy. The engineering team can modify the training pipeline. What is the most effective first step?

A.Redesign the training pipeline to incorporate a reputation system for reporting users

B.Increase the weight of non-reported posts to counteract the reported posts' influence

C.Apply adversarial training to make the model robust to crafted inputs

D.Retrain the model on a dataset that excludes all user-reported posts

AnswerA

A reputation system filters out malicious reports, preventing poisoning at the source.

Why this answer

Option D is correct because the attack exploits the reporting system; redesigning it to detect coordinated behavior reduces poisoning. Option A is wrong because retraining on non-reported data may not capture all necessary patterns. Option B is wrong because ignoring reported data removes a valuable signal.

Option C is wrong because adversarial training is not designed for this type of poisoning.

Full explanation →

391

MCQeasy

An engineer is building a regression model to predict housing prices. The dataset includes features such as square footage, number of bedrooms, and year built. The engineer notices that the square footage values range from 500 to 10,000, while the number of bedrooms ranges from 1 to 5. Which preprocessing step is most critical before training a gradient descent-based model?

A.Use k-fold cross-validation

B.Apply log transformation to all features

C.Normalize or standardize the features

D.One-hot encode the features

AnswerC

Scaling improves convergence of gradient descent.

Why this answer

Gradient descent-based models are sensitive to the scale of input features because they update weights proportionally to the gradient, which is influenced by feature magnitudes. With square footage ranging 500–10,000 and bedrooms 1–5, the larger feature will dominate the gradient, causing slow or unstable convergence. Normalizing or standardizing (e.g., Z-score or min-max scaling) ensures all features contribute equally, leading to faster and more reliable training.

Exam trap

CompTIA often tests the misconception that any data transformation (like log or one-hot encoding) is universally beneficial, but the key is matching the preprocessing step to the model's mathematical requirements—here, gradient descent's sensitivity to scale makes normalization/standardization the critical step.

How to eliminate wrong answers

Option A is wrong because k-fold cross-validation is a model evaluation technique to assess generalization, not a preprocessing step to address feature scale issues. Option B is wrong because log transformation is used to handle skewed distributions or multiplicative relationships, not to rescale features with different ranges; applying it to all features (including integer counts like bedrooms) can distort their meaning and is unnecessary for gradient descent scaling. Option D is wrong because one-hot encoding is used for categorical features to convert them into binary vectors, but the features listed (square footage, bedrooms, year built) are all numerical and do not require encoding.

Full explanation →

392

Multi-Selecteasy

Which TWO of the following are common threats to AI model security?

Select 2 answers

A.SQL injection

B.Data poisoning

C.Adversarial examples

D.Distributed denial-of-service (DDoS)

E.Phishing attacks

AnswersB, C

Malicious data inserted during training to corrupt model.

Why this answer

Options C and D are correct because adversarial examples and data poisoning are classic AI attacks. Option A is wrong because SQL injection targets databases, not models. Option B is wrong because phishing targets humans.

Option E is wrong because DDoS targets network availability.

Full explanation →

393

MCQeasy

A cybersecurity analyst monitors an AI chatbot that frequently produces offensive responses when given specific prompts. The development team suspects an adversarial attack. Which mitigation strategy is most effective against such prompt injection attacks?

A.Retrain the model on a larger, curated dataset

B.Encrypt all communication between users and the chatbot

C.Reduce the model's number of parameters

D.Implement input validation and sanitization

AnswerD

Validating inputs can neutralize injection attempts.

Why this answer

Option C is correct because input validation and filtering can detect and block malicious prompts. Option A is wrong because retraining on clean data does not prevent injection attacks. Option B is wrong because reducing model size is unrelated.

Option D is wrong because encryption protects data in transit, not input processing.

Full explanation →

394

MCQhard

An AI team notices that a model's F1 score on the validation set is 0.95, but on the test set it drops to 0.72. Which course of action is most appropriate?

A.Reduce the training dataset size.

B.Adjust the train/test split to be more representative.

C.Increase model complexity.

D.Apply regularization.

AnswerD

Regularization penalizes large weights or complex structures, reducing overfitting and improving generalization.

Why this answer

The F1 score dropping from 0.95 on the validation set to 0.72 on the test set is a classic sign of overfitting, where the model has memorized the training/validation data but fails to generalize to unseen test data. Applying regularization (e.g., L1/L2 weight decay, dropout) is the most appropriate course of action because it penalizes overly complex models, reduces variance, and improves generalization without requiring more data or changing the model architecture.

Exam trap

CompTIA often tests the distinction between overfitting (high variance) and underfitting (high bias), and the trap here is that candidates may incorrectly choose to increase model complexity (Option C) because they focus on the high validation score rather than recognizing the performance drop as a variance problem.

How to eliminate wrong answers

Option A is wrong because reducing the training dataset size would likely worsen overfitting by providing even less data for the model to learn generalizable patterns, increasing variance. Option B is wrong because adjusting the train/test split to be more representative does not address the underlying overfitting issue; the model's poor test performance is due to high variance, not a biased or unrepresentative split. Option C is wrong because increasing model complexity (e.g., adding more layers or parameters) would exacerbate overfitting, further increasing the gap between validation and test performance.

Full explanation →

395

Multi-Selecthard

A data engineer is designing a pipeline for a streaming data application that uses a machine learning model to detect anomalies in real time. Which TWO practices should the engineer implement to ensure data quality and model reliability?

Select 2 answers

A.Use batch processing to transform data in fixed intervals

B.Store all raw data indefinitely for future analysis

C.Use a sliding window for feature computation

D.Implement data validation checks at the ingestion point

E.Retrain the model on a fixed schedule every 24 hours

AnswersC, D

Sliding windows allow the model to use the most recent data for accurate anomaly detection.

Why this answer

Option C is correct because streaming anomaly detection requires real-time feature computation over recent data, and a sliding window ensures that only the most relevant data points are used for model inference, maintaining low latency and adapting to concept drift. This approach avoids the staleness of batch processing and aligns with the continuous nature of streaming pipelines.

Exam trap

CompTIA often tests the misconception that batch processing or fixed retraining schedules are sufficient for real-time streaming applications, when in fact sliding windows and continuous validation are required to maintain low latency and model accuracy.

Full explanation →

396

MCQhard

A deep learning model for image classification is overfitting due to a small dataset. The team decides to apply data augmentation. Which augmentation technique is least likely to preserve the label?

A.Adding random noise

B.Random rotation

C.Random cropping and rescaling

D.Horizontal flip

AnswerC

Cropping might remove the object of interest, rendering the label invalid.

Why this answer

Random cropping and rescaling is least likely to preserve the label because it can cut out the primary object or distort its proportions, potentially removing the discriminative features needed for correct classification. For example, cropping a dog image to show only the background or a leg could change the semantic meaning, making the label 'dog' incorrect. In contrast, other techniques like noise, rotation, or flipping typically retain the core subject and its label.

Exam trap

CompTIA often tests the misconception that all augmentations are equally label-preserving, but the trap here is that random cropping can alter the semantic content by removing the object, while other transformations like rotation or flipping maintain the object's presence and identity.

How to eliminate wrong answers

Option A is wrong because adding random noise preserves the label; it introduces pixel-level variations that help generalize without altering the object's identity or spatial layout. Option B is wrong because random rotation preserves the label; rotating an image by small or moderate angles does not change the object class, as CNNs are designed to be rotation-invariant to some degree. Option D is wrong because horizontal flip preserves the label; mirroring an image does not change the object's category (e.g., a cat remains a cat), and it is a standard augmentation for symmetric or non-directional objects.

Full explanation →

397

MCQeasy

A team of data scientists and engineers is working on multiple AI projects. They often struggle to reproduce experiments and manage model versions. Which tool or practice should they adopt?

A.Document experiments in a shared Word document.

B.Share code via email attachments.

C.Keep all models in a shared network drive.

D.Use an MLOps platform that provides version control, tracking, and reproducibility.

AnswerD

MLOps platforms are designed to manage the ML lifecycle effectively.

Why this answer

Option D is correct because an MLOps platform (e.g., MLflow, Kubeflow, or Vertex AI) provides integrated version control for code, data, and models, along with experiment tracking and reproducibility. This directly addresses the team's struggle to reproduce experiments and manage model versions by automating lineage capture and enabling consistent environment recreation.

Exam trap

CompTIA often tests the misconception that simple file-sharing or document-based approaches are sufficient for reproducibility, when in fact they lack the automated lineage and environment locking that MLOps platforms provide.

How to eliminate wrong answers

Option A is wrong because a shared Word document lacks automated versioning, dependency tracking, and execution capture, making it impossible to reliably reproduce experiments from static text. Option B is wrong because sharing code via email attachments introduces version confusion, lacks any form of change tracking or environment locking, and violates basic software engineering practices for collaboration. Option C is wrong because keeping models on a shared network drive provides no version history, no lineage to training code or data, and no mechanism to roll back or compare model iterations, leading to overwrites and irreproducible results.

Full explanation →

398

MCQhard

A data pipeline ingests streaming data from IoT sensors. The current batch processing pipeline causes stale predictions. Which architecture change is most appropriate?

A.Use a larger batch interval

B.Revert to micro-batch processing with Apache Spark

C.Store raw data in Hadoop HDFS

D.Implement Apache Kafka and stream processing

AnswerD

Kafka combined with stream processing (e.g., Kafka Streams, Flink) enables real-time ingestion and prediction.

Why this answer

Option B is correct because stream processing (e.g., with Kafka and Spark Streaming) processes data in real-time, reducing latency. Option A (micro-batch) is still batch with small intervals, but option B is more explicit. Option C worsens staleness.

Option D stores raw data but does not process faster.

Full explanation →

399

Multi-Selectmedium

A data engineer is designing a feature store for machine learning. Which THREE components are essential for a feature store? (Choose THREE.)

Select 3 answers

A.Data ingestion pipeline

B.Online serving layer

C.Feature repository

D.Experiment tracking

E.Model registry

AnswersA, B, C

The ingestion pipeline brings new data sources into the feature store to keep features current.

Why this answer

A feature store requires a feature repository to store feature definitions and values, an online serving layer to serve features at low latency for inference, and a data ingestion pipeline to continuously update features. Model registry and experiment tracking are important for MLOps but are not core components of a feature store.

Full explanation →

400

MCQhard

A self-driving car company is developing an object detection system using a convolutional neural network (CNN). The system needs to detect pedestrians and vehicles in real-time with high accuracy. Which technique can reduce inference time while maintaining accuracy?

A.Apply model pruning and quantization

B.Use a pre-trained model and fine-tune it

C.Add more convolutional layers

D.Increase number of filters in each layer

AnswerA

Pruning removes unimportant weights, and quantization reduces precision of weights, both speeding up inference while preserving accuracy.

Why this answer

Model pruning removes redundant or less important weights from the CNN, reducing computational load, while quantization converts floating-point weights to lower-precision integers (e.g., INT8). Together, they shrink model size and speed up inference without significantly degrading accuracy, making them ideal for real-time object detection in resource-constrained environments like autonomous vehicles.

Exam trap

CompTIA often tests the misconception that adding more layers or filters always improves performance, when in fact it increases latency and resource usage, while pruning and quantization are the standard techniques for reducing inference time without sacrificing accuracy.

How to eliminate wrong answers

Option B is wrong because fine-tuning a pre-trained model improves accuracy for a specific task but does not inherently reduce inference time; it may even increase it if the model remains large. Option C is wrong because adding more convolutional layers increases the network depth and computational cost, which slows inference and can cause overfitting without careful regularization. Option D is wrong because increasing the number of filters in each layer expands the feature map channels, raising the number of parameters and FLOPs, which directly increases inference time.

Full explanation →

401

MCQmedium

Refer to the exhibit. A security analyst reviews the monitoring log for an AI fraud detection model. Which of the following is the most likely cause of the multiple alerts?

A.Data poisoning of the training dataset

B.A network hardware failure

C.An adversarial attack attempt

D.A scheduled model retraining process

AnswerC

Multiple concurrent alerts indicate active probing or evasion.

Why this answer

Option B is correct because the combination of drift, high query rate, and degraded performance suggests an ongoing adversarial attack aimed at probing the model. Option A is wrong because data poisoning would have been detected earlier, not cause real-time drift. Option C is wrong because model retraining could cause temporary instability but not the specific query pattern.

Option D is wrong because network issues would cause latency but not feature drift or query rate anomalies.

Full explanation →

402

MCQmedium

During model monitoring, a loan approval model shows disparate impact against a protected group. The model's overall accuracy is high, but the false positive rate for the protected group is 0.12 compared to 0.02 for other groups. Which action should the operations team take first?

A.Document the disparity and proceed with deployment because accuracy is high

B.Replace the model with a simpler model that is less discriminatory

C.Retrain the model with reweighted training data to minimize disparity

D.Adjust the decision threshold for the protected group to equalize false positive rates

AnswerC

Retraining with fairness constraints directly mitigates bias in the model.

Why this answer

Option C is correct because retraining the model with reweighted training data directly addresses the root cause of disparate impact—biased historical data—by assigning higher weights to underrepresented groups during training. This technique, often implemented via cost-sensitive learning or sample reweighting, adjusts the model's internal decision boundaries to reduce false positive rate disparities without sacrificing overall accuracy. The operations team should first attempt to mitigate bias at the data level before considering threshold adjustments or model replacement, as reweighting preserves the model's learned patterns while promoting fairness.

Exam trap

CompTIA often tests the misconception that adjusting the decision threshold for a specific group is a quick fix for disparate impact, but the trap is that this violates the principle of equal treatment and can introduce legal liability, whereas retraining with reweighted data addresses bias at the algorithmic level without changing the decision rule per group.

How to eliminate wrong answers

Option A is wrong because documenting the disparity and proceeding with deployment ignores the ethical and regulatory requirement to address disparate impact, even if overall accuracy is high; high accuracy can mask significant bias against protected groups. Option B is wrong because replacing the model with a simpler model does not guarantee less discrimination—simplicity does not correlate with fairness, and a simpler model may still exhibit bias or have lower predictive performance. Option D is wrong because adjusting the decision threshold for the protected group alone treats the symptom (unequal false positive rates) rather than the cause, and can lead to calibration drift, reduced model interpretability, and potential legal issues under the Equal Credit Opportunity Act (ECOA) by applying different standards to different groups.

Full explanation →

403

MCQhard

A financial institution is deploying a reinforcement learning agent to optimize stock trading decisions. The agent is trained in a simulated environment that mimics historical market data. After deployment, the agent performs well initially but then suffers large losses during a period of high volatility that was underrepresented in the training data. The team wants to make the agent more robust to such market conditions without retraining from scratch. They have a budget for additional simulation compute and access to a broader historical dataset including past crises. The agent uses a deep Q-network (DQN) architecture. Which strategy should they adopt?

A.Increase the replay buffer size and continue training on the original dataset

B.Keep the DQN but perform extensive hyperparameter tuning on the original data

C.Modify the DQN to use a recurrent neural network (e.g., DRQN) and train on the expanded dataset

D.Switch to a policy gradient method with a random exploration strategy

AnswerC

Recurrent networks capture temporal dynamics better, and training on a more diverse dataset improves robustness.

Why this answer

Option C is correct because a Deep Recurrent Q-Network (DRQN) can capture temporal dependencies in market data, which is crucial for handling volatile periods that were underrepresented in training. By training on the expanded dataset that includes past crises, the agent can learn from sequential patterns of volatility, making it more robust without requiring a complete retraining from scratch. This approach leverages the existing DQN architecture while adding recurrent layers to better model the dynamic market conditions.

Exam trap

CompTIA often tests the misconception that simply tuning hyperparameters or expanding the replay buffer can fix a model's inability to generalize to unseen distributions, when the real solution requires a change in architecture to handle temporal dependencies.

How to eliminate wrong answers

Option A is wrong because simply increasing the replay buffer size and continuing training on the original dataset does not address the core issue of underrepresented high-volatility data; the agent would still lack exposure to those critical patterns. Option B is wrong because hyperparameter tuning on the original data cannot compensate for missing training examples of volatile market conditions; it only optimizes performance on the existing distribution. Option D is wrong because switching to a policy gradient method with random exploration does not inherently improve robustness to rare events; it may even increase variance and instability without addressing the data deficiency.

Full explanation →

404

MCQhard

A data scientist splits a dataset into training (80%) and test (20%). After training, the model achieves 95% accuracy on training and 60% on test. Which step should the data scientist take first?

A.Collect more data

B.Use cross-validation

C.Apply regularization

D.Increase model complexity

AnswerC

Regularization penalizes large weights, reducing overfitting.

Why this answer

The model shows high training accuracy (95%) but significantly lower test accuracy (60%), which is a classic sign of overfitting. Regularization (Option C) directly addresses overfitting by adding a penalty term to the loss function (e.g., L1 or L2 regularization), discouraging the model from learning overly complex patterns that do not generalize. This is the first step because it targets the core issue without requiring additional data or increasing complexity.

Exam trap

CompTIA often tests the misconception that overfitting is always solved by more data or cross-validation, but the immediate corrective action is to apply regularization to penalize model complexity.

How to eliminate wrong answers

Option A is wrong because collecting more data can help reduce overfitting, but it is not the first step; regularization is a simpler, more immediate fix that does not depend on data availability. Option B is wrong because cross-validation is a technique for model evaluation and hyperparameter tuning, not a direct remedy for overfitting; it would help assess the severity but does not solve the underlying problem. Option D is wrong because increasing model complexity would worsen overfitting, as it allows the model to fit noise even more closely, further reducing test accuracy.

Full explanation →

405

MCQhard

A team is training a deep learning model for natural language processing using a large corpus. They notice the model has a very high number of parameters and training is slow. Which technique can reduce the number of parameters without significant performance loss?

A.Apply embedding compression

B.Add more dropout layers

C.Use a larger batch size

D.Increase learning rate

AnswerA

Embedding compression reduces the dimensionality of embedding layers, directly reducing parameters with minimal impact on performance.

Why this answer

Embedding compression reduces the dimensionality of the embedding layer, which often contains the majority of the model's parameters in NLP tasks. By using techniques like low-rank factorization or pruning, the model retains most of its representational power while significantly decreasing the parameter count and training time.

Exam trap

The trap here is that candidates confuse regularization techniques (like dropout) or training speed optimizations (batch size, learning rate) with actual parameter reduction, which only embedding compression directly achieves.

How to eliminate wrong answers

Option B is wrong because adding more dropout layers does not reduce the number of parameters; it only randomly drops neurons during training to prevent overfitting, leaving the parameter count unchanged. Option C is wrong because using a larger batch size improves training speed through better hardware utilization but does not reduce the number of parameters. Option D is wrong because increasing the learning rate can speed up convergence but does not affect the parameter count and may cause training instability or divergence.

Full explanation →

406

MCQmedium

A machine learning model for credit card fraud detection is deployed. The model's precision is 0.95 and recall is 0.60. The business cost of missing a fraud is very high. Which of the following should the team prioritize to reduce the number of false negatives?

A.Use a different model algorithm.

B.Add more features.

C.Increase the classification threshold.

D.Decrease the classification threshold.

AnswerD

Lower threshold classifies more cases as positive, thus catching more actual frauds (reducing false negatives).

Why this answer

Decreasing the classification threshold makes the model more sensitive, classifying more transactions as fraudulent. This increases recall (reducing false negatives) at the cost of precision. Given the high cost of missing fraud, lowering the threshold is the direct way to capture more true positives, even if it increases false positives.

Exam trap

CompTIA often tests the misconception that improving model accuracy or changing algorithms is the primary fix, when in fact adjusting the decision threshold is the simplest and most effective way to address precision-recall trade-offs for high-cost false negatives.

How to eliminate wrong answers

Option A is wrong because simply switching algorithms does not guarantee a reduction in false negatives; the threshold and cost function matter more. Option B is wrong because adding more features may improve overall model performance but does not directly target the trade-off between precision and recall; it could even increase false negatives if the new features are noisy. Option C is wrong because increasing the classification threshold makes the model more conservative, reducing false positives but increasing false negatives, which is the opposite of what is needed.

Full explanation →

407

MCQmedium

An organization needs to classify customer emails into categories. They have labeled data for some categories but not all. Which approach should they use?

A.Unsupervised clustering then labeling

B.Supervised learning for all categories

C.Reinforcement learning

D.Semi-supervised learning

AnswerD

Correct: Semi-supervised learning leverages both labeled and unlabeled data.

Why this answer

Option C is correct because semi-supervised learning uses a small amount of labeled data along with a large amount of unlabeled data. Options A, B, and D are incorrect: supervised learning requires labels for all categories, unsupervised clustering would group without category labels, and reinforcement learning is for sequential decision making.

Full explanation →

408

MCQhard

A data scientist is working with a dataset that has 10,000 features but only 500 samples. The goal is to train a model for binary classification. Which feature selection technique is MOST appropriate to reduce overfitting?

A.Univariate selection using chi-squared test.

B.Recursive Feature Elimination (RFE) with cross-validation.

C.Using all features with L2 regularization.

D.Principal Component Analysis (PCA) without feature selection.

AnswerB

RFE with CV selects subset of features most relevant, reducing overfitting in small-sample settings.

Why this answer

Option B is correct because Recursive Feature Elimination with cross-validation iteratively removes features and evaluates performance, which is robust for high-dimensional small-sample data. Option A (PCA) creates components that may still overfit. Option C (univariate selection) may select irrelevant features.

Option D (L2 regularization) does not reduce features.

Full explanation →

409

MCQhard

A financial services firm deploys an AI system to screen loan applications. The model was trained on historical data that reflected biased lending practices. After deployment, a regulatory body investigates and finds that the model denies loans at a disproportionately higher rate to a protected demographic group. The firm must address this issue while maintaining compliance with fair lending laws. The Chief AI Officer proposes four possible actions. Which action is the most appropriate first step?

A.Retrain the model using a debiased dataset and implement fairness-aware algorithms, then validate with fairness metrics

B.Document the model's predictions and submit a report to the regulator explaining the historical dataset bias

C.Immediately deploy a rule-based system to manually review all denial decisions from the AI system

D.Disclose the bias findings to all rejected applicants and offer them priority reconsideration

AnswerA

This directly addresses the biased model and demonstrates a commitment to fairness.

Why this answer

Option C is the correct first step because it directly addresses the root cause (biased training data) and aligns with regulatory expectations. Option A is insufficient as it only documents without fixing. Option B may help but is premature before understanding the bias source.

Option D could expose the firm to further liability if the bias is systematic.

Full explanation →

410

MCQmedium

An AI system used for resume screening is found to consistently rank male candidates higher than female candidates with similar qualifications. The HR director wants to remediate this bias without significantly reducing model accuracy. Which technique should be applied?

A.Apply adversarial debiasing to the model during training.

B.Use a random selection of candidates to avoid bias.

C.Remove the gender feature from the dataset and retrain.

D.Collect more training data from underrepresented groups.

AnswerA

Adversarial debiasing reduces bias by training the model to be unable to predict protected attributes from its predictions.

Why this answer

Adversarial debiasing is the correct technique because it directly addresses bias during training by introducing an adversarial network that attempts to predict the protected attribute (e.g., gender) from the model's predictions. The main model is trained to maximize accuracy while minimizing the adversary's ability to infer the protected attribute, thereby reducing bias without a significant drop in predictive performance. This approach is more effective than simple feature removal or data collection because it actively learns to remove correlations between the protected attribute and the output.

Exam trap

CompTIA often tests the misconception that simply removing the protected attribute (e.g., gender) from the dataset is sufficient to eliminate bias, but candidates must understand that bias can persist through correlated features (proxy discrimination).

How to eliminate wrong answers

Option B is wrong because random selection of candidates would destroy the model's predictive accuracy entirely, as it ignores all qualifications and job-relevant features, which is not a valid remediation technique. Option C is wrong because simply removing the gender feature does not eliminate bias; the model can still learn gender proxies from correlated features such as years of experience, education, or job titles, leading to indirect discrimination. Option D is wrong because collecting more data from underrepresented groups may improve representation but does not guarantee removal of existing bias in the model's decision boundary; it can even introduce new imbalances if not handled carefully, and it does not actively debias the training process.

Full explanation →

411

MCQhard

Refer to the exhibit. A data scientist reviews the MLflow run for a Random Forest model on customer churn data. What is the most likely issue with this model?

A.The model is underfitting because training accuracy is too high.

B.The model is overfitting because there is a large gap between train and validation accuracy.

C.The model is performing well because validation accuracy is above 0.8.

D.The model has a data leak because dataset version is v2.

AnswerB

High train accuracy with lower validation accuracy is classic overfitting.

Why this answer

Option B is correct because a large gap between training accuracy (e.g., 0.99) and validation accuracy (e.g., 0.82) indicates that the Random Forest model has memorized the training data but fails to generalize to unseen validation data. This is the classic symptom of overfitting, where the model captures noise rather than the underlying pattern. In MLflow, comparing train and validation metrics directly reveals this discrepancy.

Exam trap

CompTIA often tests the misconception that high validation accuracy alone indicates a good model, ignoring the critical comparison between training and validation metrics to detect overfitting.

How to eliminate wrong answers

Option A is wrong because underfitting is characterized by low training accuracy, not high training accuracy; high training accuracy with poor validation performance indicates overfitting, not underfitting. Option C is wrong because a validation accuracy above 0.8 alone does not guarantee good model performance if there is a significant gap between train and validation accuracy, which signals overfitting. Option D is wrong because dataset version v2 is simply a versioning label and does not inherently cause data leakage; data leakage would involve information from the validation set leaking into training, which is unrelated to the version number.

Full explanation →

412

MCQhard

A CI/CD pipeline for a computer vision model uses canary deployment. After deploying a new version to 5% of traffic, the pipeline automatically rolls back due to a spike in error rate. The new model's inference time is 20% higher than the previous version. The operations team finds that the error is caused by timeout in the inference service. Which action should be taken to prevent future rollbacks?

A.Increase the timeout threshold for inference requests

B.Implement a fallback to the previous model when timeout occurs

C.Optimize the model using TensorRT or ONNX Runtime before deployment

D.Reduce the canary percentage to 1% to minimize impact

AnswerC

Optimizing reduces inference time, addressing the cause of timeouts.

Why this answer

Option C is correct because the root cause of the timeout is the 20% higher inference time of the new model. Optimizing the model using TensorRT or ONNX Runtime reduces inference latency directly, addressing the performance bottleneck that causes timeouts. This prevents the spike in error rate and subsequent rollback without masking the underlying issue.

Exam trap

The trap here is that candidates may confuse symptom management (increasing timeout or fallback) with root-cause resolution (model optimization), which Cisco tests to see if you understand that performance issues must be fixed at the source in AI/ML operations.

How to eliminate wrong answers

Option A is wrong because increasing the timeout threshold only masks the symptom (timeout) without fixing the underlying performance degradation; it may lead to poor user experience and does not prevent future rollbacks if the model remains slow. Option B is wrong because implementing a fallback to the previous model on timeout is a reactive workaround that does not address the root cause; it can cause inconsistent behavior and still result in errors during the fallback transition. Option D is wrong because reducing the canary percentage to 1% only minimizes the blast radius but does not prevent the timeout errors from occurring; the spike in error rate would still trigger a rollback, just with less traffic affected.

Full explanation →

413

MCQmedium

A machine learning team is deploying a model that predicts customer churn. They notice that the model's predictions are highly sensitive to small changes in input features, leading to inconsistent outputs. Which technique should the team apply to improve model stability?

A.Increase learning rate

B.Feature scaling

C.Regularization

D.Cross-validation

AnswerC

Regularization adds a penalty for large weights, reducing overfitting and sensitivity to input variations.

Why this answer

Regularization (Option C) is the correct technique because it adds a penalty term to the loss function (e.g., L1 or L2 regularization), which constrains the model's weights. This reduces variance and prevents overfitting to noise in the training data, directly addressing the high sensitivity to small input changes (brittleness). By shrinking coefficients, regularization forces the model to learn more general patterns, improving stability and consistency in predictions.

Exam trap

CompTIA often tests the misconception that feature scaling alone can fix model instability, but scaling only normalizes inputs and does not penalize large weights, which is the root cause of sensitivity to small input changes.

How to eliminate wrong answers

Option A is wrong because increasing the learning rate makes gradient descent steps larger, which can cause the model to overshoot minima and increase instability, not reduce sensitivity to input changes. Option B is wrong because feature scaling normalizes input ranges (e.g., via standardization or min-max scaling) to help gradient descent converge faster, but it does not address model variance or overfitting that causes prediction instability. Option D is wrong because cross-validation is a technique for evaluating model performance and tuning hyperparameters, not a method to directly improve model stability or reduce sensitivity to input perturbations.

Full explanation →

414

MCQeasy

A company wants to use AI to analyze customer reviews and determine sentiment (positive, negative, neutral). Which AI subfield is most directly applicable?

A.Reinforcement learning

B.Computer vision

C.Natural language processing

D.Robotics

AnswerC

Correct; NLP is used for text analysis and sentiment.

Why this answer

Natural language processing (NLP) is the AI subfield that enables machines to understand, interpret, and generate human language. Analyzing customer reviews for sentiment requires processing text, extracting meaning, and classifying it as positive, negative, or neutral, which is a core NLP task called sentiment analysis.

Exam trap

The trap here is that candidates often confuse natural language processing with computer vision or reinforcement learning because they see 'AI' broadly, but the specific task of analyzing text directly maps to NLP, not the other subfields.

How to eliminate wrong answers

Option A is wrong because reinforcement learning is a training paradigm where an agent learns by interacting with an environment and receiving rewards or penalties; it is not designed for text classification or sentiment analysis. Option B is wrong because computer vision focuses on interpreting visual data such as images and videos, not textual content like customer reviews. Option D is wrong because robotics deals with the design and control of physical machines to perform tasks in the real world, which is unrelated to analyzing text-based sentiment.

Full explanation →

415

MCQeasy

A startup develops an AI recruiting tool that screens resumes. After deployment, they receive a complaint from a candidate who claims the system rejected them due to age discrimination. The startup has no formal AI governance process. They want to quickly assess and remediate the issue. The dataset includes age as a feature. What should they do first?

A.Conduct a bias analysis to measure the model's impact on different age groups

B.Apologize to the candidate and offer a manual review of their resume

C.Immediately remove age from the feature set and retrain the model

D.Ignore the complaint because age is a legitimate business requirement

AnswerA

Analysis identifies whether and where bias exists before taking action.

Why this answer

Option B is correct because the complaint suggests age bias; the first step is to analyze the model's predictions across age groups to verify if there is bias. Option A is wrong because removing age may not eliminate proxies. Option C is wrong because disregarding the complaint is unethical.

Option D is wrong without first understanding the bias source.

Full explanation →

416

MCQeasy

An organization deploys an AI model on edge devices for real-time image classification. Which metric is most important to monitor for ensuring the device's operational health?

A.Model calibration error

B.Inference memory consumption

C.Average prediction confidence

D.Model accuracy on local test data

AnswerB

Memory is a key operational health indicator for edge devices.

Why this answer

For edge devices with limited resources, inference memory consumption is the most critical operational health metric because exceeding available memory can cause the model to crash or the device to become unresponsive. Unlike accuracy or confidence, memory usage directly reflects whether the device can sustain real-time inference without resource exhaustion.

Exam trap

CompTIA often tests the misconception that model accuracy or confidence is the primary concern for operational health, but the trap here is that edge device stability depends on resource constraints like memory, not model performance metrics.

How to eliminate wrong answers

Option A is wrong because model calibration error measures the reliability of predicted probabilities, not the operational health of the device. Option C is wrong because average prediction confidence indicates model certainty, not whether the device has sufficient memory to run inference. Option D is wrong because model accuracy on local test data evaluates model performance, not the device's ability to operate without memory overflow or system failure.

Full explanation →

417

MCQeasy

A data scientist is training a binary classification model to detect fraudulent transactions. The dataset is highly imbalanced with only 1% fraud cases. Which technique is most appropriate to address the class imbalance?

A.Use a linear regression model

B.Oversample the minority class

C.Undersample the majority class

D.Increase the learning rate

AnswerB

Oversampling creates synthetic instances of the minority class, helping the model learn better boundaries.

Why this answer

Oversampling the minority class (e.g., SMOTE) creates synthetic samples to balance the dataset, which is a common and effective approach for imbalanced classification.

Full explanation →

418

MCQmedium

Based on the exhibit, what is the most likely issue with the trained model?

A.Overfitting because training accuracy is much higher than validation accuracy

B.Data leakage artificially inflating training accuracy

C.Vanishing gradients causing no learning

D.Underfitting due to insufficient epochs

AnswerA

Training accuracy (99.32%) is significantly higher than validation accuracy (78.9%), a classic sign of overfitting.

Why this answer

The training accuracy reaches 99.32% while the validation accuracy plateaus around 78.9%, indicating overfitting. The model has memorized the training data but fails to generalize. Underfitting would show poor performance on both.

Vanishing gradients would cause loss not to decrease significantly. Data leakage would cause unusually high performance on both sets, but here validation accuracy is far lower.

Full explanation →

419

MCQmedium

Refer to the exhibit. A developer is using the above configuration for a multi-class classification task. The model performs well on training data but poorly on validation data. Which modification could help?

A.Remove dropout

B.Increase the dropout rate

C.Add L2 regularization to the dense layers

D.Increase the learning rate

AnswerC

L2 regularization adds a penalty on weights, which can reduce overfitting.

Why this answer

Option B is correct because adding L2 regularization to the dense layers penalizes large weights, reducing overfitting. Option A is incorrect because increasing dropout from 0.5 may hurt performance; it's already a reasonable value. Option C is incorrect because increasing learning rate could destabilize training.

Option D is incorrect because removing dropout would likely increase overfitting.

Full explanation →

420

MCQhard

During a penetration test, a security engineer discovers that an AI-powered chatbot can be tricked into revealing sensitive customer data by using specially crafted prompts. What type of attack is this, and what is the best mitigation?

A.Prompt injection attack; implement input validation and context sanitization

B.Model inversion attack; apply differential privacy during training

C.Data poisoning attack; implement strict access controls

D.Membership inference attack; add noise to model outputs

AnswerA

Prompt injection exploits the model via crafted inputs; validation prevents it.

Why this answer

Option D is correct: the attack is a prompt injection, and input validation (sanitization) is a primary defense. Option A is wrong because model inversion is different. Option B is wrong because this is not a membership inference attack.

Option C is wrong because differential privacy is for training, not for prompt-level attacks.

Full explanation →

421

MCQmedium

A company is deploying a machine learning model to predict customer churn. The dataset is highly imbalanced (95% non-churn, 5% churn). The model achieves 96% accuracy, but the F1-score for the churn class is only 0.2. Which metric should the team prioritize to evaluate model performance for this business problem?

A.F1-score

B.Accuracy

C.Log loss

D.AUC-ROC

AnswerA

F1-score balances precision and recall, suitable for imbalanced data.

Why this answer

In a highly imbalanced dataset (95% non-churn, 5% churn), accuracy is misleading because a model can achieve 96% accuracy by simply predicting the majority class for all instances. The F1-score, which is the harmonic mean of precision and recall, specifically measures the model's performance on the minority (churn) class. A low F1-score of 0.2 indicates the model fails to correctly identify churners, which is the critical business outcome, making F1-score the correct metric to prioritize.

Exam trap

CompTIA often tests the misconception that high accuracy is always good, especially in imbalanced datasets, leading candidates to overlook the F1-score as the appropriate metric for minority class performance.

How to eliminate wrong answers

Option B is wrong because accuracy is a poor metric for imbalanced datasets; a model can achieve high accuracy by always predicting the majority class, which does not reflect its ability to detect the minority churn class. Option C is wrong because log loss measures the confidence of probability predictions across all classes, but it does not directly address the imbalance or provide a clear threshold-based evaluation of the minority class performance like F1-score does. Option D is wrong because AUC-ROC evaluates the model's ability to rank positive and negative instances, but it can be overly optimistic in highly imbalanced scenarios and does not directly reflect precision and recall for the minority class, which are critical for churn prediction.

Full explanation →

422

Multi-Selecthard

A data engineer is designing a data pipeline for a real-time recommendation system. The pipeline must handle high velocity streams and ensure data quality. Which three components should be included in the pipeline? (Select THREE).

Select 3 answers

A.A stream processing engine like Apache Kafka Streams

B.A data validation step to check schema compliance

C.A data warehouse for historical analysis

D.A batch processing framework like Apache Spark

E.A message queue for buffering

AnswersA, B, E

Stream processing engines process data in real-time with low latency.

Why this answer

Apache Kafka Streams is a correct choice because it is a stream processing library specifically designed for building real-time applications and microservices that process data in motion. For a high-velocity recommendation pipeline, it provides exactly-once semantics, stateful processing (e.g., windowed joins, aggregations), and seamless integration with Kafka topics, enabling low-latency transformations without requiring an external cluster.

Exam trap

CompTIA often tests the distinction between stream processing and batch processing, and the trap here is that candidates mistakenly select a batch framework like Apache Spark or a data warehouse because they associate 'data pipeline' with traditional ETL, overlooking the strict real-time and low-latency requirements of the scenario.

Full explanation →

423

MCQeasy

A marketing team uses a recommendation system to suggest products to customers. The system currently uses collaborative filtering. Which scenario would most likely cause the cold-start problem?

A.A new product is added to the catalog with no purchase history.

B.The system switches from collaborative filtering to content-based filtering.

C.The website interface is redesigned, affecting user navigation.

D.A seasonal product experiences a sudden spike in sales.

AnswerA

No interaction data exists for the new product, so collaborative filtering fails.

Why this answer

The cold-start problem occurs when a recommendation system lacks sufficient data to make accurate predictions. In collaborative filtering, recommendations rely on historical user-item interactions (e.g., purchase history). A new product with no purchase history has no interaction data, so the system cannot find similar users or items to generate recommendations, directly causing the cold-start problem.

Exam trap

CompTIA often tests the cold-start problem by making candidates confuse it with performance issues or UI changes, but the trap here is that the cold-start problem is specifically about insufficient interaction data for new users or items, not about algorithm switches or interface redesigns.

How to eliminate wrong answers

Option B is wrong because switching from collaborative filtering to content-based filtering does not inherently cause the cold-start problem; content-based filtering uses item features (e.g., product attributes) to make recommendations, which can still work for new items if features are available. Option C is wrong because a website interface redesign affects user navigation but does not impact the underlying recommendation algorithm's data availability or the cold-start problem. Option D is wrong because a sudden spike in sales for a seasonal product provides abundant interaction data, which actually helps collaborative filtering make better recommendations, not cause a cold-start.

Full explanation →

424

MCQhard

A financial institution uses a deep learning model for fraud detection. The model is a feedforward neural network with three hidden layers. It was trained on a balanced dataset of 100,000 transactions. During deployment, the model achieves high accuracy on the test set but the fraud detection rate (true positive rate) is only 40% while the false positive rate is 0.1%. The business requires a true positive rate of at least 80%. Which of the following actions is most likely to achieve the required true positive rate while minimizing the increase in false positives?

A.Increase the number of hidden layers to five to capture more complex patterns

B.Use synthetic minority oversampling (SMOTE) to rebalance the training set

C.Change the threshold for classifying a transaction as fraud from the default 0.5 to a lower value

D.Add L2 regularization to reduce overfitting

AnswerC

Lowering threshold increases TPR; the optimal threshold can be chosen based on the precision-recall curve.

Why this answer

Option A (more hidden layers) may not improve recall and could overfit. Option C (L2 regularization) would increase bias, likely lowering TPR. Option D (SMOTE) rebalances training but the model already trained on balanced data; threshold adjustment is more direct.

Option B (lower decision threshold) directly increases TPR at the cost of FPR; threshold can be tuned to achieve 80% TPR with minimal FPR increase.

Full explanation →

425

MCQmedium

Based on the exhibit, what is the likely problem with the model?

A.Batch size too small

B.Overfitting

C.Learning rate too high

D.Underfitting

AnswerB

Correct: Training loss decreases but validation loss increases, classic overfitting.

Why this answer

Option A is correct because the training loss keeps decreasing while validation loss increases after a point, indicating overfitting. Options B, C, and D are incorrect: underfitting would have high training loss, learning rate too high would cause loss to oscillate, and batch size too small might affect convergence but not this pattern.

Full explanation →

426

MCQmedium

A deep learning model for sentiment analysis has millions of parameters and is trained on a small dataset. Which technique can help prevent overfitting?

A.Learning rate scheduling

B.Batch normalization

C.Dropout

D.Early stopping

AnswerC

Correct: Dropout is specifically designed to reduce overfitting in large neural networks.

Why this answer

Option A is correct because dropout is a regularization technique that randomly drops neurons during training, reducing overfitting. Options B, C, and D are incorrect: batch normalization helps with internal covariate shift, learning rate scheduling helps convergence, and early stopping can prevent overfitting but is not as specific as dropout for parameter-heavy models.

Full explanation →

427

MCQmedium

A self-driving car company uses a reinforcement learning agent to navigate. The agent was trained in a simulated environment and achieved high rewards. When deployed in the real world, the agent fails to avoid obstacles. The team collects real-world driving data and uses it to fine-tune the model. However, fine-tuning leads to catastrophic forgetting of the simulated knowledge. Which technique should the team use to mitigate this? A. Increase the learning rate during fine-tuning. B. Use elastic weight consolidation (EWC) to regularize important weights. C. Train the model from scratch using only real-world data. D. Increase the number of layers in the network.

A.Increase the number of layers in the network.

B.Use elastic weight consolidation (EWC) to regularize important weights.

C.Train the model from scratch using only real-world data.

D.Increase the learning rate during fine-tuning.

AnswerB

EWC selectively slows down learning on important weights for previous tasks, preserving simulated knowledge.

Why this answer

Option B is correct. Elastic weight consolidation (EWC) is a regularization technique that penalizes changes to weights that are important for previous tasks (simulation), thereby preventing catastrophic forgetting. Option A (increasing learning rate) would make forgetting worse.

Option C (training from scratch) discards the valuable simulation knowledge. Option D (adding layers) may increase capacity but does not address forgetting.

Full explanation →

428

MCQhard

Refer to the exhibit. A compliance audit requires that model predictions be explainable for regulatory reasons. Which setting in the deployment configuration supports this requirement?

A.target_latency: 100

B.data_retention: "90 days"

C.drift_detection: true

D.explainability: "required"

AnswerD

This setting directly mandates explainability.

Why this answer

The 'explainability': 'required' under compliance indicates that the model must provide explanations, meeting the audit requirement.

Full explanation →

429

Multi-Selecthard

A data scientist is using an ensemble method to combine multiple models. Which three statements about bagging (Bootstrap Aggregating) are true? (Select THREE.)

Select 3 answers

A.It requires the base models to be of different types

B.It reduces variance without increasing bias

C.It can be used with decision trees to create random forests

D.It reduces the error by combining weak learners

E.It trains models independently on bootstrap samples

AnswersB, C, E

Bagging averages predictions from models trained on bootstrap samples, reducing variance while bias remains similar.

Why this answer

Options A, B, and D are correct. Bagging reduces variance of unstable models (like trees) without increasing bias (A). It trains models independently on bootstrap samples (B).

Random forests use bagging along with random feature selection (D). Option C is false because boosting reduces bias, not bagging. Option E is false because bagging typically uses the same type of base model.

Full explanation →

430

MCQmedium

A data scientist trains a sentiment analysis model on user reviews. To ensure transparency, they want to explain why the model classified a particular review as negative. Which explainability technique should they use?

A.Decision tree surrogate model

B.Principal component analysis

C.SHAP (SHapley Additive exPlanations)

D.t-SNE dimensionality reduction

AnswerC

SHAP computes feature contributions for each prediction.

Why this answer

Option D is correct because SHAP values provide per-feature attribution for individual predictions. Option A is wrong because LIME is also for local explanations, but SHAP is more theoretically grounded and common for feature attribution. Option B is wrong because t-SNE is for visualization of high-dimensional data, not explanation.

Option C is wrong because decision trees are a model type, not an explanation method for any model.

Full explanation →

431

MCQeasy

A machine learning engineer wants to evaluate a binary classifier. Which metric is MOST appropriate when the positive class is rare (e.g., 1% of total data)?

A.True negative rate

B.F1-score

C.Mean squared error

D.Accuracy

AnswerB

Correct; F1 considers both precision and recall.

Why this answer

When the positive class is rare (e.g., 1% of total data), accuracy is misleading because a classifier that always predicts the negative class would achieve 99% accuracy. The F1-score is the harmonic mean of precision and recall, making it robust to class imbalance by focusing on the positive class performance. It is the most appropriate metric for evaluating binary classifiers on imbalanced datasets.

Exam trap

CompTIA often tests the misconception that accuracy is always the best metric, but in imbalanced datasets it is misleading, and candidates must recognize that F1-score (or precision-recall curves) is the correct choice for rare positive classes.

How to eliminate wrong answers

Option A is wrong because the true negative rate (specificity) measures the proportion of actual negatives correctly identified, which is not sensitive to the rare positive class and can be high even if the classifier misses all positives. Option C is wrong because mean squared error (MSE) is a regression metric that measures average squared differences between predicted and actual values, not suitable for binary classification outcomes. Option D is wrong because accuracy ( (TP+TN)/(TP+TN+FP+FN) ) is dominated by the majority class in imbalanced datasets, giving a falsely high score even when the classifier fails to detect the rare positive class.

Full explanation →

432

MCQhard

An engineer is training a neural network and observes the output shown. Which conclusion is most likely correct?

A.The gradients are vanishing.

B.The model is overfitting after epoch 2.

C.The model is underfitting.

D.The learning rate is too high.

AnswerB

Training loss decreases, validation loss increases.

Why this answer

The output shows training loss decreasing while validation loss increases after epoch 2, which is a classic sign of overfitting. The model begins to memorize the training data rather than generalize, leading to poor performance on unseen data. This pattern confirms that overfitting starts after epoch 2, making option B correct.

Exam trap

CompTIA often tests the distinction between overfitting and underfitting by presenting a loss curve where training loss decreases but validation loss increases, leading candidates to mistakenly attribute the issue to vanishing gradients or a high learning rate.

How to eliminate wrong answers

Option A is wrong because vanishing gradients typically cause slow or stalled learning across all epochs, not a sudden divergence between training and validation loss after epoch 2. Option C is wrong because underfitting would show high training loss and high validation loss throughout, not a decreasing training loss with increasing validation loss. Option D is wrong because a learning rate that is too high would cause the loss to oscillate or diverge from the start, not show a clear overfitting pattern after epoch 2.

Full explanation →

433

MCQmedium

Refer to the exhibit. Which assessment is most critical for ethical deployment?

A.Feature engineering should be improved

B.Data collection needs expansion

C.Bias mitigation is needed

D.Model retraining is required due to low recall

AnswerC

A demographic parity difference over 0.1 is often considered evidence of bias requiring correction.

Why this answer

The demographic parity difference of 0.15 indicates potential bias against a demographic group, so bias mitigation is the most critical ethical concern.

Full explanation →

434

MCQmedium

A financial institution uses a random forest model to approve loan applications. Recently, the model's false positive rate has increased, leading to more defaults. The data science team reviews the feature importance and finds that the model heavily relies on a feature 'zip code' which correlates with income. The company is concerned about fairness. The regulatory team requires that the model's predictions are not biased against protected groups. Which action BEST addresses the fairness concern while maintaining predictive performance? A. Remove the 'zip code' feature and retrain the model. B. Use adversarial debiasing to train a model that is invariant to protected attributes. C. Add more training data from underrepresented zip codes. D. Apply a post-processing technique that adjusts thresholds for different groups.

A.Apply a post-processing technique that adjusts thresholds for different groups.

B.Remove the 'zip code' feature and retrain the model.

C.Add more training data from underrepresented zip codes.

D.Use adversarial debiasing to train a model that is invariant to protected attributes.

AnswerD

Adversarial debiasing explicitly reduces the model's ability to predict protected attributes, mitigating bias while retaining predictive power.

Why this answer

Option B is correct. Adversarial debiasing directly forces the model to learn representations that are not predictive of protected attributes, thereby reducing bias while maintaining performance as much as possible. Option A (removing zip code) might lose important information, as zip code could be a proxy for other legitimate factors; also, other features may still correlate with protected attributes.

Option C (adding data) does not directly address bias and may not remove the correlation. Option D (post-processing) can adjust thresholds but may not address the underlying model bias; it is a less robust solution.

Full explanation →

435

Multi-Selectmedium

Which TWO techniques are commonly used to handle missing data in a dataset?

Select 2 answers

A.Feature scaling

B.One-hot encoding

C.Remove rows with missing values

D.Impute with mean or median

E.Principal component analysis (PCA)

AnswersC, D

Simple deletion if missing data is minimal.

Why this answer

Option C is correct because removing rows with missing values is a straightforward technique to handle missing data, especially when the missingness is random and the dataset is large enough that dropping a few rows does not significantly reduce the sample size or introduce bias. Option D is correct because imputing missing values with the mean or median is a common statistical method that preserves the dataset size and is simple to implement, though it can reduce variance and may distort relationships if the data is not missing completely at random.

Exam trap

CompTIA often tests the distinction between data preprocessing techniques that handle missing values versus those that transform or reduce features, so candidates may confuse feature scaling or PCA with missing data handling because they are all part of data preparation.

Full explanation →

436

MCQhard

A machine learning engineer notices that the gradient values in a deep network are becoming extremely small during backpropagation. What is this problem?

A.Dead ReLU

B.Exploding gradient

C.Covariate shift

D.Vanishing gradient

AnswerD

Correct: Vanishing gradient makes weights stop updating effectively.

Why this answer

Option B is correct because vanishing gradient occurs when gradients become very small, preventing weight updates. Option A is incorrect: exploding gradient would be large values. Option C is incorrect: dead ReLU refers to neurons that output zero.

Option D is incorrect: covariate shift is a change in input distribution.

Full explanation →

437

MCQeasy

A company must deploy a new model version with zero downtime. The current model is served via a REST API on a Kubernetes cluster. Which deployment strategy should the team use to gradually shift traffic to the new version while monitoring for errors?

A.Blue-green deployment

B.Canary deployment

C.Recreate deployment

D.Rolling update

AnswerB

Canary deployment gradually routes traffic to the new version for safe rollout.

Why this answer

A canary deployment gradually shifts a small percentage of traffic to the new model version while the majority continues to hit the stable version. This allows the team to monitor for errors and roll back quickly if issues arise, achieving zero downtime. It is the ideal strategy for validating a new model in production with minimal risk.

Exam trap

The trap here is that candidates confuse 'rolling update' with 'canary deployment' because both involve gradual changes, but a rolling update replaces pods sequentially without the ability to route a controlled subset of traffic for targeted monitoring and rollback.

How to eliminate wrong answers

Option A is wrong because blue-green deployment switches all traffic at once from the old to the new environment, which does not provide gradual traffic shifting or incremental error monitoring; it is an all-or-nothing cutover. Option C is wrong because recreate deployment tears down the old version before deploying the new one, causing downtime and violating the zero-downtime requirement. Option D is wrong because a rolling update replaces pods incrementally but does not allow fine-grained traffic splitting or canary-style monitoring; it updates all instances without a separate traffic-routing phase for error detection.

Full explanation →

438

MCQeasy

A data scientist wants to reduce the dimensionality of a dataset with 200 features before training a regression model. Which technique should they use?

A.LDA

B.t-SNE

C.Autoencoder

D.PCA

AnswerD

Correct: PCA is widely used for dimensionality reduction in regression tasks.

Why this answer

Option D is correct because PCA is a linear dimensionality reduction technique that is commonly used for feature reduction. Options A, B, and C are incorrect: t-SNE is for visualization only, LDA is for classification, and autoencoder is a neural network approach but more complex than needed.

Full explanation →

439

Multi-Selecthard

Which THREE factors are common causes of bias in AI systems?

Select 3 answers

A.Cross-validation

B.Lack of diversity in the development team

C.Unrepresentative training sample

D.Biased historical data used for training

E.High regularization

AnswersB, C, D

Homogeneous teams may overlook biased assumptions.

Why this answer

Option B is correct because a lack of diversity in the development team leads to homogeneity of thought, which can cause blind spots in identifying potential biases in data, features, or model behavior. When the team does not represent the full spectrum of end users, the AI system may inadvertently encode assumptions that disadvantage underrepresented groups, resulting in biased outcomes.

Exam trap

CompTIA often tests the distinction between statistical bias (e.g., from regularization or validation techniques) and harmful societal bias that leads to unfair outcomes, so candidates mistakenly select options like cross-validation or high regularization as causes of bias.

Full explanation →

440

Multi-Selectmedium

Which TWO practices are most effective for ensuring the security of an AI model against adversarial attacks?

Select 2 answers

A.Encrypting the model weights

B.Continuous model monitoring

C.Input sanitization and validation

D.Adversarial training

E.Rate limiting API access

AnswersC, D

Sanitization removes or normalizes inputs that may contain adversarial perturbations.

Why this answer

Input sanitization and validation (C) is correct because it prevents adversarial inputs—such as specially crafted perturbations or injection strings—from reaching the model's inference pipeline. By filtering, encoding, or rejecting malicious data at the application layer, the model's decision boundary is protected from manipulation. This is a fundamental defense-in-depth measure against evasion and poisoning attacks.

Exam trap

CompTIA often tests the distinction between reactive monitoring (B) and proactive defenses (C and D), and candidates mistakenly choose rate limiting (E) thinking it blocks all attacks, but it only throttles frequency, not content.

Full explanation →

441

MCQhard

An organization is implementing an AI-powered chatbot for customer service. The chatbot must comply with GDPR and handle data subject access requests (DSARs). Which design approach best ensures compliance?

A.Minimize data collection by not logging any user interactions.

B.Anonymize all user data before logging interactions.

C.Implement an audit trail that logs interactions with a unique user identifier, and provide a mechanism to delete logs upon user request.

D.Encrypt all chat logs and store them indefinitely for audit purposes.

AnswerC

This ensures compliance with the right to access and erasure under GDPR.

Why this answer

Option C is correct because GDPR requires that personal data be stored only as long as necessary and that data subjects have the right to erasure. By logging interactions with a unique user identifier and providing a deletion mechanism, the chatbot can fulfill DSARs while maintaining an audit trail for compliance monitoring. This approach balances operational needs with regulatory obligations.

Exam trap

CompTIA often tests the misconception that GDPR requires complete data minimization (Option A) or indefinite encryption (Option D), when in fact the regulation mandates a balance between data utility and privacy rights, including the ability to delete data upon request.

How to eliminate wrong answers

Option A is wrong because not logging any user interactions prevents the organization from monitoring chatbot performance, improving the AI model, or detecting security incidents, and GDPR does not prohibit all logging—only excessive or unnecessary data collection. Option B is wrong because anonymization must be irreversible to be GDPR-compliant; if the data can be re-identified (e.g., via correlation with other logs), it is pseudonymization, which still subjects it to GDPR requirements, and anonymizing before logging does not address the need to handle DSARs for data that was originally personal. Option D is wrong because storing chat logs indefinitely violates the GDPR storage limitation principle (Article 5(1)(e)), which mandates that personal data be kept no longer than necessary for the purpose for which it is processed.

Full explanation →

442

MCQmedium

A data scientist is training a deep learning model for image classification. The training loss decreases steadily but the validation loss starts increasing after 10 epochs. Which technique should the scientist apply to address this issue?

A.Add more dropout layers

B.Reduce the learning rate

C.Implement early stopping

D.Increase the number of training epochs

AnswerC

Early stopping halts training when validation loss stops improving, preventing overfitting.

Why this answer

The scenario describes overfitting: the model memorizes training data (loss decreases) but fails to generalize to unseen validation data (validation loss increases). Early stopping (Option C) halts training when validation performance degrades, preventing overfitting while preserving the best model weights. This is a standard regularization technique in deep learning frameworks like TensorFlow and PyTorch.

Exam trap

CompTIA often tests the distinction between preventive regularization (dropout, L2) and reactive overfitting control (early stopping), leading candidates to choose dropout or learning rate reduction when the scenario explicitly describes overfitting that has already begun.

How to eliminate wrong answers

Option A is wrong because adding more dropout layers can help regularize the model, but it is not the direct solution for the described symptom of validation loss increasing after a certain epoch; dropout is a preventive measure applied before training, not a reactive fix for overfitting that has already occurred. Option B is wrong because reducing the learning rate may slow down convergence or help escape local minima, but it does not address the core issue of overfitting; a lower learning rate can even exacerbate overfitting by allowing the model to fit noise more precisely. Option D is wrong because increasing the number of training epochs would worsen the overfitting problem, as the model would continue to memorize training data and further diverge from validation performance.

Full explanation →

443

MCQeasy

A data science team uses a CI/CD pipeline for ML models. They need to ensure that each model version is traceable back to the exact training data and hyperparameters. Which practice should be implemented?

A.Use a model registry with metadata tracking (e.g., MLflow)

B.Use Git LFS for model files

C.Store model artifacts in blob storage with timestamped filenames

D.Record hyperparameters in a shared spreadsheet

AnswerA

A model registry stores versions and associated metadata for full traceability.

Why this answer

A model registry (Option C) serves as a centralized repository that tracks model versions along with their metadata, including training data snapshots and hyperparameters. Git LFS (Option A) only handles large files, not metadata. Blob storage with timestamps (Option B) lacks structured tracking.

A spreadsheet (Option D) is error-prone and not integrated into the pipeline.

Full explanation →

444

MCQeasy

A machine learning engineer notices that a linear regression model has high bias. Which action is most likely to reduce bias?

A.Use a more complex model, such as polynomial regression

B.Reduce the number of training samples

C.Add L2 regularization

D.Apply feature scaling

AnswerA

More complex models have lower bias as they can fit more patterns.

Why this answer

Option A (Add L2 regularization) increases bias to reduce variance. Option C (Reduce the number of training samples) would increase bias. Option D (Apply feature scaling) does not directly affect bias.

Option B (Use a more complex model, such as polynomial regression) increases model flexibility, reducing bias.

Full explanation →

445

MCQmedium

A data scientist is training a binary classification model to detect fraudulent transactions. The dataset contains 99.9% legitimate transactions and 0.1% fraudulent transactions. After training a logistic regression model, the accuracy is 99.9%, but the recall for the fraud class is 0%. Which of the following is the MOST likely cause?

A.The regularization parameter is too large, causing underfitting.

B.The model is overfitting due to too many features.

C.The learning rate was too high.

D.The dataset is highly imbalanced, and the model predicts the majority class for all instances.

AnswerD

With severe class imbalance, a model can achieve high accuracy by always predicting the majority class, leading to zero recall for the minority class.

Why this answer

Option C is correct because a highly imbalanced dataset often leads the model to predict the majority class for all instances, resulting in high accuracy but zero recall for the minority class. Option A (learning rate) would not cause this behavior; it affects convergence speed. Option B (overfitting) typically reduces generalization but not in this specific pattern.

Option D (too large regularization) might cause underfitting but would not necessarily yield zero recall for one class.

Full explanation →

446

MCQeasy

A company is developing an AI chatbot for customer service. The legal team is concerned that the chatbot might generate responses that violate privacy regulations. Which governance mechanism should be implemented to mitigate this risk?

A.Use explainable AI techniques to understand why the chatbot generates certain responses.

B.Encrypt all chatbot conversations at rest and in transit.

C.Implement a human-in-the-loop review process for high-risk responses.

D.Anonymize the training data used to train the chatbot.

AnswerC

Human review can catch and block responses that violate privacy regulations before they are sent to customers.

Why this answer

Option C is correct because a human-in-the-loop (HITL) review process directly addresses the risk of privacy violations by ensuring that high-risk responses are reviewed by a human before being sent to the customer. This governance mechanism provides a safety net for unpredictable outputs from the generative AI model, which may inadvertently leak personally identifiable information (PII) or violate data protection regulations like GDPR or CCPA. Unlike technical controls that only reduce the attack surface, HITL offers real-time compliance oversight for the chatbot's natural language generation (NLG) outputs.

Exam trap

CompTIA often tests the distinction between preventive controls (like HITL) and detective or protective controls (like encryption or anonymization), and the trap here is that candidates confuse data security measures (encryption, anonymization) with governance mechanisms that directly control model output behavior.

How to eliminate wrong answers

Option A is wrong because explainable AI (XAI) techniques, such as SHAP or LIME, provide post-hoc interpretability of model decisions but do not prevent the generation of privacy-violating responses; they only help diagnose why a violation occurred after the fact. Option B is wrong because encrypting chatbot conversations at rest (e.g., using AES-256) and in transit (e.g., using TLS 1.3) protects data from external interception but does not control the content generated by the chatbot itself, which is the source of the privacy risk. Option D is wrong because anonymizing training data (e.g., via k-anonymity or differential privacy) reduces the risk of the model memorizing PII, but it does not prevent the chatbot from generating new responses that violate privacy regulations through inference or context-based leakage during inference.

Full explanation →

447

MCQmedium

During model deployment, a data engineer notices that the model's predictions are consistently lower than expected due to a shift in the distribution of one feature between training and production. Which technique should be used to detect and quantify this shift?

A.Compute the root mean square error (RMSE)

B.Calculate the population stability index (PSI)

C.Generate a confusion matrix

D.Perform a t-test on the means

AnswerB

PSI quantifies the degree of distribution shift, commonly used in monitoring.

Why this answer

The Population Stability Index (PSI) is specifically designed to detect and quantify shifts in the distribution of a feature or score between two populations, such as training and production datasets. It measures the stability of the feature by comparing the proportion of observations in each bin across the two time periods, making it the correct choice for diagnosing distribution drift in model deployment.

Exam trap

CompTIA often tests the distinction between performance metrics (like RMSE or confusion matrix) and distribution monitoring metrics (like PSI), trapping candidates who confuse model accuracy evaluation with data drift detection.

How to eliminate wrong answers

Option A is wrong because RMSE measures the average magnitude of prediction errors, not distribution shifts between datasets. Option C is wrong because a confusion matrix evaluates classification performance against ground truth labels, not feature distribution changes. Option D is wrong because a t-test on the means only checks for a difference in central tendency, not the full distributional shift that PSI captures, and it is sensitive to sample size rather than bin-wise stability.

Full explanation →

448

Multi-Selecteasy

A data engineer is preparing a dataset for training a classification model. The dataset contains missing values in multiple features, inconsistent categorical labels, and outliers in numerical features. Which TWO preprocessing steps should the engineer prioritize to improve model performance?

Select 2 answers

A.Normalize numerical features using min-max scaling.

B.Remove all rows with any missing data.

C.Encode categorical variables using label encoding.

D.Impute missing values with the median.

E.Apply one-hot encoding to all categorical variables.

AnswersA, D

Normalization ensures features contribute equally to distance-based models.

Why this answer

The correct steps are imputing missing values with the median (handles outliers better than mean) and normalizing numerical features (for distance-based algorithms). Blindly removing rows loses data; label encoding on nominal categories creates false order; one-hot encoding on high-cardinality categories can cause dimensionality issues.

Full explanation →

449

Multi-Selectmedium

Which THREE of the following are key principles of trustworthy AI as defined by major regulatory bodies?

Select 3 answers

A.Fairness and non-discrimination

B.Transparency and explainability

C.Maximum profitability

D.Proprietary secrecy

E.Accountability

AnswersA, B, E

AI systems should avoid bias and ensure equitable treatment.

Why this answer

Options A, C, and D are correct. Accountability, fairness, and transparency are core principles. Option B is wrong because maximum profitability is not an ethical principle.

Option E is wrong because proprietary secrecy conflicts with transparency.

Full explanation →

450

Multi-Selecthard

Which TWO of the following are effective techniques to detect data poisoning attacks in a training dataset?

Select 2 answers

A.Performing cross-validation to check for inconsistent model performance.

B.Normalizing features to zero mean and unit variance.

C.Using ensemble methods like random forest for training.

D.Applying PCA to reduce dimensionality.

E.Statistical outlier detection on feature distributions.

AnswersA, E

Poisoned data often causes model performance to vary significantly across folds.

Why this answer

Option A is correct because cross-validation can reveal data poisoning by exposing inconsistent model performance across folds. If a poisoned subset causes the model to perform well on certain folds but poorly on others, it indicates that the training data may have been tampered with, as the model's behavior becomes unstable due to maliciously injected samples.

Exam trap

CompTIA often tests the distinction between techniques that detect poisoning (like cross-validation and outlier detection) versus techniques that only mitigate or preprocess data, leading candidates to mistakenly select normalization or dimensionality reduction as detection methods.

Full explanation →

Page 6 of 7

All pages

Practice AI0-001 by domain

Target a specific domain to shore up weak areas.

AI Concepts and Foundations Machine Learning and Deep Learning AI Models and Data Engineering AI Implementation and Operations AI Security, Ethics and Governance

See all domains with question counts →