Knowledge + Practice

CCNA AI Implementation and Operations Questions

75 of 103 questions · Page 1/2 · AI Implementation and Operations · Answers revealed

Practice these questions Domain overview All questions

1

MCQmedium

Refer to the exhibit. A machine learning pipeline configuration is shown. During a deployment, the model evaluation passes with accuracy 0.86 and precision 0.79. However, the pipeline proceeds to deploy. What is the most likely reason for this behavior?

A.The precision metric is not included in the evaluation script

B.The deployment only checks the accuracy threshold for rollback condition

C.The deployment target is set to staging instead of production

D.The operator manually overrode the threshold

AnswerB

The rollback_condition only mentions accuracy, so precision threshold is ignored.

Why this answer

The pipeline configuration shows a rollback condition that only checks the accuracy metric (accuracy < 0.85). Since the model achieved accuracy 0.86, which is above the threshold, the condition is not triggered, and the pipeline proceeds to deploy regardless of the precision value. The precision metric is not part of the rollback evaluation logic in this configuration.

Exam trap

CompTIA often tests the misconception that all evaluation metrics automatically trigger rollback conditions, when in fact only metrics explicitly listed in the condition logic are checked.

How to eliminate wrong answers

Option A is wrong because the evaluation script clearly outputs precision (0.79), and the exhibit shows precision is being calculated; the issue is that the rollback condition does not reference precision. Option C is wrong because the deployment target (staging vs. production) does not affect whether a rollback condition is evaluated; the pipeline proceeds based on the condition logic, not the environment name. Option D is wrong because there is no evidence or indication in the exhibit or scenario that an operator manually overrode the threshold; the behavior is fully explained by the configured rollback condition.

Practice this question →

2

MCQhard

A company uses a large language model (LLM) to generate customer support responses. They notice the model sometimes produces harmful outputs. Which implementation strategy best reduces this risk while maintaining performance?

A.Implement a keyword-based output filter

B.Use a smaller, less capable model

C.Add system prompts instructing the model to be safe

D.Fine-tune the model using reinforcement learning from human feedback

AnswerD

RLHF effectively aligns model outputs with human preferences.

Why this answer

Option D is correct because reinforcement learning from human feedback (RLHF) directly trains the model to align its outputs with human preferences for safety and helpfulness, reducing harmful outputs while preserving performance. Unlike superficial filters or prompts, RLHF adjusts the model's internal behavior through reward modeling and policy optimization, making it the most effective strategy for sustained safety improvements.

Exam trap

CompTIA often tests the misconception that simple output filtering or prompt engineering is sufficient for safety, when in fact only training-based alignment methods like RLHF can meaningfully change model behavior without sacrificing performance.

How to eliminate wrong answers

Option A is wrong because keyword-based output filters are brittle and can be bypassed by paraphrasing or context-dependent harmful content, while also risking false positives that degrade performance by blocking legitimate responses. Option B is wrong because using a smaller, less capable model reduces overall performance and may still produce harmful outputs if not specifically trained for safety, as capability and safety are not directly correlated. Option C is wrong because system prompts are easily overridden by the model's training distribution and do not provide robust, consistent safety alignment, especially against adversarial or nuanced harmful inputs.

Practice this question →

3

MCQeasy

A company deploys a computer vision model for quality inspection on a manufacturing line. After deployment, the model's accuracy drops from 95% to 80% over two weeks. Which action is most likely to address this issue?

A.Retrain the model using recently collected production data.

B.Increase the confidence threshold for predictions.

C.Decrease the learning rate of the training algorithm.

D.Deploy an additional ensemble of models for redundancy.

AnswerA

Retraining with current data adapts the model to new data distributions, countering drift.

Why this answer

Option B is correct because data drift is a common cause of performance degradation over time, and retraining with recent data realigns the model. Option A is wrong because increasing threshold may reduce false positives but does not address drift. Option C is wrong because adjusting learning rate is irrelevant for inference.

Option D is wrong because adding redundant models increases complexity without solving drift.

Practice this question →

4

MCQhard

Refer to the exhibit. A batch inference job fails with the given logs. What is the most likely root cause of the failure?

A.The input data has values that exceed the model's expected range

B.The input data contains missing values that are not handled in preprocessing

C.The model was not trained to handle categorical features

D.The model version is outdated and incompatible with the current preprocessing pipeline

AnswerB

The log clearly shows a NaN value for 'age' causing an error in normalization.

Why this answer

The logs indicate a 'ValueError' or similar exception when the batch inference job attempts to process the input data. This error typically arises when the preprocessing pipeline encounters missing values (e.g., NaN or None) that it cannot handle, causing the job to fail. Option B is correct because missing values not handled in preprocessing are a common root cause for such failures, especially when the training data had no missing values but the inference data does.

Exam trap

CompTIA often tests the distinction between data quality issues (missing values) and model compatibility issues (version mismatches or feature encoding), so candidates may incorrectly choose option D because they assume a version mismatch is the cause, when the logs clearly point to a preprocessing failure.

How to eliminate wrong answers

Option A is wrong because values exceeding the model's expected range would typically cause a different error, such as a 'ValueError' about clipping or scaling, not a generic failure from missing data. Option C is wrong because the model not being trained to handle categorical features would manifest as a 'TypeError' or 'KeyError' during feature encoding, not a missing-value-related error. Option D is wrong because an outdated model version incompatible with the preprocessing pipeline would likely cause a 'ShapeError' or 'AttributeError' due to mismatched feature names or dimensions, not a missing-value error.

Practice this question →

5

MCQeasy

An AI system for fraud detection shows a gradual decline in precision over several weeks, though recall remains stable. Which type of model drift is most likely occurring?

A.Data drift

B.Covariate shift

C.Label drift

D.Concept drift

AnswerD

Concept drift alters the decision boundary, often increasing false positives while recall remains stable.

Why this answer

Precision dropping while recall stable indicates increased false positives. Concept drift (Option B) changes the underlying relationships, causing the model to misclassify more legitimate transactions as fraud. Data drift (Option A) would affect both precision and recall.

Label drift (Option C) changes ground truth definitions, also affecting both. Covariate shift (Option D) is similar to data drift.

Practice this question →

6

Multi-Selecthard

Which TWO techniques are most effective for ensuring model explainability in a production loan approval AI system subject to regulatory review? (Select TWO.)

Select 2 answers

A.Replace the model with a decision tree for transparency

B.Use SHAP values to understand feature contributions

C.Rely on the model's internal attention weights (if transformer-based)

D.Apply LIME to generate local explanations for each prediction

E.Calculate global feature importance using permutation importance

AnswersB, D

SHAP provides consistent and theoretically grounded explanations.

Why this answer

Options C and D are correct. Option C is correct because LIME provides local explanations for individual predictions. Option D is correct because SHAP values quantify feature contributions globally and locally.

Option A is wrong because feature importance gives global view but not per-instance. Option B is wrong while decision trees are interpretable, they may not be the deployed model. Option E is wrong because black-box models are inherently uninterpretable without post-hoc methods.

Practice this question →

7

MCQhard

The exhibit shows the output of a drift monitoring command for a fraud detection model. The team has an automated pipeline that triggers retraining when the overall average drift score exceeds 0.10. Based on the exhibit, what should the operations team do next?

A.Force retraining on all features to ensure the model adapts to the new data distribution.

B.Manually analyze the drift in 'amount' and 'location' and investigate potential causes.

C.No action is needed because the model is performing within acceptable drift limits.

D.Initiate the automated retraining pipeline since the average drift exceeds 0.05.

AnswerB

Investigating root causes of drift helps determine if retraining or data correction is appropriate.

Why this answer

The correct action is to analyze the drift in the 'amount' and 'location' features and investigate root causes before retraining. The overall average drift is (0.12+0.08+0.03+0.02)/4 = 0.0625, which is below the threshold of 0.10, so retraining is not automatically triggered. However, individual features show significant drift, which could degrade performance.

Option C is correct because understanding why those features drifted helps decide if retraining or data correction is needed. Option A is wrong because the average drift is below threshold. Option B is wrong because ignoring drift could lead to performance degradation.

Option D is wrong because manual retraining without investigation may not address the root cause.

Practice this question →

8

MCQeasy

A data scientist is deploying a machine learning model to production. The model was trained on an imbalanced dataset. Which technique should be used during deployment to mitigate bias without retraining the model?

A.Apply post-processing calibration to adjust decision thresholds

B.Use an ensemble of models trained on balanced subsets

C.Rebalance the dataset using SMOTE before inference

D.Remove sensitive features from the input data

AnswerA

Post-processing calibration adjusts thresholds to improve fairness without retraining.

Why this answer

Post-processing calibration adjusts the decision threshold of the model to account for the class imbalance present in the training data. This technique modifies the output probabilities or classification boundary without requiring access to the original training data or retraining the model, making it suitable for deployment scenarios where the model is already fixed.

Exam trap

CompTIA often tests the distinction between techniques applied during training versus deployment, and the trap here is that candidates mistakenly choose SMOTE or ensemble methods, which require retraining, instead of recognizing that threshold adjustment is a valid post-deployment bias mitigation strategy.

How to eliminate wrong answers

Option B is wrong because using an ensemble of models trained on balanced subsets requires retraining or modifying the model architecture, which violates the constraint of not retraining the model. Option C is wrong because SMOTE (Synthetic Minority Over-sampling Technique) is a data preprocessing method applied before training to balance the dataset, not during inference; applying it at inference time would require access to the original training data and would alter the input distribution, which is not feasible or correct. Option D is wrong because simply removing sensitive features does not mitigate bias caused by imbalanced data; bias can still propagate through correlated features, and this approach does not address the class imbalance issue directly.

Practice this question →

9

MCQmedium

A company deploys a machine learning model for fraud detection. After one month, the false positive rate has increased significantly. The model is retrained weekly on all historical data. What is the MOST effective immediate action?

A.Replace the model with a simpler logistic regression model.

B.Continue retraining weekly on all historical data.

C.Adjust the classification threshold to reduce false positives.

D.Retrain the model on only the most recent 30 days of data.

AnswerD

Recent data captures current fraud patterns, reducing false positives.

Why this answer

The false positive rate increase suggests the model is reacting to a shift in the underlying data distribution (concept drift). Retraining on only the most recent 30 days of data (option D) is the most effective immediate action because it focuses the model on the current fraud patterns, discarding stale historical data that may no longer be representative. This approach directly addresses the drift by adapting the model to the latest behavior.

Exam trap

CompTIA often tests the misconception that adjusting the classification threshold is a sufficient fix for model degradation, when in reality it only trades off error types without addressing the underlying data drift that caused the false positive increase.

How to eliminate wrong answers

Option A is wrong because replacing the model with a simpler logistic regression model does not address the root cause of concept drift and may reduce predictive performance without solving the false positive issue. Option B is wrong because continuing to retrain weekly on all historical data will dilute the influence of recent patterns with outdated data, likely perpetuating the high false positive rate. Option C is wrong because adjusting the classification threshold is a post-hoc fix that reduces false positives at the cost of increasing false negatives, and it does not correct the underlying model drift or data quality issue.

Practice this question →

10

MCQmedium

Based on the exhibit, what is the most likely cause of the pod failure and its solution?

A.The node has insufficient CPU; add more CPU.

B.The pod is configured with wrong GPU drivers; update drivers.

C.The model is too large; use a smaller model.

D.The container memory limit is too low; increase the memory limit in the pod spec.

AnswerD

OOMKilled specifically indicates memory exhaustion; raising the limit is the direct fix.

Why this answer

The pod failure is caused by an OOMKilled (Out of Memory) error, as indicated by the pod status in the exhibit. When a container exceeds its memory limit, Kubernetes terminates it with an OOMKilled exit code. Increasing the memory limit in the pod spec allows the container to allocate more memory, resolving the failure.

Exam trap

CompTIA often tests the distinction between resource exhaustion errors (OOMKilled vs. CPU throttling) and configuration errors (driver issues), leading candidates to incorrectly attribute a memory limit issue to a hardware or driver problem.

How to eliminate wrong answers

Option A is wrong because the exhibit shows no CPU-related errors or resource pressure; the failure is due to memory exhaustion, not insufficient CPU. Option B is wrong because GPU driver issues would manifest as device plugin errors or initialization failures, not an OOMKilled status. Option C is wrong because the model size is not directly indicated as the cause; the pod is failing due to memory limits, and using a smaller model might reduce memory usage but does not address the misconfigured resource limit.

Practice this question →

11

MCQmedium

An AI system used for hiring has been found to exhibit racial bias against certain candidates. Which step should the organization take to mitigate this?

A.Remove all demographic features from the model.

B.Use a different algorithm that is inherently unbiased.

C.Regularly audit model predictions across demographic groups and retrain with fairness constraints.

D.Hire more diverse data scientists.

AnswerC

This approach identifies and corrects bias systematically.

Why this answer

Option C is correct because bias in AI systems is often embedded in training data or model behavior, not just in feature selection. Regularly auditing predictions across demographic groups and retraining with fairness constraints (e.g., demographic parity or equalized odds) allows the organization to detect and correct disparate impact without sacrificing model performance. This aligns with the AI0-001 focus on continuous monitoring and iterative improvement in AI operations.

Exam trap

CompTIA often tests the misconception that removing sensitive attributes (like race or gender) automatically makes a model fair, when in reality proxy features and biased training data can perpetuate discrimination.

How to eliminate wrong answers

Option A is wrong because simply removing demographic features does not eliminate bias; proxy features (e.g., zip code, education level) can still encode the same discriminatory patterns, and the model may learn biased correlations from the remaining data. Option B is wrong because no algorithm is inherently unbiased; bias arises from data, labeling, and deployment context, so switching algorithms without addressing root causes will not guarantee fairness. Option D is wrong because hiring more diverse data scientists, while beneficial for broader perspectives, does not directly mitigate existing model bias; technical interventions like auditing and retraining with fairness constraints are required.

Practice this question →

12

Multi-Selecthard

An organization is implementing an AI governance framework. Which THREE components are essential for compliance with ethical AI standards?

Select 3 answers

A.Data privacy protection measures (e.g., differential privacy).

B.Open-source licensing of all models.

C.Maximizing model accuracy to increase revenue.

D.Model explainability and interpretability mechanisms.

E.Regular bias auditing of models.

AnswersA, D, E

Privacy is a key ethical requirement.

Why this answer

Options A, B, and D are correct because bias auditing, explainability, and privacy protection are core ethical requirements. Option C is wrong because maximizing revenue is a business goal, not an ethical requirement. Option E is wrong because open-source is not a mandatory component.

Practice this question →

13

MCQmedium

A data engineering team is designing a pipeline to train a model on streaming data. The data arrives in a time-series format. Which approach should they use to ensure the model reflects current trends without catastrophic forgetting?

A.Implement incremental learning with periodic validation

B.Use a sliding window of the most recent data for training

C.Deploy an ensemble of models trained on different time periods

D.Retrain the entire model from scratch every week

AnswerA

Incremental learning adapts to new data while retaining previous knowledge.

Why this answer

Incremental learning (also called online learning) allows the model to update its parameters continuously as new streaming data arrives, without requiring access to historical data. By coupling this with periodic validation on a held-out set, the team can detect concept drift and ensure the model adapts to current trends while avoiding catastrophic forgetting, which occurs when new updates overwrite previously learned patterns.

Exam trap

CompTIA often tests the misconception that a sliding window of recent data alone prevents catastrophic forgetting, but without a mechanism like elastic weight consolidation or replay buffers, the model still forgets older but recurring patterns.

How to eliminate wrong answers

Option B is wrong because a sliding window of only the most recent data discards older patterns entirely, which can cause catastrophic forgetting of long-term seasonality or trends. Option C is wrong because an ensemble of models trained on different time periods does not inherently adapt to streaming data; it requires retraining or adding new models over time and can become computationally expensive without addressing forgetting in individual models. Option D is wrong because retraining the entire model from scratch every week is inefficient for streaming data, introduces latency, and may still cause forgetting of intra-week patterns if the retraining window is too narrow.

Practice this question →

14

MCQhard

A global retailer uses an AI model to forecast demand across thousands of stores. After deployment, the model's predictions become less accurate during holiday seasons. The training data included two years of holiday periods. What is the most effective operational strategy to handle this recurring seasonal drift?

A.Deploy an anomaly detection system to flag holiday prediction outliers

B.Implement a scheduled retraining cycle just before each holiday period

C.Use an ensemble of models trained on different time periods

D.Increase the volume of training data by including five years of history

AnswerB

Proactive retraining with recent holiday data mitigates seasonal drift.

Why this answer

Scheduled retraining just before each holiday season directly addresses the recurring seasonal drift by updating the model with the most recent holiday data patterns. This is the most effective operational strategy because it proactively aligns the model with the known, periodic shift in demand behavior, rather than reacting to errors or relying on static historical data.

Exam trap

CompTIA often tests the misconception that more data or anomaly detection is the universal solution to drift, but the trap here is that candidates overlook the need for proactive, scheduled updates tailored to known recurring patterns rather than reactive or static fixes.

How to eliminate wrong answers

Option A is wrong because anomaly detection only flags outliers after predictions are made, it does not correct the underlying model drift or improve forecast accuracy during the holiday period. Option C is wrong because an ensemble of models trained on different time periods may reduce variance but does not specifically target the recurring seasonal pattern; it could still suffer from drift if none of the models are updated for the current holiday context. Option D is wrong because simply adding more historical data (five years) does not guarantee the model will adapt to the most recent seasonal shifts; older data may even introduce outdated patterns that dilute the relevance of recent holiday trends.

Practice this question →

15

MCQhard

A large e-commerce company has deployed a real-time product recommendation system using a neural collaborative filtering model. The model was trained on six months of user click and purchase data. For the first three months after deployment, the click-through rate (CTR) improved by 15%. However, starting in the fourth month, CTR began decreasing steadily despite no changes to the system infrastructure or data pipeline. The product manager suspects model decay but the engineering team insists the model is static and should not degrade. The data science lead suggests investigating further. They have access to production logs, A/B testing framework, and historical model versions. What is the BEST course of action to diagnose and address the issue?

A.Re-deploy the model with additional features such as time of day and user device.

B.Increase the frequency of batch inference from hourly to every 10 minutes to improve responsiveness.

C.Set up an A/B test comparing the current model against the original baseline model using recent traffic.

D.Retrain the model on only the most recent 30 days of data and replace the current model.

AnswerC

A/B testing isolates whether the current model underperforms relative to a known good version, confirming decay.

Why this answer

Option C is correct because setting up an A/B test comparing the current model against the original baseline model using recent traffic directly isolates whether the model's predictive performance has degraded due to concept drift (changes in user behavior over time). Since the model is static but the data distribution has shifted, the A/B test provides empirical evidence of decay by measuring CTR differences under identical conditions, which is the standard diagnostic step before any retraining or feature engineering.

Exam trap

CompTIA often tests the principle that diagnosing model decay requires a controlled comparison (A/B test) rather than immediately retraining or adding features, and the trap here is assuming that a static model cannot degrade when the underlying data distribution changes.

How to eliminate wrong answers

Option A is wrong because adding features like time of day or user device without first diagnosing the root cause of CTR decline may introduce noise or overfitting, and does not address the likely concept drift. Option B is wrong because increasing batch inference frequency improves latency but does not affect model accuracy or counteract data distribution shifts; the model's predictions remain unchanged regardless of inference cadence. Option D is wrong because retraining on only the most recent 30 days of data could discard valuable long-term patterns and may cause catastrophic forgetting, and it bypasses the necessary diagnostic step of confirming that model decay is indeed the issue.

Practice this question →

16

MCQmedium

A batch inference pipeline fails intermittently with out-of-memory errors when processing large datasets. The pipeline uses pandas DataFrames and feeds a pre-trained model. Which change would most effectively reduce memory consumption?

A.Increase the instance size of the compute node

B.Use a database instead of CSV files

C.Convert the model to use half-precision

D.Split the data into smaller chunks and process sequentially

AnswerD

Chunking reduces peak memory by processing subsets of the data at a time.

Why this answer

Option D is correct because splitting a large dataset into smaller chunks and processing them sequentially directly addresses the root cause of the out-of-memory error: the entire dataset is loaded into memory at once via pandas DataFrames. By processing data in batches, each chunk fits within the available RAM, preventing memory exhaustion while still allowing the pipeline to complete the full inference workload.

Exam trap

CompTIA often tests the misconception that scaling up hardware (Option A) is the best solution, when in fact architectural changes like chunking (Option D) are more effective and cost-efficient for batch processing workloads.

How to eliminate wrong answers

Option A is wrong because increasing the instance size merely adds more memory, which is a temporary workaround that does not fix the underlying inefficiency and increases cost; the pipeline will still fail if the dataset grows beyond the new limit. Option B is wrong because using a database instead of CSV files changes the storage layer but does not inherently reduce memory consumption during inference—pandas still loads the entire result set into a DataFrame unless chunked queries are explicitly used. Option C is wrong because converting the model to half-precision (FP16) reduces model memory footprint but does not address the primary memory consumer, which is the pandas DataFrame holding the large dataset; the model is typically much smaller than the data.

Practice this question →

17

Multi-Selecthard

Which THREE factors are most critical to consider when designing a continuous integration/continuous deployment (CI/CD) pipeline for machine learning?

Select 3 answers

A.Data quality and schema validation

B.A/B testing framework for comparing models

C.Automated model performance benchmarking

D.Automated unit testing of application code

E.Versioning of datasets, models, and training code

AnswersA, C, E

Data validation ensures reliable model inputs.

Why this answer

Data quality and schema validation (A) are critical because ML pipelines are highly sensitive to data drift, missing values, and schema mismatches that can silently degrade model performance. Without automated validation at the CI stage, bad data can pass through and corrupt model training or inference, leading to unreliable outputs in production.

Exam trap

CompTIA often tests the distinction between ML-specific pipeline requirements and general DevOps practices, so candidates mistakenly select generic options like unit testing (D) or A/B testing (B) instead of the ML-critical factors of data validation, model benchmarking, and versioning.

Practice this question →

18

MCQmedium

A company wants to roll out a new recommendation model to production. They decide to run an A/B test where 10% of users see the new model and 90% see the old model. After one week, the new model shows a 5% improvement in click-through rate. What is the next best action?

A.Immediately roll out the new model to 100% of users

B.Revert to the old model because the improvement is minimal

C.Run the test for another month to ensure statistical significance

D.Increase the testing percentage gradually while monitoring performance metrics and guardrails

AnswerD

This approach captures benefits while controlling risk.

Why this answer

Option D is correct because a 5% improvement observed over only one week with a 10% traffic split is insufficient to confirm statistical significance or rule out novelty effects, data drift, or seasonal bias. The recommended best practice in AI deployment is to gradually increase the testing percentage (e.g., 10% → 25% → 50% → 100%) while continuously monitoring performance metrics and guardrails (e.g., click-through rate, conversion rate, latency, and error rates) to ensure the new model generalizes safely across the full user population.

Exam trap

CompTIA often tests the misconception that a short-term observed improvement is automatically statistically significant, tempting candidates to choose immediate full rollout (Option A) or premature reversion (Option B), when the correct answer emphasizes incremental deployment with continuous monitoring.

How to eliminate wrong answers

Option A is wrong because immediately rolling out to 100% of users risks exposing the entire user base to a model that may have only shown a temporary or statistically insignificant improvement, potentially causing negative business impact if the model fails under full load or exhibits unexpected behavior. Option B is wrong because reverting to the old model based on a 'minimal' improvement is premature; a 5% uplift could be meaningful depending on the baseline, and the test should be allowed to run longer to gather sufficient data for a valid statistical conclusion. Option C is wrong because running the test for another month without adjusting the traffic split or monitoring guardrails is inefficient and may still not guarantee statistical significance if the sample size remains too small; the correct approach is to increase the traffic percentage gradually while verifying performance at each step.

Practice this question →

19

Multi-Selecteasy

Which THREE are common pitfalls when operationalizing AI models? (Select THREE.)

Select 3 answers

A.Training-serving skew due to differences in data preprocessing

B.Using simpler models that are easier to debug

C.Lack of monitoring for model performance drift

D.Ignoring infrastructure scalability requirements

E.Automating the model retraining process

AnswersA, C, D

Causes model inaccuracies in production.

Why this answer

Option A is correct because training-serving skew occurs when the data preprocessing logic used during model training differs from that used during inference in production. This is a common pitfall in operationalizing AI models, as even minor discrepancies in feature engineering, normalization, or encoding can cause significant performance degradation. For example, using different libraries or versions for tokenization between training and serving pipelines directly leads to skew.

Exam trap

CompTIA often tests the distinction between operational pitfalls and best practices, so the trap here is that candidates may mistake a recommended practice (like using simpler models or automating retraining) for a pitfall, when in fact the pitfall is the lack of monitoring or ignoring scalability.

Practice this question →

20

MCQmedium

An operations team monitors a classification model in production. The confusion matrix for the model shows the following values: TP=1500, FN=500, FP=600, TN=2400. Which metric should the team calculate to assess the model's ability to avoid false positives?

A.Accuracy

B.F1-score

C.Recall

D.Precision

AnswerD

Precision = TP/(TP+FP) = 1500/2100 = 0.714, directly relates to false positives.

Why this answer

Precision (TP / (TP + FP)) directly measures the proportion of positive identifications that were actually correct, making it the ideal metric to assess the model's ability to avoid false positives. With TP=1500 and FP=600, precision is 1500/(1500+600)=0.714, indicating that 71.4% of predicted positives are true positives, while 28.6% are false positives. The question explicitly asks about avoiding false positives, which is the inverse of precision's focus on the correctness of positive predictions.

Exam trap

CompTIA often tests the distinction between precision and recall by framing a question about 'avoiding false positives' or 'avoiding false negatives,' leading candidates to confuse precision with recall or to default to F1-score as a balanced metric.

How to eliminate wrong answers

Option A is wrong because Accuracy ((TP+TN)/(TP+TN+FP+FN)) measures overall correctness across all classes, not specifically the model's ability to avoid false positives; it can be high even when false positives are numerous if the negative class dominates. Option B is wrong because F1-score is the harmonic mean of precision and recall, balancing both false positives and false negatives, but it does not isolate the ability to avoid false positives as precision does. Option C is wrong because Recall (TP/(TP+FN)) measures the model's ability to find all positive samples, focusing on false negatives, not false positives.

Practice this question →

21

MCQhard

A financial institution deploys an AI credit scoring model. After six months, the model's performance drops significantly. Analysis shows that the relationship between features and labels has changed. Which term describes this phenomenon?

A.Concept drift

B.Model decay

C.Overfitting

D.Data drift

AnswerA

Concept drift directly refers to changes in the relation between inputs and outputs.

Why this answer

Concept drift occurs when the statistical relationship between input features and the target label changes over time, which is exactly what happened when the credit scoring model's performance dropped due to a shift in the feature-label relationship. This is distinct from data drift, which only involves changes in the input data distribution without affecting the label mapping.

Exam trap

CompTIA often tests the distinction between concept drift and data drift, and the trap here is that candidates confuse a change in input data distribution (data drift) with a change in the underlying relationship between features and labels (concept drift), leading them to incorrectly select data drift.

How to eliminate wrong answers

Option B (Model decay) is wrong because model decay is a general term for performance degradation over time, but it does not specifically describe a change in the feature-label relationship; it could be caused by data drift, concept drift, or other factors. Option C (Overfitting) is wrong because overfitting refers to a model learning noise or specific patterns in the training data that do not generalize, not a post-deployment shift in the underlying relationship. Option D (Data drift) is wrong because data drift only describes changes in the distribution of input features (e.g., customer income shifts), not a change in the mapping from features to the target label (e.g., what constitutes a good credit risk).

Practice this question →

22

Multi-Selecthard

An ML operations team needs to monitor a deployed model's performance. Which TWO metrics are most useful for detecting concept drift in a regression model? (Choose two.)

Select 2 answers

A.Distribution of input features

B.Distribution of residuals between predictions and actuals

C.Classification accuracy

D.Model inference latency

E.Mean absolute error (MAE) over a sliding time window

AnswersB, E

Changing residual distribution can indicate concept drift.

Why this answer

Option B is correct because monitoring the distribution of residuals (predicted vs. actual values) directly reveals when the relationship between inputs and outputs has shifted, which is the essence of concept drift. In a regression model, if the residuals become systematically biased or their variance changes over time, it indicates that the underlying data-generating process has changed, even if input feature distributions remain stable.

Exam trap

CompTIA often tests the distinction between covariate drift and concept drift, trapping candidates who think monitoring input features is sufficient for detecting all types of model degradation.

Practice this question →

23

MCQhard

An ML team uses the model registry above. After deploying version 3 to production, they discover it has a critical bug. What is the fastest way to roll back to a stable version without re-deploying from scratch?

A.Promote version 2 from Staging to Production

B.Redeploy version 1 by updating the production deployment to use its artifact

C.Retrain the model with corrected data

D.Delete version 3 from the registry

AnswerB

Version 1 is still in Production and can be quickly redeployed.

Why this answer

Option B is correct because it directly updates the production deployment to reference the artifact of version 1, which is a stable, previously validated version. This avoids the overhead of re-deploying from scratch by simply pointing the existing deployment to a known good artifact in the model registry, leveraging the registry's artifact storage and deployment integration.

Exam trap

CompTIA often tests the misconception that deleting a model version or promoting a staging version is the fastest rollback, when in reality the fastest method is to update the existing deployment's artifact reference to a known stable version.

How to eliminate wrong answers

Option A is wrong because promoting version 2 from Staging to Production would require a new deployment process (e.g., updating the deployment configuration or triggering a CI/CD pipeline), which is not the fastest rollback method; it also assumes version 2 is stable, which may not be the case if it was never validated in production. Option C is wrong because retraining the model with corrected data is a time-consuming process that involves data preparation, training, and validation, and does not address the immediate need to roll back to a stable version. Option D is wrong because deleting version 3 from the registry does not affect the running production deployment; the deployment continues to use the artifact of version 3 until the deployment configuration is explicitly updated to reference a different version.

Practice this question →

24

Multi-Selecthard

Which THREE components are essential for implementing a successful MLOps pipeline for a continuously deployed AI system?

Select 3 answers

A.Manual approval gates for each deployment

B.Canary deployment strategy

C.Model registry for version control and metadata management

D.Automated testing and validation of models and pipelines

E.Data and model versioning

AnswersC, D, E

Registry is critical for tracking and managing model versions.

Why this answer

A model registry (C) is essential for MLOps because it provides version control, metadata management, and lineage tracking for all trained models. This enables reproducibility, auditability, and seamless rollback in a continuously deployed AI system, ensuring that only validated models are promoted to production.

Exam trap

CompTIA often tests the distinction between operational strategies (like canary deployments) and foundational pipeline components (like versioning and registries), leading candidates to confuse deployment tactics with essential infrastructure.

Practice this question →

25

MCQeasy

Based on the exhibit, which action is most likely to resolve the memory issue?

A.Add more training data.

B.Increase the learning rate.

C.Switch to a CPU.

D.Reduce the batch size.

AnswerD

Smaller batches reduce the memory allocated for intermediate tensors.

Why this answer

The exhibit shows an out-of-memory (OOM) error during training. Reducing the batch size decreases the memory footprint per iteration, allowing the model to fit within available GPU memory. This directly resolves the memory issue without altering the model architecture or data.

Exam trap

CompTIA often tests the misconception that memory errors are solved by adding more data or changing hardware, when in fact the simplest and most common fix is adjusting the batch size to fit within available GPU memory.

How to eliminate wrong answers

Option A is wrong because adding more training data increases the dataset size, which does not reduce per-batch memory consumption and may even exacerbate memory pressure during data loading. Option B is wrong because increasing the learning rate affects convergence behavior and gradient magnitudes, not memory usage; it can cause instability or divergence but does not free GPU memory. Option C is wrong because switching to a CPU would typically use system RAM instead of GPU memory, but CPUs are far slower for deep learning training and do not resolve the underlying memory constraint—they just shift the bottleneck, often making training impractically slow.

Practice this question →

26

MCQeasy

A data science team deployed a model for real-time predictions. After two weeks, the model's accuracy dropped from 92% to 80%. The monitoring system shows no data drift in features, but the target variable distribution has shifted. Which approach should the team use to detect this issue?

A.Schedule manual weekly reviews of model predictions

B.Monitor the distribution of the predicted target variable over time

C.Retrain the model immediately with new data

D.Monitor input feature distributions using a KS test

AnswerB

This detects target drift, which indicates concept drift.

Why this answer

Option B is correct because monitoring the distribution of the predicted target variable directly detects concept drift, which occurs when the relationship between features and the target changes. Since the monitoring system shows no data drift in features, the accuracy drop is likely due to a shift in the target variable's distribution, and tracking predictions over time reveals this shift. This approach aligns with MLOps best practices for detecting concept drift without requiring immediate retraining.

Exam trap

CompTIA often tests the distinction between data drift and concept drift, trapping candidates who assume that monitoring input features (Option D) is sufficient to detect all performance degradation.

How to eliminate wrong answers

Option A is wrong because manual weekly reviews are reactive, not proactive, and cannot provide real-time detection of distribution shifts; they also introduce latency and human error. Option C is wrong because retraining the model immediately without diagnosing the root cause may waste resources and could reinforce biased patterns if the drift is temporary or due to a data quality issue. Option D is wrong because monitoring input feature distributions using a KS test detects data drift, but the problem states there is no data drift in features, so this approach would not identify the target variable shift causing the accuracy drop.

Practice this question →

27

MCQmedium

A financial institution uses a machine learning model to approve personal loans. The model was trained on historical data that includes applicant age, income, credit score, and loan amount. Compliance officers have received customer complaints suggesting the model may be discriminating against applicants over 60 years old. Initial analysis shows that the approval rate for applicants over 60 is 20 percentage points lower than for younger applicants with similar credit profiles. The data science team has been asked to investigate and remediate any bias. They have access to the training data, model coefficients, and can retrain or modify the model. What is the FIRST step the team should take?

A.Replace the model with a third-party vendor model that claims to be bias-free.

B.Re-sample the training data to have equal numbers of applicants over and under 60.

C.Conduct a fairness audit using appropriate metrics such as disparate impact ratio on the current model.

D.Remove the age feature from the training data and retrain the model.

AnswerC

An audit quantifies bias and provides a baseline to measure remediation effectiveness.

Why this answer

Option C is correct because the first step in addressing potential bias is to conduct a fairness audit using established metrics like the disparate impact ratio (e.g., the 80% rule from the US Equal Employment Opportunity Commission). This quantifies whether the model's approval rate for applicants over 60 is less than 80% of the rate for the younger group, providing a legally and technically sound baseline before any remediation. Without this measurement, any subsequent changes (like resampling or removing features) could be misguided or ineffective.

Exam trap

CompTIA often tests the misconception that removing a protected attribute (like age) is sufficient to eliminate bias, when in fact proxy features can perpetuate discrimination, making a fairness audit the mandatory first step.

How to eliminate wrong answers

Option A is wrong because replacing the model with a third-party vendor model that claims to be bias-free does not address the specific bias found in the current system, and it bypasses the necessary diagnostic step of understanding the root cause; vendor claims are not a substitute for empirical validation. Option B is wrong because resampling the training data to have equal numbers of applicants over and under 60 does not guarantee fairness—it can introduce sampling bias, distort the real-world distribution, and may not correct the underlying model behavior that causes disparate impact. Option D is wrong because simply removing the age feature from the training data and retraining the model is a naive approach; age may be correlated with other features (e.g., income, credit score), so the model could still indirectly discriminate through proxy variables, a phenomenon known as 'bias amplification' or 'redundant encoding'.

Practice this question →

28

MCQmedium

An operations team sees the log entries above for a production ML model. What is the MOST likely root cause of the latency spike?

A.A scheduled training job consuming GPU resources on the same node.

B.A memory leak in the model serving container causing gradual slowdown.

C.A network outage between the model server and the client.

D.A bug in the model's preprocessing code causing incorrect predictions.

AnswerB

Memory leak can cause garbage collection overhead and increased latency.

Why this answer

The log entries show a gradual increase in latency over time, which is characteristic of a memory leak in the model serving container. As memory consumption grows, garbage collection pauses become more frequent and longer, eventually causing request processing to slow down. This pattern is distinct from a sudden spike caused by resource contention or network issues.

Exam trap

CompTIA often tests the distinction between gradual vs. sudden performance degradation patterns, where candidates mistakenly attribute a gradual latency increase to a transient resource contention event like a training job or network issue.

How to eliminate wrong answers

Option A is wrong because a scheduled training job consuming GPU resources would cause a sudden, sharp latency spike at the start of training, not a gradual increase over time. Option C is wrong because a network outage would result in complete request failures or timeouts, not a progressive latency degradation. Option D is wrong because a bug in preprocessing code causing incorrect predictions would affect prediction accuracy, not the latency of the serving endpoint.

Practice this question →

29

MCQmedium

A data scientist fine-tunes a large language model for a legal document summarization task. After fine-tuning, the model performs well on test data but produces summaries that include hallucinated legal clauses. Which mitigation strategy is most effective?

A.Use a different tokenizer during fine-tuning.

B.Decrease the temperature parameter to 0.1 during inference.

C.Implement retrieval-augmented generation (RAG) to provide factual context.

D.Set a maximum token limit of 50 for each summary.

AnswerC

RAG fetches relevant documents to condition the generation, reducing reliance on parametric memory.

Why this answer

Option A is correct because RAG grounds the model in retrieved documents, reducing hallucinations. Option B is wrong as temperature affects creativity but does not eliminate hallucinations. Option C is wrong because it reduces output length but may not prevent false content.

Option D is wrong because it does not address the model's tendency to fabricate.

Practice this question →

30

MCQeasy

During an AI model deployment, the operations team notices that inference requests are taking longer than expected. Which component is most likely causing the bottleneck?

A.Input data preprocessing pipeline

B.API gateway rate limiting

C.Database connection pool size

D.The machine learning model's size and architecture

AnswerD

Larger models take longer to compute predictions.

Why this answer

The machine learning model's size and architecture directly determine the computational complexity of inference. Larger models with more parameters or deeper architectures require more matrix multiplications and memory bandwidth, which increases latency per request. This is the most common bottleneck in AI deployment because the model itself is the core computation unit, and its inference time scales with its complexity.

Exam trap

CompTIA often tests the misconception that operational components like API gateways or databases are the primary cause of slow inference, when in fact the model's computational demand is the root cause, especially in scenarios where preprocessing and postprocessing are negligible.

How to eliminate wrong answers

Option A is wrong because input data preprocessing typically involves lightweight operations like normalization or tokenization, which are orders of magnitude faster than model inference and rarely the primary bottleneck unless the pipeline is poorly optimized. Option B is wrong because API gateway rate limiting controls the number of requests per second, not the latency of individual inference requests; it would cause throttling errors, not slow responses. Option C is wrong because database connection pool size affects the ability to fetch or store data concurrently, but inference latency is dominated by model computation, not database lookups, unless the model relies on external data retrieval per request.

Practice this question →

31

MCQeasy

An organization is deploying an AI model on edge devices with limited computational resources. Which model optimization technique is most appropriate?

A.Perform additional feature engineering

B.Apply model quantization

C.Use an ensemble of models

D.Increase the training dataset size

AnswerB

Quantization reduces precision, making models smaller and faster.

Why this answer

Model quantization reduces the precision of the model's weights and activations (e.g., from 32-bit floating point to 8-bit integer), which significantly decreases memory footprint and computational requirements. This makes it ideal for deployment on edge devices with limited resources, as it enables faster inference with minimal accuracy loss.

Exam trap

CompTIA often tests the misconception that improving model performance (e.g., via feature engineering or more data) is equivalent to optimizing for deployment constraints, when in fact techniques like quantization directly address resource limitations.

How to eliminate wrong answers

Option A is wrong because feature engineering improves model input quality but does not reduce the computational load or model size required for inference on edge devices. Option C is wrong because using an ensemble of models increases the total number of parameters and inference time, which is counterproductive for resource-constrained edge devices. Option D is wrong because increasing the training dataset size improves model generalization but does not reduce the model's computational requirements during inference; it may even increase training time and model complexity.

Practice this question →

32

MCQeasy

During model training, the data science team discovers that many input features contain missing values. Which step should be taken to improve data quality?

A.Implement data validation checks to handle missing data appropriately (e.g., imputation).

B.Increase the model complexity to handle missing data.

C.Ignore missing values and train the model.

D.Remove all records with missing values.

AnswerA

This ensures data quality without losing valuable information.

Why this answer

Option A is correct because data validation checks, such as imputation (e.g., mean, median, or KNN imputation), directly address missing values by estimating plausible replacements based on the available data. This improves data quality and prevents bias or loss of information that could degrade model performance. In the context of AI implementation, handling missing data is a fundamental data preprocessing step to ensure robust model training.

Exam trap

CompTIA often tests the misconception that 'ignoring missing data' or 'removing rows' is acceptable, when in fact proper data validation and imputation are required to maintain data integrity and model validity.

How to eliminate wrong answers

Option B is wrong because increasing model complexity (e.g., adding more layers or parameters) does not inherently handle missing data; it may overfit to noise or propagate errors from incomplete features. Option C is wrong because ignoring missing values can cause algorithms (e.g., linear regression, SVM) to fail during training or produce biased coefficients, as many implementations do not natively support NaN inputs. Option D is wrong because removing all records with missing values can lead to significant data loss, reduce sample size, and introduce selection bias, especially when missingness is not completely at random (MCAR).

Practice this question →

33

MCQhard

An organization uses a batch prediction pipeline that processes daily customer data to generate marketing recommendations. One month after deployment, the model's performance degrades significantly. The data pipeline logs show that the input data schema has changed — a new categorical feature 'customer_segment' has been added, and the existing feature 'age_group' is now missing. Which step should the operations team take first?

A.Retrain the model using the new schema and redeploy

B.Update the data preprocessing pipeline to handle missing features and add the new feature

C.Revert to the previous week's model version that was performing well

D.Contact the data engineering team to revert the schema change

AnswerB

This adapts the pipeline to the new schema, enabling proper feeding to the model.

Why this answer

Option B is correct because the immediate priority is to ensure the data preprocessing pipeline can handle the schema change without breaking. The pipeline must gracefully handle the missing 'age_group' feature (e.g., by imputing or dropping it) and incorporate the new 'customer_segment' feature before any model retraining or rollback. This prevents data drift from causing inference errors and maintains pipeline stability.

Exam trap

CompTIA often tests the misconception that retraining the model (Option A) is the first step to fix performance degradation, but the trap here is that the root cause is a schema mismatch in the preprocessing layer, not the model weights.

How to eliminate wrong answers

Option A is wrong because retraining the model without first fixing the preprocessing pipeline would still fail due to missing or misaligned features, and it assumes the new schema is already compatible. Option C is wrong because reverting to a previous model version does not address the root cause — the input data schema has changed, so the old model would still receive malformed data and produce incorrect predictions. Option D is wrong because contacting the data engineering team to revert the schema change is a reactive, non-technical workaround that ignores the need for the operations team to adapt the pipeline to handle schema evolution autonomously.

Practice this question →

34

MCQeasy

A company deploys a deep learning model for real-time image classification. After deployment, they notice high inference latency exceeding the 100ms SLA. Which action would most likely reduce latency without significantly impacting accuracy?

A.Add more training data to improve model robustness

B.Replace the model with a simpler logistic regression model

C.Increase batch size for inference

D.Apply model quantization

AnswerD

Quantization reduces model size and inference time with minor accuracy impact.

Why this answer

Model quantization reduces the precision of the model's weights and activations (e.g., from 32-bit floating point to 8-bit integer), which significantly decreases memory bandwidth and computational requirements during inference. This directly lowers latency without fundamentally altering the model's learned representations, so accuracy degradation is typically minimal (often <1-2%).

Exam trap

CompTIA often tests the misconception that increasing batch size always improves latency, when in fact it increases per-request latency in real-time systems, and that simpler models are always better for latency, ignoring the critical accuracy requirement.

How to eliminate wrong answers

Option A is wrong because adding more training data improves model robustness and generalization but does not reduce inference latency; it may even increase training time and model complexity. Option B is wrong because replacing a deep learning model with a logistic regression model would drastically reduce accuracy for complex image classification tasks, failing the 'without significantly impacting accuracy' constraint. Option C is wrong because increasing batch size for inference increases the number of images processed per batch, which can improve throughput but actually increases per-request latency (time to first prediction) and may exceed the 100ms SLA for real-time applications.

Practice this question →

35

MCQmedium

A company uses an AI model to predict equipment failures. The model outputs a probability of failure. To minimize false alarms, the operations team wants a high precision. Which deployment strategy should they implement?

A.Retrain the model on more recent data

B.Increase the decision threshold for positive classification

C.Decrease the decision threshold

D.Use an ensemble of models with voting

AnswerB

Higher threshold means fewer positive predictions, increasing precision.

Why this answer

To minimize false alarms and achieve high precision, the operations team should increase the decision threshold for positive classification. A higher threshold means the model only predicts a failure when it is very confident, reducing the number of false positives (false alarms) at the cost of potentially missing some true failures (lower recall). This directly controls the precision-recall trade-off without changing the underlying model.

Exam trap

CompTIA often tests the precision-recall trade-off by making candidates confuse increasing the threshold (which improves precision) with decreasing it (which improves recall), or by suggesting retraining or ensemble methods as direct solutions for precision tuning.

How to eliminate wrong answers

Option A is wrong because retraining on more recent data improves model accuracy and relevance but does not directly control the precision-recall trade-off; it may not reduce false alarms if the model's calibration remains unchanged. Option C is wrong because decreasing the decision threshold would make the model more sensitive, increasing the number of positive predictions and thus increasing false alarms (lower precision), which is the opposite of the goal. Option D is wrong because using an ensemble of models with voting can improve overall accuracy and robustness, but it does not specifically target precision; the voting mechanism may still produce many false positives unless the threshold is also adjusted.

Practice this question →

36

MCQhard

A large financial services company deploys multiple AI models on a shared Kubernetes cluster with GPU nodes. The models serve real-time fraud detection and credit scoring. Recently, the operations team observed frequent out-of-memory (OOM) errors during peak hours, causing inference failures. The monitoring dashboards show GPU memory utilization averaging 90% during peak times, and pods are being evicted. The team has allocated 8GB per pod and the total cluster GPU memory is 32GB. The models require at least 4GB each, but the fraud detection model occasionally spikes to 7GB. Which course of action best resolves the OOM errors while maintaining high availability?

A.Reduce the batch size and model complexity for all models to lower memory footprint

B.Set resource limits and requests per model based on observed usage, and implement pod priority classes

C.Provision larger GPU nodes with 48GB memory each

D.Increase the memory request for all pods to 8GB to ensure they have enough

AnswerB

Limits prevent OOM, priority ensures critical models get resources.

Why this answer

Option B is correct because it uses Kubernetes resource management features—setting precise resource requests and limits based on observed GPU memory usage—combined with pod priority classes to ensure critical fraud detection pods are scheduled and retained during contention. This prevents OOM errors by capping memory per pod while allowing the spike-prone fraud model to be prioritized over less critical workloads, maintaining high availability without overprovisioning.

Exam trap

CompTIA often tests the misconception that simply increasing resource requests or node size solves OOM errors, when the real solution involves proper resource limits and scheduling policies to handle variable workloads and maintain availability.

How to eliminate wrong answers

Option A is wrong because reducing batch size and model complexity may degrade inference accuracy or latency, and it does not address the root cause of memory spikes for the fraud detection model; it is a workaround that sacrifices performance. Option C is wrong because provisioning larger GPU nodes (48GB) is a costly overprovisioning approach that does not solve the scheduling or priority issue—it only shifts the bottleneck and may still allow a single pod to consume excessive memory and cause OOM on the larger node. Option D is wrong because increasing the memory request for all pods to 8GB does not prevent the fraud detection model from spiking to 7GB (which is under 8GB) and ignores the need for limits and priority; it may also lead to resource waste and does not address eviction during peak contention.

Practice this question →

37

MCQhard

An ML engineering team has a retraining pipeline that triggers automatically when model accuracy drops below a threshold. Recently, the model's accuracy has been fluctuating, causing frequent retraining and high compute costs. The team suspects the data distribution is changing slowly. Which approach should the team implement to reduce unnecessary retraining while maintaining model performance?

A.Use a simpler model to reduce variability

B.Implement a statistical drift detection method on input features

C.Increase the frequency of model retraining

D.Reduce the batch size for inference

AnswerB

Drift detection ensures retraining only when meaningful change occurs.

Why this answer

Option B is correct because implementing a statistical drift detection method (e.g., using KL divergence, PSI, or ADWIN) on input features allows the team to identify when the data distribution has genuinely changed, rather than reacting to random accuracy fluctuations. This reduces unnecessary retraining by triggering the pipeline only when statistically significant drift is detected, maintaining model performance without the high compute costs of frequent retraining.

Exam trap

CompTIA often tests the misconception that increasing retraining frequency or simplifying the model can solve drift-related issues, but the correct approach is to detect drift statistically before deciding to retrain.

How to eliminate wrong answers

Option A is wrong because using a simpler model may reduce variability but does not address the root cause of distribution drift; it could also degrade performance by underfitting the true underlying patterns. Option C is wrong because increasing retraining frequency would exacerbate the compute cost problem and may overfit to transient fluctuations, not solve the issue of unnecessary retraining. Option D is wrong because reducing the batch size for inference affects throughput and latency, not the detection of data distribution changes or the decision to retrain.

Practice this question →

38

MCQeasy

A developer sees the above error during inference on a deployed image classification model. What is the most likely cause?

A.The model version is incompatible with the serving framework

B.The input images are not being resized to the required dimensions

C.The inference server does not support batch processing

D.The model is overfitting to a specific image size

AnswerB

Model expects 299x299 but receives 224x224, so preprocessing is missing resizing.

Why this answer

The error during inference typically indicates a mismatch between the input tensor shape expected by the model and the shape of the provided image. Most image classification models are trained on fixed-size inputs (e.g., 224x224 for ResNet), and failing to resize the input images to those required dimensions causes a shape mismatch error in the serving framework (e.g., TensorFlow Serving or TorchServe). Option B correctly identifies this as the most likely cause because the error message often references tensor shape incompatibility.

Exam trap

CompTIA often tests the misconception that inference errors are caused by model versioning or server configuration, when the actual issue is a simple preprocessing step like image resizing that candidates overlook.

How to eliminate wrong answers

Option A is wrong because model version incompatibility with the serving framework usually manifests as a loading or serialization error (e.g., 'Unsupported op set' or 'Model not found'), not a runtime shape mismatch during inference. Option C is wrong because the inference server's batch processing capability is unrelated to the error; even if batch processing is unsupported, the server would still process single images, and the error would not be about input dimensions. Option D is wrong because overfitting to a specific image size is a training-phase issue that would affect model accuracy, not cause a runtime shape mismatch error during inference.

Practice this question →

39

MCQhard

A company operating in the EU must comply with GDPR. An AI model processes personal data for customer segmentation. Which of the following ensures compliance?

A.Obtain explicit consent once and use data indefinitely.

B.Store personal data permanently for model improvement.

C.Use only aggregated data without any individual records.

D.Implement data anonymization and allow users to request deletion.

AnswerD

Anonymization reduces privacy risk, and deletion capability ensures compliance with GDPR rights.

Why this answer

Option D is correct because GDPR mandates that personal data must be processed lawfully, with data minimization and the right to erasure. Implementing data anonymization removes personally identifiable information (PII) so the data is no longer considered personal data under GDPR, and allowing users to request deletion directly satisfies the 'right to be forgotten' (Article 17). This approach ensures compliance by both protecting individual privacy and providing a mechanism for data subjects to exercise their legal rights.

Exam trap

CompTIA often tests the misconception that pseudonymization or simple aggregation is sufficient for GDPR compliance, when in fact only irreversible anonymization (where no individual can be re-identified) removes data from GDPR scope, and the right to deletion must still be explicitly supported for any remaining personal data.

How to eliminate wrong answers

Option A is wrong because GDPR requires that consent be specific, informed, and revocable; obtaining consent once does not permit indefinite use, and data must be retained only as long as necessary for the stated purpose. Option B is wrong because storing personal data permanently violates the data minimization and storage limitation principles (Article 5(1)(c) and (e)), and model improvement is not a valid basis for indefinite retention without explicit, ongoing consent. Option C is wrong because while aggregated data reduces risk, it does not automatically ensure compliance if the aggregation method is reversible or if the data can be re-identified; true anonymization must be irreversible and meet the GDPR's standard of 'anonymous information' (Recital 26).

Practice this question →

40

Multi-Selectmedium

A machine learning engineer is deploying a model to production. Which TWO practices are essential for ensuring reproducibility of model predictions?

Select 2 answers

A.Increase the number of training epochs to ensure convergence.

B.Use the same GPU hardware for both training and inference.

C.Use parallel data loading to speed up inference.

D.Version-control the model artifact (e.g., using MLflow or DVC).

E.Fix random seeds for all libraries (e.g., NumPy, TensorFlow).

AnswersD, E

Versioning ensures the exact model is used for inference.

Why this answer

Version-controlling the model artifact (D) is essential because it allows you to reproduce the exact model binary that generated a prediction, ensuring that any changes to the model code, hyperparameters, or training data do not silently alter outputs. Tools like MLflow or DVC store the model along with its metadata, enabling rollback and auditability in production.

Exam trap

CompTIA often tests the misconception that hardware consistency (e.g., same GPU) is required for reproducibility, when in fact deterministic software practices (version control and seed fixing) are the critical factors.

Practice this question →

41

MCQhard

Based on the exhibit, which action is permitted by this policy?

A.Deploy a new model to an endpoint.

B.Update an existing endpoint.

C.Delete an endpoint.

D.Invoke an endpoint for inference.

AnswerA

The allowed actions are necessary and sufficient to deploy a new model.

Why this answer

The exhibit shows an IAM policy that grants the `sagemaker:CreateModel` and `sagemaker:CreateEndpointConfig` actions, but the key action is `sagemaker:CreateEndpoint`. Deploying a new model to an endpoint requires creating a new endpoint, which is explicitly allowed by this policy. The policy does not include `sagemaker:UpdateEndpoint`, `sagemaker:DeleteEndpoint`, or `sagemaker:InvokeEndpoint`, so only creating a new endpoint is permitted.

Exam trap

CompTIA often tests the distinction between creating a new resource versus modifying or deleting an existing one, leading candidates to assume that broad permissions like `CreateEndpoint` also cover updates or invocations, which is incorrect in IAM policy evaluation.

How to eliminate wrong answers

Option B is wrong because updating an existing endpoint requires the `sagemaker:UpdateEndpoint` action, which is not listed in the policy. Option C is wrong because deleting an endpoint requires the `sagemaker:DeleteEndpoint` action, which is not granted. Option D is wrong because invoking an endpoint for inference requires the `sagemaker:InvokeEndpoint` action, which is absent from the policy.

Practice this question →

42

MCQmedium

A company is deploying a fraud detection model that must return predictions within 100ms to avoid transaction delays. The team is deciding between batch and real-time inference. Which factor most strongly supports a real-time inference architecture?

A.The model requires large amounts of historical data for each prediction

B.The application requires immediate feedback for each transaction

C.The infrastructure budget is limited and must be optimized

D.The model can be retrained weekly using gathered data

AnswerB

Real-time inference delivers low-latency predictions for each request.

Why this answer

Real-time inference is required when the application must return predictions within strict latency bounds (e.g., 100ms) to avoid transaction delays. The need for immediate feedback per transaction directly aligns with a real-time architecture, where each request is processed individually as it arrives, rather than waiting for a batch window. Batch inference would introduce unacceptable latency because it processes groups of records on a schedule, not on-demand.

Exam trap

CompTIA often tests the misconception that batch inference is always cheaper or more efficient, but the trap here is that latency requirements (under 100ms) force a real-time architecture regardless of cost or data volume.

How to eliminate wrong answers

Option A is wrong because requiring large amounts of historical data for each prediction does not dictate real-time vs. batch; it affects feature engineering and storage, not inference latency. Option C is wrong because limited infrastructure budget typically favors batch inference, which can use cheaper, less scalable resources and process data in bulk, not real-time. Option D is wrong because weekly retraining is a model update frequency concern, unrelated to the inference serving architecture; both batch and real-time systems can support periodic retraining.

Practice this question →

43

MCQeasy

Refer to the exhibit. The monitoring dashboard for a deployed churn prediction model shows a drift detected flag. However, the error rate and latency are within acceptable ranges. What is the most appropriate immediate action?

A.Trigger automatic retraining using the latest data

B.Roll back to the previous model version immediately

C.Ignore the drift since performance metrics are stable

D.Investigate the type and severity of drift before deciding

AnswerD

Understanding drift (covariate vs concept) informs next steps.

Why this answer

Option B is correct because drift detection warrants investigation before any automated action; retraining or rollback might be premature without understanding the drift type. Option A is wrong because auto-retraining could be risky if drift is benign. Option C is wrong because ignoring drift may lead to future degradation.

Option D is wrong because rollback discards potential improvements.

Practice this question →

44

MCQhard

A company serves a large language model (LLM) on a Kubernetes cluster. The inference latency is acceptable but the cost is high due to GPU usage. The model is 7 billion parameters and requires 16GB GPU memory. The team wants to reduce cost without increasing latency. Which strategy should they implement?

A.Increase the batch size for inference

B.Add more GPU nodes to distribute the load

C.Switch to CPU-based inference

D.Use model quantization to reduce precision

AnswerD

Quantization reduces model size and memory, enabling more efficient GPU usage.

Why this answer

Model quantization reduces the precision of the model's weights (e.g., from FP32 to INT8), which decreases the GPU memory footprint from 16GB to approximately 4GB for a 7B parameter model. This directly lowers GPU cost per inference while maintaining acceptable latency, as the model can run on fewer or cheaper GPUs without increasing inference time.

Exam trap

CompTIA often tests the misconception that adding more hardware (Option B) or increasing batch size (Option A) always reduces cost, when in fact they increase resource usage and cost; the trap is that candidates overlook memory optimization techniques like quantization as a direct cost-reduction strategy.

How to eliminate wrong answers

Option A is wrong because increasing batch size for inference would increase GPU memory usage and could increase latency due to larger memory transfers, not reduce cost without affecting latency. Option B is wrong because adding more GPU nodes would increase cost, not reduce it, and does not address the high GPU memory usage per inference. Option C is wrong because switching to CPU-based inference would drastically increase latency (often 10-100x slower) due to the lack of parallel processing for large matrix operations, violating the requirement to not increase latency.

Practice this question →

45

MCQeasy

A team deploys a machine learning model as a REST API. They want to monitor model drift. Which metric is MOST appropriate for detecting drift in the input data distribution?

A.Model accuracy on a recent holdout set.

B.Population stability index (PSI) comparing training and recent data.

C.F1 score on the training data.

D.Root mean squared error (RMSE) on test data.

AnswerB

PSI directly quantifies distribution shift.

Why this answer

Population stability index (PSI) is the most appropriate metric for detecting drift in input data distribution because it directly measures the shift between the training data distribution and the recent production data distribution. PSI is calculated by binning both distributions and computing the sum of (proportion in bin of recent data minus proportion in bin of training data) times the natural log of their ratio, making it sensitive to changes in feature distributions without requiring ground truth labels.

Exam trap

The trap here is that candidates often confuse performance metrics (accuracy, F1, RMSE) with distribution drift detection, not realizing that PSI specifically quantifies covariate shift without needing ground truth labels.

How to eliminate wrong answers

Option A is wrong because model accuracy on a recent holdout set measures performance degradation, not input data distribution drift; accuracy can drop due to concept drift or other factors, and it requires labeled data which may not be available in production. Option C is wrong because F1 score on the training data is a measure of model fit on historical data, not a metric for detecting changes in the input distribution of new data. Option D is wrong because root mean squared error (RMSE) on test data evaluates prediction error on a static test set, not the distributional shift between training and current production inputs.

Practice this question →

46

Multi-Selecteasy

Which TWO actions are most appropriate for managing model drift in a production AI system?

Select 2 answers

A.Freeze the model to prevent any changes

B.Roll back to a previous model version if performance degrades

C.Periodically retrain the model on recent data

D.Manually review all model predictions

E.Implement automated monitoring to detect drift indicators

AnswersC, E

Regular retraining helps the model adapt to new patterns.

Why this answer

Option C is correct because periodically retraining the model on recent data is a fundamental strategy to combat model drift, ensuring the model adapts to changes in the underlying data distribution (e.g., concept drift or covariate shift). This aligns with MLOps best practices for maintaining model accuracy over time in production AI systems.

Exam trap

CompTIA often tests the distinction between reactive fixes (like rollback) and proactive, automated strategies (like monitoring and retraining), tricking candidates into choosing rollback as a valid long-term drift management action.

Practice this question →

47

Multi-Selectmedium

A financial services firm has deployed an AI model for real-time credit scoring. The operations team needs to ensure the model remains reliable and compliant over time. Which TWO actions should the team prioritize? (Choose two.)

Select 2 answers

A.Implement automated monitoring for data drift and model performance metrics.

B.Deploy a model versioning system with automated rollback capabilities.

C.Establish a governance process for version-controlled model deployment and retraining.

D.Schedule monthly manual retraining of the model using historical data.

E.Generate weekly compliance reports for regulatory review.

AnswersA, C

Monitoring data drift and performance metrics is proactive and addresses the root cause of model degradation.

Why this answer

Option A is correct because monitoring for data drift helps detect when the distribution of input features changes, which can degrade model performance. Option D is correct because version-controlled retraining ensures reproducibility and auditability. Option B is wrong because implementing automated rollback is reactive, not proactive; monitoring is more fundamental.

Option C is wrong while useful, manual retraining on fixed intervals does not adapt to drift as effectively as drift-triggered retraining. Option E is wrong because compiling frequent reports creates overhead without directly ensuring model reliability.

Practice this question →

48

MCQmedium

A healthcare AI startup has developed a model to detect diabetic retinopathy from retinal images. The model achieved 96% sensitivity and 94% specificity on a validation set from the same distribution as the training data. After deployment in a rural clinic, the model's sensitivity drops to 80%. The data team analyzes the clinical images from the clinic and finds that the images have lower resolution and different lighting conditions compared to the training dataset. The team has the ability to collect more data from the clinic and retrain the model. What is the BEST course of action?

A.Reduce the model's complexity by removing several convolutional layers to improve generalization.

B.Apply transfer learning using a model pre-trained on a different medical imaging dataset.

C.Implement adversarial validation to identify which images are out-of-distribution and filter them out.

D.Collect additional retinal images from the rural clinic, label them, and retrain the model including the new data.

AnswerD

Adding data from the target domain re-aligns the model with the deployment environment.

Why this answer

Option D is correct because the performance drop is caused by a domain shift (lower resolution, different lighting) between the training and deployment data. The most direct and effective solution is to collect labeled images from the target domain (rural clinic) and retrain the model, which aligns with the principle of domain adaptation through data augmentation. This approach addresses the root cause by exposing the model to the actual distribution it will encounter in production.

Exam trap

CompTIA often tests the misconception that reducing model complexity or using generic transfer learning can fix domain shift, when in reality the most reliable solution is to retrain with data from the target deployment environment.

How to eliminate wrong answers

Option A is wrong because reducing model complexity (e.g., removing convolutional layers) would likely decrease capacity to learn domain-specific features, potentially worsening performance rather than fixing the domain shift. Option B is wrong because transfer learning from a different medical imaging dataset (e.g., X-rays or MRIs) may not help if the source domain still differs significantly from the rural clinic's retinal images; it could introduce irrelevant features or negative transfer. Option C is wrong because adversarial validation only identifies out-of-distribution samples but does not improve model performance on those samples; filtering them out would reduce the usable data and fail to address the need for the model to work on the clinic's images.

Practice this question →

49

MCQmedium

Refer to the exhibit. A team created an access policy for a fraud detection model endpoint. An intern reports being unable to access the model for testing. Reviewing the policy, what is the most likely cause?

A.The intern's role is not included in the allowed roles

B.The policy JSON has a syntax error

C.The endpoint path is incorrect

D.The intern's role is explicitly denied in the policy

AnswerD

Denied roles override any allowed list.

Why this answer

Option D is correct because the exhibit shows an explicit `Deny` effect for the intern's role in the policy. In AWS IAM (or similar cloud provider) access policies, an explicit deny overrides any allow, so even if the intern's role is listed in allowed roles, the explicit deny will block access. This is a fundamental principle of IAM policy evaluation logic.

Exam trap

CompTIA often tests the explicit deny override principle, where candidates mistakenly think that listing a role in allowed roles guarantees access, ignoring that an explicit deny in the same policy will block it.

How to eliminate wrong answers

Option A is wrong because the intern's role is actually listed in the allowed roles section, so the issue is not a missing role. Option B is wrong because the policy JSON is syntactically valid (no missing commas, brackets, or quotes) and would parse correctly. Option C is wrong because the endpoint path is correctly specified in the policy's `Resource` element, matching the model endpoint ARN.

Practice this question →

50

MCQhard

An MLOps team uses a CI/CD pipeline to automate model retraining. The pipeline triggers on new labeled data, runs feature engineering, retrains the model, evaluates against a holdout set, and deploys if metrics exceed thresholds. Recently, a retrained model passed validation but caused a 5% accuracy drop in production. Which improvement best prevents this?

A.Implement canary deployment with shadow scoring to compare with current model

B.Require manual approval before deployment

C.Use the entire production dataset for validation instead of a holdout set

D.Increase the amount of training data used in each retraining cycle

AnswerA

Canary deployment allows testing on live traffic with minimal risk.

Why this answer

Option D is correct because adding canary deployment and shadow testing catches performance issues before full rollout. Option A is wrong because more training data might not help and could introduce bias. Option B is wrong because manual approval slows down pipeline.

Option C is wrong because only using full dataset for evaluation doesn't simulate production conditions.

Practice this question →

51

MCQmedium

An e-commerce company uses a machine learning model to recommend products to users. The model is retrained weekly and deployed to production. For the past three weeks, the model's click-through rate (CTR) has been stable except on Mondays, when it drops by 15%. Analysis reveals that the training data is extracted on Sundays and includes only weekday behavior. On Mondays, user behavior shifts due to weekend browsing patterns not captured in the training data. The team wants to maintain a weekly retraining cadence but fix the Monday performance drop. Which solution best addresses the Monday CTR drop without changing the retraining frequency?

A.Deploy a separate model specifically for Monday predictions

B.Modify the data pipeline to include the full week (including the past weekend) in each retraining

C.Serve the previous week's model on Mondays to use older but stable patterns

D.Change to daily retraining to include weekend data more promptly

AnswerB

Captures weekend behavior without altering frequency.

Why this answer

Option B is correct because it directly addresses the root cause: the training data excludes weekend behavior, causing the model to be blind to Monday patterns. By modifying the data pipeline to include the full week (including the past weekend) in each retraining, the model learns from weekend browsing patterns and can generalize to Monday user behavior without changing the weekly retraining cadence. This ensures the training distribution matches the inference distribution on Mondays, stabilizing CTR.

Exam trap

CompTIA often tests the misconception that changing retraining frequency (Option D) is the only way to incorporate new data, when in fact adjusting the data window within the existing cadence (Option B) is a more efficient and correct solution.

How to eliminate wrong answers

Option A is wrong because deploying a separate model for Monday predictions introduces operational complexity and does not fix the data gap; it merely treats the symptom by creating a specialized model that still lacks weekend data unless separately trained. Option C is wrong because serving the previous week's model on Mondays would use older patterns that also exclude the most recent weekend behavior, and the model would be even more stale, likely worsening the drop. Option D is wrong because changing to daily retraining alters the retraining frequency, which the team explicitly wants to maintain; it also adds unnecessary overhead and does not address the fact that the training data extraction point (Sundays) is the core issue.

Practice this question →

52

MCQmedium

A healthcare AI system that diagnoses medical images must provide explanations for its predictions to comply with regulatory requirements. Which technique should the team implement?

A.Reduce the model's accuracy to make it simpler.

B.Only deploy rule-based systems.

C.Apply model interpretability methods such as SHAP or LIME.

D.Use a more complex deep learning model.

AnswerC

These methods provide explanations for individual predictions without sacrificing model accuracy.

Why this answer

Option C is correct because SHAP (SHapley Additive exPlanations) and LIME (Local Interpretable Model-agnostic Explanations) are established model interpretability techniques that provide per-prediction explanations, which are essential for regulatory compliance in healthcare AI. These methods generate feature attribution scores or local surrogate models to explain why a specific diagnosis was made, meeting transparency requirements without sacrificing model performance.

Exam trap

The trap here is that candidates often assume complex models are inherently better for compliance, but Cisco tests the understanding that interpretability techniques are required to bridge the gap between high-performance black-box models and regulatory transparency.

How to eliminate wrong answers

Option A is wrong because reducing model accuracy to make it simpler would degrade diagnostic performance and still not guarantee interpretability; a simpler model is not inherently explainable in a regulatory sense. Option B is wrong because only deploying rule-based systems is overly restrictive and impractical for complex medical image analysis, where deep learning models often achieve superior accuracy; rule-based systems may also lack the flexibility to handle edge cases. Option D is wrong because using a more complex deep learning model typically reduces interpretability, making it harder to provide the required explanations, and does not address regulatory compliance.

Practice this question →

53

MCQeasy

A startup has developed a natural language processing model for sentiment analysis. Their CI/CD pipeline includes a step that runs unit tests on the model's output format and a validation step that checks accuracy on a static test dataset. Recently, the pipeline often fails during the validation step, but the failures are inconsistent—sometimes the same model version passes, sometimes fails. The team suspects the test dataset is small and randomly sampled. They need a reliable validation process to deploy models with confidence. Which approach should the team implement?

A.Replace the static test set with k-fold cross-validation in each pipeline run

B.Increase the accuracy threshold to 95% so only very good models pass

C.Remove the validation step and rely on unit tests only

D.Fix the test dataset to be larger and more representative, and use a statistical test to compare against baseline

AnswerD

A fixed dataset and statistical test provide consistent and objective validation.

Why this answer

Option D is correct because the core issue is that the static test dataset is too small and randomly sampled, leading to inconsistent validation results. By fixing the dataset to be larger and more representative, and using a statistical test (e.g., a paired t-test or McNemar's test) to compare the model's accuracy against a baseline, the team can reliably determine if performance changes are statistically significant, eliminating the randomness that causes pipeline failures to be inconsistent.

Exam trap

CompTIA often tests the misconception that increasing the accuracy threshold or using cross-validation alone can fix validation instability, when the real solution is to address the root cause of small, non-representative test data with statistical rigor.

How to eliminate wrong answers

Option A is wrong because k-fold cross-validation is computationally expensive and time-consuming for a CI/CD pipeline, and it does not directly address the root cause of a small, randomly sampled test set; it would still suffer from variance if the dataset is small. Option B is wrong because simply raising the accuracy threshold to 95% does not fix the underlying inconsistency from a small test set; it may cause even more frequent failures due to random sampling noise, and it does not provide a statistical basis for decision-making. Option C is wrong because removing the validation step entirely would allow models with poor accuracy to be deployed, undermining the goal of deploying with confidence; unit tests alone cannot assess model performance.

Practice this question →

54

MCQeasy

A data scientist trains a regression model and notices the training loss is low but validation loss is high. Which technique should be applied FIRST to address this issue?

A.Increase the learning rate.

B.Add more layers to the neural network.

C.Increase the size of the training dataset.

D.Apply L1 or L2 regularization to the model.

AnswerD

Regularization penalizes large weights, reducing overfitting.

Why this answer

The scenario describes overfitting, where the model memorizes the training data but fails to generalize to unseen data. Applying L1 or L2 regularization (Option D) is the correct first step because it adds a penalty to the loss function for large weights, discouraging complexity and reducing overfitting without requiring additional data or architectural changes.

Exam trap

CompTIA often tests the distinction between overfitting and underfitting, and the trap here is that candidates may incorrectly choose to increase dataset size (Option C) as the first action, when regularization is the more immediate and practical first step to address overfitting without requiring new data collection.

How to eliminate wrong answers

Option A is wrong because increasing the learning rate would make training more unstable and could cause the loss to diverge, worsening both training and validation performance. Option B is wrong because adding more layers increases model capacity, which exacerbates overfitting when the training loss is already low and validation loss is high. Option C is wrong because increasing the size of the training dataset can help reduce overfitting, but it is not the first technique to apply; regularization is a simpler, more immediate fix that does not require collecting new data.

Practice this question →

55

MCQmedium

An AI operations team notices that the accuracy of a deployed fraud detection model has been declining over the past month. Which action should the team take to address this issue proactively?

A.Retrain the model with the most recent data immediately.

B.Manually update the model weights weekly.

C.Replace the model with a rule-based system.

D.Set up automated retraining pipeline triggered by performance degradation thresholds.

AnswerD

This allows continuous monitoring and automated response to drift, keeping the model accurate.

Why this answer

Option D is correct because it establishes an automated retraining pipeline triggered by performance degradation thresholds, which aligns with MLOps best practices for maintaining model accuracy in production. This proactive approach ensures the model is retrained when its performance drops below a predefined metric (e.g., AUC or F1 score), without requiring manual intervention. It addresses concept drift, which is a common cause of declining accuracy in deployed fraud detection models.

Exam trap

CompTIA often tests the misconception that retraining with the most recent data immediately is the best proactive action, when in fact automated threshold-based retraining is the correct MLOps practice to avoid overfitting and ensure controlled updates.

How to eliminate wrong answers

Option A is wrong because retraining with the most recent data immediately may introduce data leakage or overfit to recent noise, and it does not address the root cause of performance degradation (e.g., concept drift) in a controlled manner. Option B is wrong because manually updating model weights weekly is not a scalable or reliable practice; it introduces human error and does not leverage automated monitoring or drift detection. Option C is wrong because replacing a machine learning model with a rule-based system would likely reduce the model's ability to detect complex fraud patterns, and it ignores the potential to retrain or update the existing model.

Practice this question →

56

MCQeasy

A model serving endpoint is tested using curl commands. Based on the exhibit, what is the most likely issue?

A.The server is returning HTTP 500 errors

B.The input features are malformed

C.The model is experiencing intermittent high latency leading to timeouts

D.The model is not deployed on the server

AnswerC

The third request timed out, suggesting occasional performance degradation.

Why this answer

The exhibit shows that the first curl request succeeds (HTTP 200), but subsequent requests fail with 'curl: (28) Operation timed out' after the default timeout of 30 seconds. This pattern of intermittent success followed by timeouts is characteristic of a model experiencing high latency spikes, not a persistent server error or configuration issue. The server is reachable and the model responds correctly some of the time, ruling out deployment or malformed input issues.

Exam trap

CompTIA often tests the distinction between persistent errors (like 500 or 404) and intermittent timeout failures, where candidates mistakenly attribute timeouts to server errors or input issues rather than recognizing the pattern of variable latency.

How to eliminate wrong answers

Option A is wrong because the exhibit shows HTTP 200 responses for successful requests, not HTTP 500 errors; a server returning 500 errors would consistently fail with a 5xx status code, not timeouts. Option B is wrong because the first request succeeds, proving the input features are correctly formatted and accepted by the model; malformed features would cause persistent failures across all requests. Option D is wrong because the successful first request confirms the model is deployed and serving predictions; an undeployed model would return a 404 or 503 error, not a timeout after a successful response.

Practice this question →

57

MCQhard

An ML team monitors a production model using a dashboard that shows daily performance metrics. Over the past month, the model's accuracy has dropped from 92% to 87%, while the data distribution of input features has remained stable according to statistical tests. Which type of model drift is most likely occurring?

A.Data drift (covariate shift)

B.Model decay

C.Overfitting

D.Concept drift

AnswerD

Concept drift changes the mapping from inputs to outputs, reducing accuracy.

Why this answer

Concept drift occurs when the relationship between input features and the target variable changes, even if the input data distribution remains stable. In this scenario, the model's accuracy declines from 92% to 87% while input feature distributions are unchanged, indicating that the underlying mapping from features to labels has shifted—a classic sign of concept drift.

Exam trap

CompTIA often tests the distinction between data drift and concept drift by presenting a scenario where input distributions are stable but model performance degrades, leading candidates to mistakenly choose data drift (covariate shift) because they focus on the input features rather than the label relationship.

How to eliminate wrong answers

Option A is wrong because data drift (covariate shift) refers to changes in the distribution of input features, which the question explicitly states has remained stable according to statistical tests. Option B is wrong because model decay is a general term for performance degradation over time, but it is not a specific type of drift; the question asks for the type of drift, and concept drift is the precise classification. Option C is wrong because overfitting is a training-time issue where a model fits noise in the training data, leading to poor generalization on new data; it does not explain a gradual performance drop in production while input distributions remain stable.

Practice this question →

58

Multi-Selectmedium

Which TWO are best practices for deploying AI models in a containerized production environment? (Select TWO.)

Select 2 answers

A.Always pull the latest image tag for automatic updates

B.Store model artifacts inside the container image for portability

C.Use an orchestration platform like Kubernetes for scaling and health management

D.Package the model and its dependencies into a single container image

E.Configure JVM heap arguments inside the container if using Java

AnswersC, D

Kubernetes provides automated scaling and self-healing.

Why this answer

Option C is correct because orchestration platforms like Kubernetes provide automated scaling, self-healing, and rolling updates for containerized AI models. Kubernetes uses liveness and readiness probes to monitor model health and restart failed containers, ensuring high availability in production.

Exam trap

CompTIA often tests the distinction between containerization best practices (e.g., immutable images, external model storage) and generic software deployment habits (e.g., using latest tags, embedding data), so candidates mistakenly select options that seem convenient but violate production reliability principles.

Practice this question →

59

MCQmedium

An AI system experiences degraded accuracy over time due to changes in user behavior. Which monitoring metric should be prioritized to detect this issue earliest?

A.API response latency

B.Data drift detection on input features

C.Area under the ROC curve (AUC)

D.Model accuracy on a holdout validation set

AnswerB

Data drift detects changes before performance degrades.

Why this answer

Option B is correct: Data drift detection monitors changes in input distribution, which often precedes accuracy drop. Option A is wrong because accuracy is a lagging indicator. Option C is wrong because latency doesn't reflect data shift.

Option D is wrong because AUC is also lagging.

Practice this question →

60

Multi-Selecteasy

A data scientist is monitoring a deployed image classification model. Which TWO actions are best practices for detecting model drift? (Choose 2.)

Select 2 answers

A.Schedule automatic weekly retraining of the model.

B.Increase the model's complexity to improve generalization.

C.Use a holdout test set to periodically evaluate model accuracy.

D.Monitor the average prediction confidence of the model.

E.Track the distribution of input data over time.

AnswersC, E

Comparing performance on a static test set reveals concept drift.

Why this answer

Option C is correct because periodically evaluating the model on a holdout test set that reflects the current production data distribution is a direct method to detect accuracy degradation caused by model drift. This approach measures whether the model's performance on unseen data has declined over time, which is a key indicator of drift.

Exam trap

CompTIA often tests the distinction between detection and remediation actions, so candidates mistakenly choose retraining (Option A) as a detection method when it is actually a corrective action.

Practice this question →

61

Multi-Selecteasy

A data scientist is tuning a deep learning model. Which TWO hyperparameters directly affect the model's capacity to overfit?

Select 2 answers

A.Number of layers in the network.

B.Batch size.

C.Optimizer choice (e.g., SGD vs Adam).

D.Dropout rate.

E.Learning rate.

AnswersA, D

More layers increase capacity, raising overfitting risk.

Why this answer

Option A is correct because increasing the number of layers increases the model's depth, which expands its representational capacity and allows it to learn more complex patterns, including noise, thereby directly increasing overfitting risk. Option D is correct because dropout is a regularization technique that randomly drops neurons during training; a low dropout rate (e.g., 0.0) removes this regularization, while a high rate (e.g., 0.5) reduces overfitting by preventing co-adaptation of neurons.

Exam trap

CompTIA often tests the distinction between hyperparameters that affect model capacity (number of layers, dropout rate) versus those that affect training dynamics (batch size, optimizer, learning rate), leading candidates to mistakenly select learning rate or batch size as direct overfitting controls.

Practice this question →

62

MCQhard

A company deploys an AI model for loan approval. The model shows bias against a protected group. The team decides to use adversarial debiasing. What is the PRIMARY advantage of this approach?

A.It guarantees the model's predictions are private.

B.It reduces bias while preserving predictive performance by learning representations that are invariant to sensitive attributes.

C.It is simpler to implement than pre-processing techniques.

D.It ensures equal approval rates across all groups.

AnswerB

This is the core benefit of adversarial debiasing.

Why this answer

Adversarial debiasing is an in-processing technique that trains a primary model to predict the target (e.g., loan approval) while simultaneously training an adversary to predict the sensitive attribute from the model's learned representations. The primary model is penalized when the adversary succeeds, forcing it to learn representations that are invariant to the sensitive attribute. This reduces bias while preserving predictive performance because the model retains the ability to learn task-relevant patterns that are not correlated with the protected attribute.

Exam trap

The trap here is that candidates confuse 'reducing bias' with 'ensuring equal outcomes' (demographic parity), but adversarial debiasing targets equalized odds or equal opportunity by focusing on representation invariance, not strict rate equality.

How to eliminate wrong answers

Option A is wrong because adversarial debiasing does not guarantee privacy; it addresses fairness, not confidentiality, and does not provide differential privacy or encryption. Option C is wrong because adversarial debiasing is an in-processing technique that is generally more complex to implement than pre-processing techniques like reweighing or sampling, which modify the dataset before training. Option D is wrong because adversarial debiasing aims to reduce bias by learning invariant representations, but it does not enforce equal approval rates across groups; equal approval rates would be demographic parity, which is a different fairness metric and may not align with the model's predictive performance.

Practice this question →

63

MCQhard

Based on the exhibit, what is the most likely cause of the accuracy drop?

A.A required feature is missing from the production data pipeline.

B.Data drift in the 'income' feature has caused the model to become less accurate.

C.The model was overfitted to the training data.

D.The model's confidence threshold needs to be adjusted.

AnswerB

The detected distribution shift for 'income' indicates data drift, a common cause of performance degradation.

Why this answer

The exhibit shows a sudden and sustained drop in model accuracy coinciding with a shift in the distribution of the 'income' feature. This is a classic symptom of data drift, where the statistical properties of the input feature change over time, causing the model's learned patterns to no longer match the production data. Option B correctly identifies this as the most likely cause because the model was trained on a prior income distribution and is now encountering values outside that range.

Exam trap

CompTIA often tests the distinction between data drift and model overfitting by presenting a sudden accuracy drop after stable performance, leading candidates to incorrectly attribute it to overfitting when the exhibit clearly shows a distribution shift in a specific feature.

How to eliminate wrong answers

Option A is wrong because a missing feature in the production data pipeline would typically cause a pipeline failure or missing-value error, not a gradual accuracy drop that correlates with a specific feature's distribution shift. Option C is wrong because overfitting would manifest as high training accuracy with poor generalization from the start, not a sudden accuracy drop after a period of stable performance; the exhibit shows a clear change point, not a consistently low accuracy. Option D is wrong because adjusting the confidence threshold changes the precision-recall trade-off but does not address the underlying cause of the model's predictions becoming less reliable due to shifted input distributions; it would not restore the original accuracy level.

Practice this question →

64

Multi-Selectmedium

Which TWO actions should be taken to ensure an AI model complies with GDPR requirements when processing personal data?

Select 2 answers

A.Limit data collection to only what is necessary for the model

B.Provide a full explanation of model predictions

C.Store all user data for a minimum of 10 years

D.Anonymize all personal data before use

E.Implement user data deletion upon request

AnswersA, E

Data minimization is a GDPR principle.

Why this answer

Option A is correct because GDPR's data minimization principle (Article 5(1)(c)) requires that personal data collected be adequate, relevant, and limited to what is necessary for the purpose for which it is processed. In AI model training, this means collecting only the features essential for the model's objective, reducing the risk of processing excessive or irrelevant personal data.

Exam trap

CompTIA often tests the misconception that anonymization is always required before any AI processing of personal data, but GDPR allows processing under lawful bases without anonymization, making Option D a tempting but incorrect choice.

Practice this question →

65

MCQmedium

A data science team uses Git for version control of model code and DVC for data versioning. They want to implement a model registry to track trained models, their hyperparameters, and performance metrics. Which tool is specifically designed for this purpose and integrates with the existing workflow?

A.Apache Airflow

B.Docker

C.MLflow Model Registry

D.Kubernetes

AnswerC

MLflow provides a model registry that stores model versions and metadata.

Why this answer

MLflow Model Registry is specifically designed for managing model versions, tracking metadata, and integrating with Git and DVC. Apache Airflow is for workflow orchestration, not model registry. Kubernetes is for container orchestration.

Docker is for containerization.

Practice this question →

66

Multi-Selecthard

An AI operations team is monitoring a deployed image classification model. They notice a gradual increase in prediction confidence but a drop in accuracy. Which THREE actions should they take to diagnose the issue?

Select 3 answers

A.Analyze the model's calibration curve to see if confidence scores align with actual accuracy.

B.Increase the size of the training dataset by collecting more unlabeled data.

C.Compare the distribution of input features between training and recent production data.

D.Evaluate model performance on a held-out test set collected at deployment time.

E.Retrain the model immediately with the most recent data.

AnswersA, C, D

Calibration analysis reveals if model is overconfident due to drift.

Why this answer

Option A is correct because a calibration curve (reliability diagram) directly compares predicted confidence scores against actual accuracy. In this scenario, increasing confidence with dropping accuracy indicates miscalibration—the model is becoming overconfident. Analyzing the calibration curve reveals whether the confidence scores systematically deviate from true probabilities, which is the core diagnostic step for this specific symptom.

Exam trap

CompTIA often tests the distinction between diagnostic actions and corrective actions—candidates mistakenly jump to retraining (Option E) or data collection (Option B) instead of first analyzing calibration and data distribution (Options A, C, D) to identify the specific type of drift or miscalibration.

Practice this question →

67

MCQhard

A retail company uses a time-series model to forecast daily sales for inventory management. The model is a seasonal ARIMA trained on three years of daily data. It performed well during initial validation but after deployment, forecasts became inaccurate during holiday seasons, often underestimating demand by up to 40%. The data science team examined the features and found that the training data did not include any holiday indicators. They also discovered that the model's residuals show strong autocorrelation during holiday weeks. The company needs to improve the forecast for the upcoming holiday season. They have access to historical sales data with holiday dates and are considering several approaches. Which approach will BEST address the issue?

A.Engineer holiday-related features (e.g., holiday flag, days before/after) and retrain the ARIMA model with these features as exogenous variables.

B.Decrease the learning rate and increase the number of training epochs.

C.Create an ensemble of the ARIMA model and a seasonal naive model to average forecasts.

D.Replace the ARIMA model with a deep learning LSTM model trained on the same data.

AnswerA

Incorporating holiday information directly addresses the root cause of the forecast error.

Why this answer

The core issue is that the ARIMA model lacks holiday-related information, causing systematic underestimation during holiday periods. By engineering holiday features (e.g., binary flags, days-before/after indicators) and including them as exogenous variables in a SARIMAX model, the model can directly learn the demand spikes associated with holidays. This directly addresses the residual autocorrelation during holiday weeks and improves forecast accuracy without changing the underlying time-series framework.

Exam trap

CompTIA often tests the misconception that adding more data or switching to a more complex model (like LSTM) automatically fixes forecasting issues, when the real problem is missing relevant features that can be addressed with a simpler, interpretable approach like SARIMAX with exogenous variables.

How to eliminate wrong answers

Option B is wrong because decreasing the learning rate and increasing epochs is a hyperparameter tuning technique for gradient-based models (e.g., neural networks), not applicable to ARIMA which is trained via maximum likelihood estimation or least squares. Option C is wrong because averaging forecasts from an ARIMA and a seasonal naive model would dilute the ARIMA's signal and still fail to capture holiday effects, as the naive model also lacks holiday features. Option D is wrong because replacing ARIMA with an LSTM does not automatically solve the missing holiday indicator problem; the LSTM would also need engineered holiday features to learn the pattern, and the question asks for the best approach given available data, not a complete model swap.

Practice this question →

68

MCQmedium

A data scientist submits a model training job to a cloud ML platform. The job fails with an error: "Out of memory: Killed process." The training code is proven to work on the developer's local machine with 16GB RAM. The cloud instance has 32GB RAM. What is the most likely cause?

A.The cloud instance does not have enough memory

B.The training code is loading the entire dataset into memory without batch processing

C.The training model is too large for the GPU memory

D.The cloud instance ran out of disk space

AnswerB

If the dataset is larger than memory, the process may exhaust RAM even if instances have more memory than local.

Why this answer

Option B is correct because the error 'Out of memory: Killed process' occurs when the operating system's OOM killer terminates a process that exceeds available RAM. Even though the cloud instance has 32GB RAM, if the training code loads the entire dataset into memory without batch processing (e.g., using pandas.read_csv() without chunking), it can consume far more memory than the dataset size due to Python object overhead, temporary copies, and intermediate tensors. The local machine with 16GB RAM may have worked due to a smaller dataset or different memory pressure, but the cloud instance's 32GB is insufficient if the code is not memory-efficient.

Exam trap

CompTIA often tests the misconception that 'more RAM always solves memory errors', but the trap here is that the error is caused by inefficient data loading (no batching) rather than absolute memory size, so candidates must recognize the OOM killer message as a symptom of memory exhaustion from unbounded data consumption.

How to eliminate wrong answers

Option A is wrong because the cloud instance has 32GB RAM, which is double the local machine's 16GB, so the error is not simply due to insufficient memory—it's a code inefficiency. Option C is wrong because the error message 'Out of memory: Killed process' is a Linux kernel OOM killer message related to system RAM, not GPU memory; GPU out-of-memory errors typically appear as CUDA errors (e.g., 'CUDA out of memory'). Option D is wrong because disk space exhaustion would produce errors like 'No space left on device' or 'Disk quota exceeded', not an OOM killer process termination.

Practice this question →

69

MCQhard

A financial institution needs to integrate an AI-based credit scoring model into an existing mainframe system that processes transactions in COBOL. The model is deployed as a REST API. What is the best strategy to ensure minimal disruption and maintain data integrity?

A.Copy all transaction data to a cloud database for the model to access.

B.Use an API gateway with versioning and circuit breaker patterns.

C.Rewrite the mainframe system in Java to directly call the model.

D.Install a GPU on the mainframe to run the model natively.

AnswerB

This pattern enables loose coupling, resilience, and gradual integration without major changes to the mainframe.

Why this answer

Option B is correct because an API gateway with versioning and circuit breaker patterns allows the COBOL mainframe to call the REST API without modifying its core transaction logic. The gateway handles protocol translation, rate limiting, and failover, ensuring minimal disruption to the existing mainframe system while maintaining data integrity through transactional consistency and graceful degradation.

Exam trap

CompTIA often tests the misconception that legacy systems must be replaced or heavily modified to integrate with modern AI services, when in fact an API gateway provides a non-invasive integration layer that preserves the existing infrastructure.

How to eliminate wrong answers

Option A is wrong because copying all transaction data to a cloud database introduces latency, potential data inconsistency, and security risks, and does not address the integration between COBOL and the REST API. Option C is wrong because rewriting the mainframe system in Java is a massive, high-risk, and costly undertaking that would cause significant disruption and is unnecessary for integrating a REST API. Option D is wrong because installing a GPU on the mainframe does not enable native execution of the AI model, as mainframes lack the necessary software stack and the model is deployed as a REST API, not as a local executable.

Practice this question →

70

Multi-Selectmedium

Which TWO techniques should be considered when optimizing a deep learning model for deployment on edge devices with limited computational resources?

Select 2 answers

A.Apply adversarial training

B.Model quantization

C.Use a GPU for inference

D.Knowledge distillation

E.Increase the number of layers

AnswersB, D

Quantization reduces memory and computation requirements.

Why this answer

Model quantization reduces the precision of the model's weights and activations (e.g., from 32-bit floating point to 8-bit integer), which significantly decreases memory footprint and computational latency. This makes it a primary technique for deploying deep learning models on edge devices with limited resources.

Exam trap

CompTIA often tests the distinction between training-phase techniques (like adversarial training) and deployment-phase optimization techniques (like quantization and knowledge distillation), leading candidates to select options that improve model quality rather than reduce resource consumption.

Practice this question →

71

MCQeasy

A hospital's radiology department uses an AI model to detect lung nodules in CT scans. The model was trained on data from a specific brand of scanners and patient demographics common in Europe. Recently, the hospital acquired new scanners from a different manufacturer and started serving a more diverse patient population. Over the past month, the model's false-positive rate has increased by 15% and false-negative rate by 8%. The radiologists are losing confidence and are considering abandoning the AI tool altogether. The IT team has verified that the model inference is running correctly and the hardware is performing as expected. The data science team suspects the problem is related to the change in input data distribution. The hospital's AI operations policy requires that any model update must be validated on at least 500 recent cases before deployment. What is the BEST course of action for the AI operations team?

A.Roll back to the previous model version and restrict use of the AI tool to only European patients.

B.Collect 500 recent CT scans from the new scanners, retrain the model on a combined old and new dataset, and validate before deployment.

C.Retrain the model using the original training data but with increased regularization to avoid overfitting.

D.Adjust the model's decision threshold to reduce false positives and then monitor for two weeks.

AnswerB

Retraining with a representative sample addresses the data drift and meets the policy requirement of 500 validation cases.

Why this answer

Option C is correct because collecting 500 recent CT scans that represent the new scanner and patient distribution allows for validation of a retrained model that adapts to the new data. Option A is wrong because adjusting thresholds only addresses false-positive rate but not false-negative rate, and does not tackle the root cause. Option B is wrong because rolling back to the previous model where performance was good on old data would not improve performance on the new data.

Option D is wrong because retraining on the original training data would not include the new distribution and likely yield the same issues.

Practice this question →

72

MCQeasy

A small business launched a customer support chatbot powered by a pre-trained language model. The chatbot was fine-tuned on a dataset of past support tickets. For the first week, it performed well, accurately answering 85% of queries. After a routine software update that included a new version of the underlying language model library, the chatbot's accuracy dropped to 60% and it began giving nonsensical responses to some questions. The update did not change any code or configuration specific to the chatbot. The business has a backup of the previous environment. What is the MOST appropriate immediate action?

A.Retrain the chatbot on the original dataset using the new library version.

B.Add more intents to the chatbot's configuration to cover the errors.

C.Increase the model's temperature parameter to 1.5 to encourage more varied responses.

D.Roll back the software update to the previous version of the language model library.

AnswerD

The update likely introduced a regression; rolling back restores functionality immediately.

Why this answer

The most appropriate immediate action is to roll back the software update to the previous version of the language model library (Option D). The accuracy drop and nonsensical responses are directly caused by the library update, which likely changed internal model behavior (e.g., tokenization, attention mechanisms, or default hyperparameters) without any code or configuration changes. Restoring the previous environment immediately resolves the issue and allows the business to investigate the library changes in a controlled manner.

Exam trap

CompTIA often tests the misconception that retraining or adjusting hyperparameters can fix a regression caused by an underlying library change, when the immediate and correct action is to roll back to the known-good environment.

How to eliminate wrong answers

Option A is wrong because retraining the chatbot on the original dataset using the new library version does not address the root cause—the library update itself may have altered model internals (e.g., tokenizer version, default parameters) that cannot be fixed by retraining alone, and retraining is time-consuming and not an immediate fix. Option B is wrong because adding more intents does not resolve the underlying model behavior change; the chatbot is producing nonsensical responses, not missing intents, so intent expansion is irrelevant to the core issue. Option C is wrong because increasing the temperature parameter to 1.5 would make responses more random and less coherent, worsening the nonsensical outputs; temperature controls output randomness, not model correctness or library compatibility.

Practice this question →

73

Multi-Selectmedium

A DevOps team is deploying a machine learning model using a CI/CD pipeline. They want to ensure the model is reproducible and traceable. Which TWO practices should they implement?

Select 2 answers

A.Version the training dataset and code using Git and DVC.

B.Manually deploy the model to production after approval.

C.Store only the final model artifact in a shared drive.

D.Use a spreadsheet to record model version numbers.

E.Package the model in a Docker container with a fixed base image.

AnswersA, E

Version control for data and code ensures exact reproduction of training.

Why this answer

Option A and D are correct because versioning data and code, and using containers ensure reproducibility. Option B is wrong because manual deployment is error-prone and not traceable. Option C is wrong as it does not track the model.

Option E is wrong because manual tagging is inconsistent.

Practice this question →

74

Multi-Selecteasy

An AI system is being implemented in a healthcare setting. Which TWO ethical considerations should be prioritized?

Select 2 answers

A.Ensuring the model does not exhibit racial or gender bias

B.Maximizing cost reduction for the hospital

C.Providing explainable predictions to doctors

D.Replacing human judgment entirely with AI

E.Using open-source models to reduce licensing costs

AnswersA, C

Fairness is a core ethical concern.

Why this answer

Option A is correct because AI systems in healthcare must avoid racial or gender bias to ensure equitable treatment outcomes. Biased models can lead to misdiagnosis or unequal care, violating ethical standards and regulatory requirements like HIPAA. Prioritizing fairness aligns with responsible AI implementation in clinical settings.

Exam trap

CompTIA often tests the distinction between ethical priorities and operational or financial goals, tricking candidates into selecting cost-saving or efficiency options instead of fairness and explainability.

Practice this question →

75

MCQhard

A team trained a ResNet-50 model with the configuration shown. The high training accuracy and lower validation accuracy suggest overfitting. Which change to the training configuration is MOST likely to reduce overfitting?

A.Reduce number of epochs to 5.

B.Increase batch size to 64.

C.Increase learning rate to 0.01.

D.Add dropout layers after convolutional layers.

AnswerD

Dropout randomly drops neurons, reducing co-adaptation.

Why this answer

Adding dropout layers after convolutional layers is a regularization technique that randomly drops a fraction of neurons during training, which forces the network to learn more robust features and reduces overfitting. This directly addresses the symptom of high training accuracy with lower validation accuracy by preventing the model from relying too heavily on specific neurons.

Exam trap

CompTIA often tests the misconception that increasing batch size or reducing epochs directly fixes overfitting, when in fact these changes can harm convergence or underfit, while regularization techniques like dropout are the correct solution.

How to eliminate wrong answers

Option A is wrong because reducing the number of epochs to 5 would likely lead to underfitting, as the model would not have enough training iterations to converge, and it does not address the root cause of overfitting. Option B is wrong because increasing batch size to 64 can actually reduce the stochasticity of gradient updates, potentially leading to sharper minima and worse generalization, which may exacerbate overfitting. Option C is wrong because increasing the learning rate to 0.01 can cause the optimizer to overshoot minima and destabilize training, and it does not provide regularization to combat overfitting.

Practice this question →

Page 1 of 2 · 103 questions totalNext →

Ready to test yourself?

Try a timed practice session using only AI Implementation and Operations questions.

Start 20-question session