Knowledge + Practice

CompTIA AI+ AI0-001 (AI0-001) — Questions 976–1000

1000 questions total · 14pages · All types, answers revealed

Take a mock exam Exam hub

Page 14 of 14

976

MCQmedium

A bank uses an AI system for credit scoring. To meet fairness requirements, they want to ensure the model predicts similar outcomes for individuals who are similar with respect to the target variable, regardless of protected attributes. Which fairness metric addresses this?

A.Demographic parity

B.Calibration

C.Individual fairness

D.Equalised odds

AnswerC

Individual fairness mandates that similar individuals be treated similarly.

Why this answer

Individual fairness requires that similar individuals (with respect to legitimate features) receive similar predictions. Demographic parity requires equal selection rates across groups. Equalized odds requires equality of true positive and false positive rates.

Calibration ensures predicted probabilities match actual outcomes across groups.

Full explanation →

977

MCQhard

A data science team is deploying a real-time fraud detection model on edge devices in retail stores. The model must infer under 10ms and fit within 50MB memory. Which combination of techniques should the team apply?

A.Model parallelism and distributed inference

B.Increase batch size and use FP16 precision

C.Train a larger model and use distillation to transfer knowledge

D.Model quantization to INT8 and pruning of low-weight connections

AnswerD

INT8 quantization reduces model size and latency; pruning eliminates unnecessary weights, meeting both memory and speed constraints.

Why this answer

Quantization reduces model precision (e.g., FP32 to INT8) to shrink memory and speed up inference, while pruning removes redundant parameters. Distillation can further compress. These are standard for edge deployment.

Full explanation →

978

Multi-Selecteasy

Which TWO of the following are best practices for securing an AI model against adversarial attacks?

Select 2 answers

A.Model pruning to reduce the number of parameters.

B.Adversarial training with perturbed examples.

C.Input sanitization and validation.

D.Increasing model complexity to capture more patterns.

E.Hyperparameter optimization using grid search.

AnswersB, C

Adversarial training exposes the model to adversarial inputs, improving robustness.

Why this answer

Option B is correct because adversarial training explicitly augments the training dataset with perturbed examples (e.g., using FGSM or PGD attacks) to teach the model to recognize and resist malicious inputs. This method directly hardens the model against evasion attacks by improving its decision boundary robustness.

Exam trap

CompTIA often tests the misconception that increasing model complexity or pruning improves security, when in fact these techniques address performance or efficiency, not adversarial robustness.

Full explanation →

979

MCQmedium

An AI model is trained to predict loan default. The training data contains 95% non-default and 5% default. Which metric is most appropriate to evaluate model performance given the imbalanced dataset?

A.Mean squared error

B.F1-score

C.Accuracy

D.R-squared

AnswerB

F1-score considers both false positives and false negatives, providing a balanced measure for minority class performance.

Why this answer

The F1-score is the harmonic mean of precision and recall, making it robust to class imbalance. In this dataset with 95% non-default and 5% default, accuracy would be misleadingly high (95%) even if the model never predicts default, while F1-score penalizes poor recall of the minority class.

Exam trap

CompTIA often tests the misconception that accuracy is always the best metric, leading candidates to overlook its failure in imbalanced scenarios where a trivial classifier can achieve high accuracy.

How to eliminate wrong answers

Option A is wrong because Mean Squared Error (MSE) is a regression metric that measures average squared differences between predicted and actual values, not suitable for binary classification tasks like loan default prediction. Option C is wrong because accuracy is misleading on imbalanced datasets; a model predicting all non-default would achieve 95% accuracy but fail to identify any actual defaults. Option D is wrong because R-squared is a regression metric that indicates the proportion of variance explained by the model, inappropriate for evaluating classification performance on imbalanced data.

Full explanation →

980

MCQeasy

An organization is deploying a machine learning model that classifies loan applications. They want to prevent an attacker from reconstructing individual customer records from the model's predictions. Which type of attack should they defend against?

A.Membership inference

B.Data poisoning

C.Model inversion

D.Adversarial example

AnswerC

Model inversion attacks aim to reconstruct training data from model outputs.

Why this answer

Model inversion attacks allow an attacker to reconstruct the original training data by analyzing the model's predictions. In this scenario, the attacker could use the model's outputs to infer sensitive details about individual loan applicants, such as income or credit history, violating privacy. Defending against model inversion is critical when predictions can be used to reverse-engineer private training records.

Exam trap

Cisco often tests the distinction between model inversion (reconstructing data) and membership inference (detecting presence of data), so the trap here is confusing the goal of reconstructing records with simply inferring membership.

How to eliminate wrong answers

Option A is wrong because membership inference attacks aim to determine whether a specific record was part of the training dataset, not to reconstruct the actual data values. Option B is wrong because data poisoning attacks involve corrupting the training data to manipulate model behavior, not extracting or reconstructing existing records. Option D is wrong because adversarial example attacks craft malicious inputs to cause misclassification, not to reconstruct training data from predictions.

Full explanation →

981

MCQhard

A data scientist is training a model to detect fraudulent transactions. To protect customer privacy, the team wants to ensure that the model does not inadvertently memorize and reveal sensitive information about individuals in the training set. Which technique should be applied during training?

A.Differential privacy

B.Federated learning

C.Homomorphic encryption

D.Model quantization

AnswerA

Differential privacy adds noise during training to bound the influence of any single data point, protecting individual privacy.

Why this answer

Differential privacy is the correct technique because it adds calibrated noise to the training process or output, ensuring that the model cannot infer whether any specific individual's data was included in the training set. This directly addresses the goal of preventing memorization and leakage of sensitive information while still allowing the model to learn useful patterns for fraud detection.

Exam trap

Cisco often tests the misconception that federated learning alone guarantees privacy, when in fact it only addresses data locality and must be combined with differential privacy to prevent model inversion or membership inference attacks.

How to eliminate wrong answers

Option B (Federated learning) is wrong because it focuses on training models across decentralized data without sharing raw data, but it does not inherently prevent the model from memorizing individual records; additional privacy techniques like differential privacy are needed. Option C (Homomorphic encryption) is wrong because it enables computation on encrypted data, protecting data in transit or at rest, but it does not address model memorization or inference of training data from the model's outputs. Option D (Model quantization) is wrong because it reduces the precision of model weights to improve efficiency, but it has no effect on privacy or preventing memorization of sensitive information.

Full explanation →

982

MCQhard

A healthcare AI model is subject to GDPR because it processes data of EU patients. The model makes automated decisions about treatment plans. Under GDPR, patients have the right to obtain an explanation of the decision. Which approach BEST satisfies this 'right to explanation'?

A.Generate a SHAP-based explanation showing the contribution of each input feature to the decision

B.Offer the patient the option to have the decision reviewed by a human

C.State that the decision was made by an AI system and cannot be explained

D.Provide a copy of the model's source code to the patient

AnswerA

SHAP provides a local explanation that is both interpretable and meets the requirement to explain the decision's logic.

Why this answer

GDPR's right to explanation requires meaningful information about the logic involved in automated decisions. Providing a clear, understandable explanation using techniques like SHAP fulfills this. A simplified summary may be needed, but the explanation must be substantive.

Pointing to model cards or a confidence score alone is insufficient.

Full explanation →

983

MCQhard

An organization wants to implement an AI ethics board. Which composition best ensures independence and expertise?

A.All members from the legal department

B.IT department head and data scientists

C.Mix of internal stakeholders and external ethicists

D.Only senior executives from the company

AnswerC

Ensures independence and diverse expertise.

Why this answer

Option C is correct because an AI ethics board must combine internal stakeholders (who understand organizational context, data flows, and operational constraints) with external ethicists (who provide independent, unbiased perspectives and specialized knowledge of ethical frameworks like IEEE Ethically Aligned Design or the EU AI Act). This composition ensures the board can evaluate AI systems for bias, fairness, and transparency without being dominated by business or technical interests, which is critical for maintaining trust and regulatory compliance.

Exam trap

Cisco often tests the misconception that technical expertise alone (Option B) is sufficient for AI ethics governance, but the trap is that independence and multidisciplinary perspectives are explicitly required to avoid conflicts of interest and ensure comprehensive ethical evaluation.

How to eliminate wrong answers

Option A is wrong because a board composed solely of legal department members lacks the technical expertise to assess AI model behavior, data provenance, and algorithmic bias, and may focus narrowly on legal compliance rather than broader ethical principles. Option B is wrong because IT department heads and data scientists bring deep technical knowledge but have inherent conflicts of interest (e.g., pressure to deploy models quickly) and lack the independent ethical oversight needed to challenge internal decisions. Option D is wrong because senior executives prioritize business outcomes and shareholder value, which can compromise impartiality and lead to ethics being subordinated to profit, violating the independence required for effective governance.

Full explanation →

984

MCQmedium

A retail company uses a gradient boosting model to predict customer lifetime value (CLV). The model currently uses 50 features including purchase history, demographics, and web behavior. The model's RMSE on the test set is 120. The data science team wants to improve the model's accuracy without increasing training time significantly. They have access to additional data: customer support interaction logs (text), social media sentiment (text), and third-party credit scores (numeric). They also have the ability to perform feature engineering, hyperparameter tuning, and ensemble methods. Which approach is most likely to yield the best improvement in predictive performance with minimal increase in training time?

A.Add the customer support text as a feature using TF-IDF vectors

B.Use an ensemble of gradient boosting and random forest models

C.Perform hyperparameter tuning using grid search

D.Engineer new features such as average purchase value and recency

AnswerD

Feature engineering can capture patterns without adding new data sources or significant time.

Why this answer

Option D is correct because engineering domain-relevant features like average purchase value and recency directly captures the underlying behavioral patterns that drive customer lifetime value, often providing a higher signal-to-noise ratio than adding raw text or third-party data. This approach leverages existing data without significantly increasing the feature dimensionality or training time, unlike adding TF-IDF vectors which would dramatically expand the feature space and slow training.

Exam trap

CompTIA often tests the misconception that adding more data (especially text) or complex ensemble methods always improves model accuracy, while the correct approach is to engineer features that capture domain-specific patterns with minimal computational overhead.

How to eliminate wrong answers

Option A is wrong because adding customer support text as TF-IDF vectors would introduce thousands of sparse features, significantly increasing training time and risking overfitting without guaranteed improvement in RMSE. Option B is wrong because ensembling gradient boosting with random forest typically increases training time substantially (both models must be trained) and may not outperform a well-tuned single gradient boosting model on structured data. Option C is wrong because hyperparameter tuning using grid search is computationally expensive, often requiring many model fits, and would increase training time more than feature engineering without leveraging the new data sources.

Full explanation →

985

MCQmedium

An insurance company uses a black-box deep learning model to set premiums. Regulators demand explanation for individual decisions. Which interpretability technique should the data science team apply to generate local explanations for each prediction?

A.Attention visualization

B.Model cards

C.LIME

D.SHAP values

AnswerC

LIME is designed specifically for local, interpretable explanations of any classifier.

Why this answer

LIME (Local Interpretable Model-agnostic Explanations) is the correct choice because it generates local, interpretable explanations for individual predictions by approximating the black-box model with a simpler, interpretable surrogate model around the specific instance. This directly meets the regulatory requirement for explaining why a particular premium was set for a specific customer, without needing access to the model's internal structure.

Exam trap

The trap here is that candidates often confuse SHAP and LIME as both being local explainers, but Cisco tests the nuance that LIME is model-agnostic and simpler to implement for deep learning black-boxes, while SHAP assumes a specific game-theoretic framework that may not align with the model's actual behavior.

How to eliminate wrong answers

Option A is wrong because attention visualization is specific to transformer-based models and highlights which input parts the model 'attends to', but it does not provide a faithful, model-agnostic local explanation for any black-box deep learning model, and it requires access to attention weights which may not be available. Option B is wrong because model cards are high-level documentation that describe overall model performance, intended use, and limitations across the entire dataset, not per-instance explanations for individual decisions. Option D is wrong because SHAP values, while providing local explanations based on cooperative game theory, are computationally expensive for deep learning models and can be misleading if the model is not additive or if feature independence assumptions are violated; LIME is more practical for quick, local explanations in a regulatory context.

Full explanation →

986

Multi-Selecthard

A company uses an AI model to screen job applicants. A disparate impact analysis reveals that the model's rejection rate for a protected group is significantly higher than for others. Which THREE actions should the company take to address this?

Select 3 answers

A.Revisit training data for historical bias and consider reweighting

B.Ignore the disparity because the model is accurate overall

C.Remove all demographic attributes from the dataset

D.Apply fairness constraints or adversarial debiasing during training

E.Consider using a different model that achieves better fairness metrics

AnswersA, D, E

Addressing data bias is a fundamental step to reduce disparate impact.

Why this answer

Option A is correct because revisiting the training data for historical bias and applying reweighting directly addresses the root cause of disparate impact. If the training data contains biased labels or skewed representation of the protected group, the model will learn and amplify those biases. Reweighting adjusts the loss function to give more importance to underrepresented or disadvantaged groups, helping to equalize error rates across groups.

Exam trap

Cisco often tests the misconception that removing protected attributes (option C) is sufficient to eliminate bias, when in reality it can hide bias and still allow proxy discrimination, making it an incomplete and sometimes counterproductive solution.

Full explanation →

987

MCQhard

A data scientist notices the model overfits. Which change to the exhibit's configuration would most likely reduce overfitting?

A.Remove dropout layers

B.Increase learning rate to 0.01

C.Add L2 regularization to dense layers

D.Increase units in the first dense layer to 512

AnswerC

L2 regularization adds a penalty on large weights, discouraging complex models and reducing overfitting.

Why this answer

Adding L2 regularization to dense layers penalizes large weights by adding a squared magnitude term to the loss function, which forces the model to learn simpler patterns and reduces overfitting. This directly addresses the core issue of the model memorizing noise in the training data.

Exam trap

CompTIA often tests the misconception that increasing model capacity (more units or layers) or removing regularization always improves performance, when in fact these changes exacerbate overfitting; candidates must recognize that regularization techniques like L2 are specifically designed to penalize complexity and reduce overfitting.

How to eliminate wrong answers

Option A is wrong because removing dropout layers would actually increase overfitting, as dropout is a regularization technique that randomly drops neurons during training to prevent co-adaptation. Option B is wrong because increasing the learning rate to 0.01 (a relatively high value) can cause the optimizer to overshoot minima and lead to unstable training, but it does not directly reduce overfitting; in fact, a too-high learning rate may prevent convergence altogether. Option D is wrong because increasing units in the first dense layer to 512 adds more parameters to the model, which increases capacity and typically worsens overfitting rather than reducing it.

Full explanation →

988

Multi-Selecthard

Which TWO deployment strategies allow for testing a new model version before fully rolling it out?

Select 2 answers

A.Shadow deployment

B.Canary deployment

C.Direct cutover

D.A/B testing with traffic splitting

E.Blue-Green deployment

AnswersB, D

Canary releases route a subset of users to the new version for validation.

Why this answer

Canary deployment is correct because it routes a small percentage of live traffic to the new model version while the majority continues using the stable version. This allows real-world validation of the new model's performance and error rates under production load before a full rollout, minimizing blast radius if issues arise.

Exam trap

The trap here is that candidates confuse shadow deployment with canary deployment, mistakenly thinking shadow also tests user-facing behavior, when in fact shadow only tests infrastructure impact without validating model outputs against live user expectations.

Full explanation →

989

MCQhard

An e-commerce company deploys a recommendation model that must serve predictions with sub-100 ms latency for millions of users during peak hours. The model is a large neural network. Which architecture is most suitable?

A.Batch process predictions every hour.

B.Use a distributed system with load balancers and model replicas.

C.Deploy the model on a single powerful GPU server.

D.Use serverless functions with auto-scaling.

AnswerB

This architecture handles high traffic and meets latency requirements efficiently.

Why this answer

Option B is correct because distributing the model across multiple servers with load balancers and replicas allows horizontal scaling to handle millions of concurrent users while maintaining sub-100 ms latency. This architecture provides fault tolerance and can dynamically adjust to peak traffic loads, which is essential for real-time inference with large neural networks.

Exam trap

CompTIA often tests the misconception that a single powerful server or serverless functions can meet strict latency and throughput requirements, but the trap here is that horizontal scaling with load-balanced replicas is the only viable solution for high-concurrency, low-latency inference with large models.

How to eliminate wrong answers

Option A is wrong because batch processing predictions every hour introduces latency of up to 3600 seconds, which fails the sub-100 ms requirement and is unsuitable for real-time recommendation systems. Option C is wrong because a single powerful GPU server creates a single point of failure and cannot scale horizontally to handle millions of concurrent users during peak hours, leading to resource contention and latency spikes. Option D is wrong because serverless functions typically have cold start delays (often 100 ms to several seconds) and may not support large neural network models due to memory and execution time limits (e.g., AWS Lambda max 15 minutes, 10 GB memory), making them unsuitable for low-latency, high-throughput inference.

Full explanation →

990

MCQmedium

A team is training a image classification model. They split the dataset into training, validation, and test sets. After training, the model achieves 98% accuracy on the training set but only 72% on the test set. Which step in the AI project lifecycle should the team focus on?

A.Data acquisition – collect more data

B.Model selection – use regularization or reduce model complexity

C.Deployment – re-deploy with a different serving framework

D.Data preparation – check for train/test leakage

AnswerB

The high training accuracy and low test accuracy is classic overfitting. Regularization, dropout, or simpler models can reduce the gap.

Why this answer

The large gap indicates overfitting, which is a model selection/regularization issue. They need to apply techniques like dropout, data augmentation, or reduce model complexity.

Full explanation →

991

MCQhard

A data scientist notices that a model's performance on the training set is excellent, but validation accuracy is poor. The team used the same dataset for feature engineering and model selection. What is the MOST likely cause?

A.Train/test leakage caused by using the same data for feature engineering and model selection

B.The dataset is too small for the model complexity

C.The learning rate is too high

D.The model is overfitting due to high variance

AnswerA

Using validation data to guide feature engineering leaks information, inflating training performance and hurting generalization.

Why this answer

Reusing the same data for feature engineering and model selection creates data leakage from validation set into training decisions, leading to overfitting and poor generalization.

Full explanation →

992

MCQhard

A financial institution wants to deploy an AI system that automates loan approvals. Under the EU AI Act, this system would be classified as high-risk. Which of the following is a MANDATORY requirement for high-risk AI systems before market placement?

A.Obtaining a CE marking of conformity through self-assessment only

B.Establishing a risk management system throughout the system lifecycle

C.Acquiring liability insurance for AI-related harms

D.Publishing the training dataset in an open repository

AnswerB

A continuous risk management system is a mandatory requirement for high-risk AI systems under the EU AI Act.

Why this answer

The EU AI Act requires a conformity assessment (often including third-party involvement) for high-risk AI systems to ensure compliance with requirements such as data governance, transparency, and human oversight. The other options are either voluntary, not required, or apply to lower-risk tiers.

Full explanation →

993

MCQeasy

Under the GDPR, individuals have the right to not be subject to a decision based solely on automated processing if it produces legal effects. Which of the following is a typical safeguard that organisations must provide to comply with this right?

A.The right to have all personal data deleted immediately

B.The right to receive a detailed mathematical explanation of the model

C.The right to demand a more favourable automated decision

D.The right to obtain human intervention on the part of the controller

AnswerD

GDPR Article 22 requires that meaningful human intervention be available for automated decisions producing legal effects.

Why this answer

GDPR requires human intervention as a safeguard for automated decisions with legal effects. The right to explanation and human oversight are key. Deleting data is not a safeguard; consent is not sufficient; detailed explanations are part of transparency.

Full explanation →

994

MCQmedium

A team trains a random forest model on a dataset with 50 features. The model's performance on the test set is significantly worse than on the training set. Which technique is most appropriate to address this issue?

A.Apply cross-validation to tune hyperparameters and reduce overfitting

B.Increase the number of trees in the forest

C.Use feature scaling

D.Perform PCA to reduce dimensions

AnswerA

Cross-validation finds optimal max depth, min samples split, etc., to combat overfitting.

Why this answer

The model's significantly worse test performance compared to training performance is a classic symptom of overfitting. Cross-validation is the most appropriate technique to address this because it evaluates the model's performance on multiple subsets of the data, providing a more robust estimate of generalization. Tuning hyperparameters (e.g., max_depth, min_samples_split) using cross-validation directly reduces overfitting by constraining model complexity, which is the core issue here.

Exam trap

Cisco often tests the misconception that increasing model complexity (e.g., more trees) always improves generalization, when in fact the core issue of overfitting is best addressed by validation-based hyperparameter tuning to control model complexity.

How to eliminate wrong answers

Option B is wrong because increasing the number of trees in a random forest generally improves performance by reducing variance through averaging, but it does not directly address overfitting caused by overly complex trees or insufficient regularization; in fact, more trees can sometimes exacerbate overfitting if individual trees are already too deep. Option C is wrong because feature scaling is irrelevant for random forests, as tree-based models are invariant to monotonic transformations of features; they split based on thresholds and do not rely on distance metrics. Option D is wrong because performing PCA to reduce dimensions is a dimensionality reduction technique that can help with overfitting by removing noise, but it is not the most appropriate first step; cross-validation with hyperparameter tuning directly targets the overfitting problem without losing interpretability or risking information loss from linear transformations.

Full explanation →

995

MCQhard

An MLOps team observes that their production inference API experiences increasing latency as more concurrent requests arrive. They need to scale horizontally while maintaining session state of preprocessing steps. Which deployment strategy should they implement?

A.Deploy stateless containers without session persistence

B.Use a single larger GPU instance to handle all requests

C.Deploy multiple instances behind a round-robin load balancer with sticky sessions

D.Implement a message queue (e.g., Kafka) to buffer requests

AnswerC

Sticky sessions ensure that all requests from a user session are routed to the same instance, preserving session state during horizontal scaling.

Why this answer

Sticky sessions (session affinity) ensure that all requests from a given client are routed to the same backend instance, preserving the in-memory session state of preprocessing steps. Combined with a round-robin load balancer, this allows horizontal scaling while maintaining stateful behavior, which is essential for the described latency issue under concurrent load.

Exam trap

Cisco often tests the distinction between stateless and stateful scaling, where candidates mistakenly choose message queues (Option D) thinking they solve concurrency, but they fail to address the synchronous session state requirement.

How to eliminate wrong answers

Option A is wrong because stateless containers without session persistence would lose the preprocessing session state between requests, breaking the required stateful behavior. Option B is wrong because scaling vertically with a single larger GPU instance does not address horizontal scaling needs and creates a single point of failure, while also not solving the latency increase under concurrent requests. Option D is wrong because a message queue like Kafka buffers requests asynchronously, which introduces decoupling and potential ordering issues, but does not directly provide horizontal scaling with session state preservation for synchronous inference requests.

Full explanation →

996

MCQhard

An MLOps team automates model deployment with a CI/CD pipeline. A performance regression is detected after deploying a new model version. The team needs to automatically roll back to the previous version. Which approach best enables safe automated rollback?

A.Use a blue/green deployment with automated health checks and traffic switching

B.Maintain a manual rollback script that the operations team can run

C.Deploy new models as canary releases and monitor for 24 hours

D.Automatically keep the previous model version in storage for later use

AnswerA

Blue/green allows instant rollback by redirecting traffic.

Why this answer

Blue/green deployment with automated health checks and traffic switching is the best approach because it allows the team to instantly route all traffic back to the previous (green) environment if the new (blue) version fails health checks. This ensures zero-downtime rollback without manual intervention, directly addressing the need for safe automated rollback in a CI/CD pipeline.

Exam trap

CompTIA often tests the distinction between preserving artifacts (storage) and enabling automated traffic switching (deployment strategy), so candidates mistakenly choose Option D thinking storage alone ensures rollback capability.

How to eliminate wrong answers

Option B is wrong because a manual rollback script introduces human delay and error risk, contradicting the requirement for automated rollback. Option C is wrong because canary releases with a 24-hour monitoring window do not provide immediate automated rollback; they rely on manual decision-making after observation, which is not fully automated. Option D is wrong because simply keeping the previous model version in storage does not enable automatic traffic switching or rollback; it only preserves the artifact, not the deployment state.

Full explanation →

997

MCQeasy

A team is using a pre-trained language model for sentiment analysis. They want to adapt it to a specific domain with limited labeled data. Which approach is most efficient?

A.Fine-tune the pre-trained model on domain data

B.Use the pre-trained model as is

C.Train a new model from scratch

D.Ensemble multiple pre-trained models

AnswerA

Fine-tuning updates the model weights slightly on domain data, achieving good performance with few examples.

Why this answer

Fine-tuning a pre-trained language model on domain-specific labeled data is the most efficient approach because it leverages the general language understanding learned from large corpora while adapting to the target domain with minimal additional data. This process uses transfer learning, where only the final layers or a subset of parameters are updated, significantly reducing the amount of labeled data and compute required compared to training from scratch.

Exam trap

Cisco often tests the misconception that a pre-trained model can be used directly for any domain without adaptation, leading candidates to choose Option B, but the trap here is that domain-specific tasks require fine-tuning to align the model's representations with the target data distribution.

How to eliminate wrong answers

Option B is wrong because using the pre-trained model as-is (zero-shot inference) typically yields poor performance on domain-specific sentiment analysis due to vocabulary and context mismatches, as the model was not exposed to domain-specific jargon or sentiment nuances. Option C is wrong because training a new model from scratch requires a massive labeled dataset (often millions of examples) and extensive computational resources, which contradicts the constraint of limited labeled data. Option D is wrong because ensembling multiple pre-trained models without fine-tuning them on domain data does not address the domain adaptation problem; it merely averages their general predictions, which may still be inaccurate for domain-specific sentiment.

Full explanation →

998

MCQeasy

A company has developed a deep learning model for image classification. The team wants to deploy the model to production with high availability and scalability. Which approach should they use?

A.Run the model on a laptop during business hours.

B.Deploy the model as a monolithic application on a single server.

C.Embed the model directly into a mobile app.

D.Use a containerized approach with Kubernetes.

AnswerD

Kubernetes provides orchestration, scaling, and high availability for containerized applications.

Why this answer

Option D is correct because containerization with Kubernetes provides the orchestration, auto-scaling, and self-healing capabilities required for high availability and scalability in production. Kubernetes manages container lifecycles, distributes traffic across replicas via Services and Ingress controllers, and can automatically scale pods based on CPU/memory metrics or custom metrics, ensuring the deep learning model handles variable loads without downtime.

Exam trap

CompTIA often tests the misconception that embedding AI models directly into mobile apps or running them on a single server is sufficient for production, when in reality enterprise-grade deployments require container orchestration for resilience and elasticity.

How to eliminate wrong answers

Option A is wrong because running the model on a laptop during business hours lacks any production-grade availability, scalability, or fault tolerance; it is a single point of failure and cannot handle concurrent requests. Option B is wrong because a monolithic application on a single server creates a single point of failure, cannot scale horizontally, and offers no load balancing or automated recovery, making it unsuitable for high availability. Option C is wrong because embedding the model directly into a mobile app offloads inference to client devices, which introduces latency, security risks, and inconsistent performance; it does not provide centralized high availability or scalability for the production service.

Full explanation →

999

MCQeasy

Which neural network architecture is specifically designed to handle sequential data and mitigate the vanishing gradient problem?

A.Convolutional Neural Network (CNN)

B.Transformer

C.Vanilla Recurrent Neural Network (RNN)

D.Long Short-Term Memory (LSTM) network

AnswerD

LSTMs use forget, input, and output gates to control information flow, effectively handling long sequences and vanishing gradients.

Why this answer

LSTM (Long Short-Term Memory) is a type of RNN designed with gating mechanisms to prevent vanishing gradients in long sequences. CNNs are for spatial data; vanilla RNNs suffer from vanishing gradients; transformers use attention but are not specifically designed to mitigate vanishing gradients (they use residual connections).

Full explanation →

1000

MCQmedium

An AI team notices that their hiring model consistently selects male candidates over equally qualified female candidates. Analysis shows the training data contains past hiring decisions where men were predominantly hired. Which type of bias is the root cause?

A.Algorithmic bias

B.Confirmation bias

C.Selection bias

D.Historical bias

AnswerD

The data reflects past discriminatory hiring practices, causing the model to perpetuate that bias.

Why this answer

Historical bias is the root cause because the training data reflects past hiring decisions that systematically favored male candidates, encoding societal or organizational prejudices into the model. The model learns these historical patterns and perpetuates them, leading to discriminatory outcomes against equally qualified female candidates. This is distinct from algorithmic bias, which would arise from the model's design or optimization process itself.

Exam trap

Cisco often tests the distinction between historical bias (data-driven) and algorithmic bias (model-driven), and the trap here is that candidates may confuse the source of bias as being from the algorithm itself rather than the training data.

How to eliminate wrong answers

Option A is wrong because algorithmic bias refers to bias introduced by the algorithm's design, training process, or optimization function, not by the data itself. Option B is wrong because confirmation bias is a cognitive bias where individuals favor information that confirms their preexisting beliefs, which is not applicable to a machine learning model's training data. Option C is wrong because selection bias occurs when the data is not representative of the population due to non-random sampling, but here the data accurately reflects historical hiring decisions, which are themselves biased.

Full explanation →

Page 14 of 14

All pages

1 2 3 4 5 6 7 8 9 10 11 12 13 14

Practice AI0-001 by domain

Target a specific domain to shore up weak areas.

AI Infrastructure and Technologies AI Security AI Concepts and Foundations AI Concepts and Techniques Machine Learning and Deep Learning AI Models and Data Engineering Implementing AI Solutions AI Implementation and Operations AI Security, Ethics and Governance AI Governance and Ethics

See all domains with question counts →

CompTIA AI+ AI0-001 AI0-001 Questions 976–1000 | Page 14/14 | Courseiva