How many AI Concepts and Techniques questions are on the AI0-001 exam?

The AI Concepts and Techniques domain is one of the weighted domains on the AI0-001 exam. The Courseiva question bank has 75 practice questions for this domain.

How can I practice AI Concepts and Techniques questions for AI0-001?

Click any of the 75 questions listed on this page to see the full question and explanation, or use the session launcher to start a focused practice session of 10, 20, 30 or 50 questions drawn only from the AI Concepts and Techniques domain.

Free AI0-001 AI Concepts and Techniques Practice Questions (2026)

Practice AI Concepts and Techniques questions

10Q 20Q 30Q 50Q

All AI0-001 AI Concepts and Techniques questions (75)

Start session

Click any question to see the full explanation and answer options, or start a focused practice session above.

A data scientist is building a model to predict whether a loan application will default. The dataset has 10,000 labeled examples with 1,000 defaults. Which metric is MOST appropriate for evaluating this highly imbalanced binary classification?

A company is deploying a large language model for customer support. They want to reduce the number of off-topic or nonsensical responses while maintaining creativity. Which parameter adjustment would BEST achieve this?

A startup wants to identify unusual patterns in network traffic to detect potential security breaches. They have a large dataset of normal traffic but very few labeled attacks. Which machine learning approach is MOST suitable?

A research team is training a deep learning model for image classification using a small dataset of 1,000 labeled images. They are concerned about overfitting. Which combination of regularisation techniques would be MOST effective?

A developer is building a natural language processing system to classify customer reviews as positive, neutral, or negative. They have 50,000 labeled reviews. Which model architecture is MOST appropriate for this task?

A company wants to recommend products to users based on their past purchase history. Which machine learning paradigm is BEST suited for this task?

A machine learning engineer is training a logistic regression model and notices that the loss is decreasing very slowly. The learning rate is set to 0.001. What is the MOST likely cause and appropriate fix?

A team is training a generative adversarial network (GAN) to generate realistic images of furniture. The generator loss decreases sharply while the discriminator loss increases. What is the MOST likely issue and recommended action?

An AI practitioner needs to extract key phrases from a large collection of customer support emails for trend analysis. Which technique is MOST suitable?

A data scientist is selecting a model for a binary classification task where interpretability is critical because of regulatory requirements. The dataset has 20 features and 10,000 samples. Which model is MOST appropriate?

A developer is using a pre-trained BERT model for a question-answering system. They want to ensure the model can handle out-of-vocabulary words. Which component of the BERT architecture is responsible for this?

A team is training a recurrent neural network (RNN) with LSTM units to predict stock prices. The validation loss is significantly higher than the training loss. Which action is MOST likely to reduce the gap?

A company is deploying a chatbot using a large language model. They want to mitigate the risk of prompt injection attacks. Which TWO measures should be implemented?

A data scientist is evaluating a binary classifier for a medical diagnosis task. The dataset is imbalanced with 5% positive cases. Which THREE metrics should the data scientist consider for a comprehensive evaluation?

A company wants to build a system that can generate new product images for an online catalog. Which TWO generative AI approaches are most suitable?

A company wants to build a customer service chatbot that answers questions about their internal policy documents. The documents are updated monthly, and the team cannot afford to retrain a model each time. Which approach is MOST appropriate?

A data scientist is building a model to predict whether a credit card transaction is fraudulent, using labeled historical data. Which machine learning paradigm is being used?

A deep learning engineer is training a transformer model and notices that validation perplexity increases after a few epochs while training perplexity continues to decrease. Which of the following is the MOST likely cause?

A product team wants a system that can generate high-quality synthetic images of furniture in different room settings for an online catalog. The images must be photorealistic and vary in style. Which generative AI approach is BEST suited for this task?

An AI practitioner needs to measure the performance of a binary classification model for disease detection, where the cost of false negatives is very high. Which metric should be prioritized?

A data scientist is using a linear regression model to predict house prices and observes that the model performs well on training data but poorly on test data. Which regularisation technique is MOST appropriate to reduce overfitting?

A team is developing a sentiment analysis model and obtains the following performance on the test set: accuracy=0.92, precision=0.75, recall=0.80, F1=0.77. The baseline majority-class classifier achieves 0.85 accuracy. Which conclusion is MOST justified?

An AI engineer is designing a system to detect unusual patterns in network traffic that may indicate a security breach. The system should learn from normal traffic patterns and flag deviations. Which machine learning approach is MOST appropriate?

A machine learning engineer is training a neural network for image classification. The training loss decreases slowly and the model accuracy improves only marginally each epoch. Which hyperparameter adjustment is MOST likely to accelerate convergence?

A data analyst wants to use a model that provides feature importance scores to understand which factors most influence customer churn. They also need the model to handle both numerical and categorical data with minimal preprocessing. Which algorithm is BEST suited?

An AI system that can perform any intellectual task that a human being can is referred to as:

A research team is fine-tuning a BERT model for a text classification task. They notice that the model's performance on the validation set fluctuates wildly across epochs, sometimes dropping significantly from one epoch to the next. Which technique is MOST likely to stabilise training?

An AI developer is selecting a model architecture for a real-time video surveillance system that must detect objects in each frame and also track movement patterns across frames. Which TWO architectures should the developer combine? (Choose 2)

A healthcare startup is building a diagnostic support system using a large language model. The system must provide accurate, evidence-based answers and avoid generating harmful or fabricated information. Which THREE techniques should be implemented to achieve this? (Choose 3)

A machine learning team is splitting a dataset for a binary classification problem. They want to ensure robust evaluation and avoid data leakage. Which TWO practices should they follow? (Choose 2)

A data scientist is building a model to predict credit default using historical loan data. The dataset contains 100,000 records with 50 features, including income, debt-to-income ratio, and loan amount. The target variable is binary (default vs. no default). The goal is to maximize interpretability while maintaining high accuracy. Which algorithm is MOST appropriate?

A team is training a deep learning model for image classification. The training loss decreases steadily but the validation loss plateaus after 20 epochs and then starts to increase. Which action is MOST likely to improve generalization?

Which machine learning paradigm is best suited for training a model to play a game by learning from its own actions and rewards, without labeled data?

A natural language processing team wants to build a sentiment analysis model for customer reviews. They have 10,000 labeled reviews and 1 million unlabeled reviews. Which approach would MOST effectively leverage the unlabeled data?

A company is deploying a text generation model for customer service emails. They want to ensure the model's responses are factual and based on internal knowledge bases. Which technique is most effective?

A machine learning engineer is training a transformer model for machine translation. The model's perplexity on the validation set is 8.5, and the BLEU score is 32. After increasing the number of encoder layers from 6 to 12, perplexity drops to 7.2 but BLEU decreases to 28. What is the MOST likely cause?

Which neural network architecture is specifically designed to handle sequential data and mitigate the vanishing gradient problem?

A data scientist is evaluating a binary classification model. The model achieves 95% accuracy on the test set, but the precision is 0.60 and recall is 0.55. The dataset has 90% negative class samples. Which metric should the team focus on to improve the model?

A company wants to generate realistic images of new product designs. They have a large dataset of existing product images. Which generative AI approach is MOST suitable for creating novel, high-quality images?

A team is fine-tuning a BERT model for a document classification task. They notice the model achieves high F1 scores on the training set but low F1 on the validation set. Which regularization technique would be MOST effective?

In unsupervised learning, which task involves grouping similar data points together based on feature similarities?

A developer is using a large language model via an API. They want the model to solve a math problem step by step. Which prompt engineering technique should they use?

A data scientist is preparing to train a convolutional neural network (CNN) for image classification. Which TWO actions are most effective for preventing overfitting? (Choose 2)

A company wants to deploy an LLM-based chatbot that can handle sensitive customer information. Which THREE measures should be implemented to mitigate prompt injection attacks? (Choose 3)

A machine learning team is evaluating a logistic regression model for a binary classification task. The dataset has 1,000 samples and 20 features. Which TWO metrics are most appropriate for evaluating model performance? (Choose 2)

A data scientist needs to predict whether a customer will churn (yes/no) based on historical data. Which type of machine learning problem is this?

A team trains a neural network for image classification. During training, the loss decreases on the training set but increases on the validation set after a few epochs. What is the most likely cause?

A generative AI model is asked to 'Write a poem about AI' and returns a very short, generic response. The user wants longer, more creative outputs. Which parameter adjustment is MOST likely to help?

A bank wants to detect fraudulent transactions in real-time. The dataset is highly imbalanced (99.9% legitimate, 0.1% fraud). Which evaluation metric is MOST appropriate for model performance?

Which type of neural network is BEST suited for processing sequential data such as time series or natural language?

An AI engineer is tuning a large language model for a summarization task. The output summaries are too verbose and include irrelevant details. Which technique should be applied to encourage concise outputs?

A model trained on customer reviews achieves 98% accuracy on the test set. However, when deployed, it performs poorly on real-world data. The data scientist suspects distribution shift. Which action is MOST important to address this?

Which of the following is a key characteristic of Narrow AI (Weak AI)?

A team is using a pre-trained BERT model for a sentiment analysis task on product reviews. They want to adapt it to their specific domain with limited labeled data. Which approach is MOST effective?

An organization's AI system uses a decision tree model for loan approval. The compliance team requires explanations for each decision. Which property of decision trees makes them suitable for this requirement?

A prompt engineer wants to reduce the risk of prompt injection attacks in an LLM-based application that processes user input. Which strategy is MOST effective?

A data scientist is building a recommendation system for an e-commerce platform. The dataset includes user purchase history, product descriptions, and user demographics. The goal is to recommend products that a user is likely to purchase. Which TWO techniques are most appropriate for this task? (Select TWO.)

A team is deploying a sentiment analysis model that must achieve high precision and high recall. They have a labeled dataset of 10,000 samples. They want to minimize overfitting. Which THREE actions are most appropriate? (Select THREE.)

A company wants to classify images of products into categories. They have a large dataset of labeled images. Which TWO types of neural networks are most suitable for this task? (Select TWO.)

A data scientist is building a model to predict whether a transaction is fraudulent. The dataset has 99.9% legitimate transactions and 0.1% fraudulent ones. Which evaluation metric is MOST appropriate to assess model performance given this class imbalance?

Which machine learning paradigm involves training an agent to make decisions by interacting with an environment and receiving rewards or penalties based on its actions?

A developer is fine-tuning a large language model for a legal document summarization task. They notice that during training, the loss decreases rapidly in the first few epochs but then plateaus with high variance. Which hyperparameter adjustment is MOST likely to help stabilize training?

A team is deploying a sentiment analysis model for social media posts. The model currently performs well on English text but poorly on code-switched text (e.g., Spanglish). Which approach is MOST effective for improving performance on code-switched data without starting from scratch?

Which neural network architecture is specifically designed to process sequential data, such as time series or sentences, by maintaining a hidden state that captures information about previous inputs?

A company wants to automatically group customer support tickets into categories (e.g., billing, technical, account) without pre-labeled data. Which machine learning approach should they use?

A generative AI model produces images from text prompts. The outputs are often blurry and lack fine details. Which model type is MOST likely being used, and which improvement would best address this issue?

Which of the following best describes the difference between narrow AI and general AI?

A team is training a deep learning model for image classification. They observe that training accuracy is high but validation accuracy is low, indicating overfitting. Which TWO techniques should they apply to reduce overfitting? (Select TWO)

An AI engineer is fine-tuning a transformer-based language model for a domain-specific task. They want to improve the model's factual accuracy and reduce hallucinations. Which THREE strategies should they consider? (Select THREE)

A data scientist needs to select a regression model to predict house prices. The dataset contains many features, some of which are irrelevant. Which TWO algorithms are BEST suited for this scenario, and why? (Select TWO)

A company is deploying an LLM-powered application that answers questions based on internal documents. They want to minimize prompt injection attacks where users trick the model into ignoring instructions. Which THREE measures should they implement? (Select THREE)

A machine learning engineer is training a convolutional neural network (CNN) for object detection in satellite imagery. The training loss is not decreasing significantly. Which TWO adjustments could help the model converge? (Select TWO)

A company wants to use machine learning to recommend products to customers based on their purchase history. Which TWO techniques are appropriate for this task? (Select TWO)

A data scientist is preparing a dataset for a binary classification model. The dataset has 1000 samples, with 800 positives and 200 negatives. To evaluate the model properly, which THREE steps should they take? (Select THREE)

Practice all 75 AI Concepts and Techniques questions

Other AI0-001 exam domains

AI Infrastructure and Technologies AI Security AI Concepts and Foundations Machine Learning and Deep Learning AI Models and Data Engineering Implementing AI Solutions AI Implementation and Operations AI Security, Ethics and Governance AI Governance and Ethics

Frequently asked questions

What does the AI Concepts and Techniques domain cover on the AI0-001 exam?

The AI Concepts and Techniques domain covers the key concepts tested in this area of the AI0-001 exam blueprint published by CompTIA. Courseiva provides free domain-focused practice, mock exams, missed-question review, and readiness tracking across all AI0-001 domains — no account required.

How many AI Concepts and Techniques questions are in the AI0-001 question bank?

The Courseiva AI0-001 question bank contains 75 questions in the AI Concepts and Techniques domain. Click any question to see the full explanation and answer breakdown.

What is the best way to practice AI Concepts and Techniques for AI0-001?

Start with a 10-question focused session to identify your baseline accuracy in this domain. Read every explanation — even for questions you answer correctly — to understand the reasoning. Once you score consistently above 80%, move to a 20–30 question session to confirm depth before moving to the next domain.

Can I practice only AI Concepts and Techniques questions for AI0-001?

Yes — the session launcher on this page draws questions exclusively from the AI Concepts and Techniques domain. Choose 10, 20, 30, or 50 questions for a focused session, or click individual questions to review them one by one.

Free forever · No credit card required

Track your AI0-001 domain progress

Save your results, see per-domain analytics, and get readiness scores — free, for every certification.

Free forever · Every certification included