Practice AI0-001 AI Concepts and Techniques questions with full explanations on every answer.
Start practicing
AI Concepts and Techniques — choose a session length
Free · No account required
Click any question to see the full explanation and answer options, or start a focused practice session above.
A data scientist is building a model to predict whether a loan application will default. The dataset has 10,000 labeled examples with 1,000 defaults. Which metric is MOST appropriate for evaluating this highly imbalanced binary classification?
2A company is deploying a large language model for customer support. They want to reduce the number of off-topic or nonsensical responses while maintaining creativity. Which parameter adjustment would BEST achieve this?
3A startup wants to identify unusual patterns in network traffic to detect potential security breaches. They have a large dataset of normal traffic but very few labeled attacks. Which machine learning approach is MOST suitable?
4A research team is training a deep learning model for image classification using a small dataset of 1,000 labeled images. They are concerned about overfitting. Which combination of regularisation techniques would be MOST effective?
5A developer is building a natural language processing system to classify customer reviews as positive, neutral, or negative. They have 50,000 labeled reviews. Which model architecture is MOST appropriate for this task?
6A company wants to recommend products to users based on their past purchase history. Which machine learning paradigm is BEST suited for this task?
7A machine learning engineer is training a logistic regression model and notices that the loss is decreasing very slowly. The learning rate is set to 0.001. What is the MOST likely cause and appropriate fix?
8A team is training a generative adversarial network (GAN) to generate realistic images of furniture. The generator loss decreases sharply while the discriminator loss increases. What is the MOST likely issue and recommended action?
9An AI practitioner needs to extract key phrases from a large collection of customer support emails for trend analysis. Which technique is MOST suitable?
10A data scientist is selecting a model for a binary classification task where interpretability is critical because of regulatory requirements. The dataset has 20 features and 10,000 samples. Which model is MOST appropriate?
11A developer is using a pre-trained BERT model for a question-answering system. They want to ensure the model can handle out-of-vocabulary words. Which component of the BERT architecture is responsible for this?
12A team is training a recurrent neural network (RNN) with LSTM units to predict stock prices. The validation loss is significantly higher than the training loss. Which action is MOST likely to reduce the gap?
13A company is deploying a chatbot using a large language model. They want to mitigate the risk of prompt injection attacks. Which TWO measures should be implemented?
14A data scientist is evaluating a binary classifier for a medical diagnosis task. The dataset is imbalanced with 5% positive cases. Which THREE metrics should the data scientist consider for a comprehensive evaluation?
15A company wants to build a system that can generate new product images for an online catalog. Which TWO generative AI approaches are most suitable?
16A company wants to build a customer service chatbot that answers questions about their internal policy documents. The documents are updated monthly, and the team cannot afford to retrain a model each time. Which approach is MOST appropriate?
17A data scientist is building a model to predict whether a credit card transaction is fraudulent, using labeled historical data. Which machine learning paradigm is being used?
18A deep learning engineer is training a transformer model and notices that validation perplexity increases after a few epochs while training perplexity continues to decrease. Which of the following is the MOST likely cause?
19A product team wants a system that can generate high-quality synthetic images of furniture in different room settings for an online catalog. The images must be photorealistic and vary in style. Which generative AI approach is BEST suited for this task?
20An AI practitioner needs to measure the performance of a binary classification model for disease detection, where the cost of false negatives is very high. Which metric should be prioritized?
21A data scientist is using a linear regression model to predict house prices and observes that the model performs well on training data but poorly on test data. Which regularisation technique is MOST appropriate to reduce overfitting?
22A team is developing a sentiment analysis model and obtains the following performance on the test set: accuracy=0.92, precision=0.75, recall=0.80, F1=0.77. The baseline majority-class classifier achieves 0.85 accuracy. Which conclusion is MOST justified?
23An AI engineer is designing a system to detect unusual patterns in network traffic that may indicate a security breach. The system should learn from normal traffic patterns and flag deviations. Which machine learning approach is MOST appropriate?
24A machine learning engineer is training a neural network for image classification. The training loss decreases slowly and the model accuracy improves only marginally each epoch. Which hyperparameter adjustment is MOST likely to accelerate convergence?
25A data analyst wants to use a model that provides feature importance scores to understand which factors most influence customer churn. They also need the model to handle both numerical and categorical data with minimal preprocessing. Which algorithm is BEST suited?
26An AI system that can perform any intellectual task that a human being can is referred to as:
27A research team is fine-tuning a BERT model for a text classification task. They notice that the model's performance on the validation set fluctuates wildly across epochs, sometimes dropping significantly from one epoch to the next. Which technique is MOST likely to stabilise training?
28An AI developer is selecting a model architecture for a real-time video surveillance system that must detect objects in each frame and also track movement patterns across frames. Which TWO architectures should the developer combine? (Choose 2)
29A healthcare startup is building a diagnostic support system using a large language model. The system must provide accurate, evidence-based answers and avoid generating harmful or fabricated information. Which THREE techniques should be implemented to achieve this? (Choose 3)
30A machine learning team is splitting a dataset for a binary classification problem. They want to ensure robust evaluation and avoid data leakage. Which TWO practices should they follow? (Choose 2)
31A data scientist is building a model to predict credit default using historical loan data. The dataset contains 100,000 records with 50 features, including income, debt-to-income ratio, and loan amount. The target variable is binary (default vs. no default). The goal is to maximize interpretability while maintaining high accuracy. Which algorithm is MOST appropriate?
32A team is training a deep learning model for image classification. The training loss decreases steadily but the validation loss plateaus after 20 epochs and then starts to increase. Which action is MOST likely to improve generalization?
33Which machine learning paradigm is best suited for training a model to play a game by learning from its own actions and rewards, without labeled data?
34A natural language processing team wants to build a sentiment analysis model for customer reviews. They have 10,000 labeled reviews and 1 million unlabeled reviews. Which approach would MOST effectively leverage the unlabeled data?
35A company is deploying a text generation model for customer service emails. They want to ensure the model's responses are factual and based on internal knowledge bases. Which technique is most effective?
36A machine learning engineer is training a transformer model for machine translation. The model's perplexity on the validation set is 8.5, and the BLEU score is 32. After increasing the number of encoder layers from 6 to 12, perplexity drops to 7.2 but BLEU decreases to 28. What is the MOST likely cause?
37Which neural network architecture is specifically designed to handle sequential data and mitigate the vanishing gradient problem?
38A data scientist is evaluating a binary classification model. The model achieves 95% accuracy on the test set, but the precision is 0.60 and recall is 0.55. The dataset has 90% negative class samples. Which metric should the team focus on to improve the model?
39A company wants to generate realistic images of new product designs. They have a large dataset of existing product images. Which generative AI approach is MOST suitable for creating novel, high-quality images?
40A team is fine-tuning a BERT model for a document classification task. They notice the model achieves high F1 scores on the training set but low F1 on the validation set. Which regularization technique would be MOST effective?
41In unsupervised learning, which task involves grouping similar data points together based on feature similarities?
42A developer is using a large language model via an API. They want the model to solve a math problem step by step. Which prompt engineering technique should they use?
43A data scientist is preparing to train a convolutional neural network (CNN) for image classification. Which TWO actions are most effective for preventing overfitting? (Choose 2)
44A company wants to deploy an LLM-based chatbot that can handle sensitive customer information. Which THREE measures should be implemented to mitigate prompt injection attacks? (Choose 3)
45A machine learning team is evaluating a logistic regression model for a binary classification task. The dataset has 1,000 samples and 20 features. Which TWO metrics are most appropriate for evaluating model performance? (Choose 2)
46A company wants to build a customer service chatbot that answers questions about their internal policy documents. The documents are updated monthly, and the team cannot afford to retrain a model each time. Which approach is MOST appropriate?
47A data scientist needs to predict whether a customer will churn (yes/no) based on historical data. Which type of machine learning problem is this?
48A team trains a neural network for image classification. During training, the loss decreases on the training set but increases on the validation set after a few epochs. What is the most likely cause?
49A generative AI model is asked to 'Write a poem about AI' and returns a very short, generic response. The user wants longer, more creative outputs. Which parameter adjustment is MOST likely to help?
50A bank wants to detect fraudulent transactions in real-time. The dataset is highly imbalanced (99.9% legitimate, 0.1% fraud). Which evaluation metric is MOST appropriate for model performance?
51Which type of neural network is BEST suited for processing sequential data such as time series or natural language?
52An AI engineer is tuning a large language model for a summarization task. The output summaries are too verbose and include irrelevant details. Which technique should be applied to encourage concise outputs?
53A model trained on customer reviews achieves 98% accuracy on the test set. However, when deployed, it performs poorly on real-world data. The data scientist suspects distribution shift. Which action is MOST important to address this?
54Which of the following is a key characteristic of Narrow AI (Weak AI)?
55A team is using a pre-trained BERT model for a sentiment analysis task on product reviews. They want to adapt it to their specific domain with limited labeled data. Which approach is MOST effective?
56An organization's AI system uses a decision tree model for loan approval. The compliance team requires explanations for each decision. Which property of decision trees makes them suitable for this requirement?
57A prompt engineer wants to reduce the risk of prompt injection attacks in an LLM-based application that processes user input. Which strategy is MOST effective?
58A data scientist is building a recommendation system for an e-commerce platform. The dataset includes user purchase history, product descriptions, and user demographics. The goal is to recommend products that a user is likely to purchase. Which TWO techniques are most appropriate for this task? (Select TWO.)
59A team is deploying a sentiment analysis model that must achieve high precision and high recall. They have a labeled dataset of 10,000 samples. They want to minimize overfitting. Which THREE actions are most appropriate? (Select THREE.)
60A company wants to classify images of products into categories. They have a large dataset of labeled images. Which TWO types of neural networks are most suitable for this task? (Select TWO.)
61A data scientist is building a model to predict whether a transaction is fraudulent. The dataset has 99.9% legitimate transactions and 0.1% fraudulent ones. Which evaluation metric is MOST appropriate to assess model performance given this class imbalance?
62Which machine learning paradigm involves training an agent to make decisions by interacting with an environment and receiving rewards or penalties based on its actions?
63A developer is fine-tuning a large language model for a legal document summarization task. They notice that during training, the loss decreases rapidly in the first few epochs but then plateaus with high variance. Which hyperparameter adjustment is MOST likely to help stabilize training?
64A team is deploying a sentiment analysis model for social media posts. The model currently performs well on English text but poorly on code-switched text (e.g., Spanglish). Which approach is MOST effective for improving performance on code-switched data without starting from scratch?
65Which neural network architecture is specifically designed to process sequential data, such as time series or sentences, by maintaining a hidden state that captures information about previous inputs?
66A company wants to automatically group customer support tickets into categories (e.g., billing, technical, account) without pre-labeled data. Which machine learning approach should they use?
67A generative AI model produces images from text prompts. The outputs are often blurry and lack fine details. Which model type is MOST likely being used, and which improvement would best address this issue?
68Which of the following best describes the difference between narrow AI and general AI?
69A team is training a deep learning model for image classification. They observe that training accuracy is high but validation accuracy is low, indicating overfitting. Which TWO techniques should they apply to reduce overfitting? (Select TWO)
70An AI engineer is fine-tuning a transformer-based language model for a domain-specific task. They want to improve the model's factual accuracy and reduce hallucinations. Which THREE strategies should they consider? (Select THREE)
71A data scientist needs to select a regression model to predict house prices. The dataset contains many features, some of which are irrelevant. Which TWO algorithms are BEST suited for this scenario, and why? (Select TWO)
72A company is deploying an LLM-powered application that answers questions based on internal documents. They want to minimize prompt injection attacks where users trick the model into ignoring instructions. Which THREE measures should they implement? (Select THREE)
73A machine learning engineer is training a convolutional neural network (CNN) for object detection in satellite imagery. The training loss is not decreasing significantly. Which TWO adjustments could help the model converge? (Select TWO)
74A company wants to use machine learning to recommend products to customers based on their purchase history. Which TWO techniques are appropriate for this task? (Select TWO)
75A data scientist is preparing a dataset for a binary classification model. The dataset has 1000 samples, with 800 positives and 200 negatives. To evaluate the model properly, which THREE steps should they take? (Select THREE)
The AI Concepts and Techniques domain covers the key concepts tested in this area of the AI0-001 exam blueprint published by CompTIA. Courseiva provides free domain-focused practice, mock exams, missed-question review, and readiness tracking across all AI0-001 domains — no account required.
The Courseiva AI0-001 question bank contains 75 questions in the AI Concepts and Techniques domain. Click any question to see the full explanation and answer breakdown.
Start with a 10-question focused session to identify your baseline accuracy in this domain. Read every explanation — even for questions you answer correctly — to understand the reasoning. Once you score consistently above 80%, move to a 20–30 question session to confirm depth before moving to the next domain.
Yes — the session launcher on this page draws questions exclusively from the AI Concepts and Techniques domain. Choose 10, 20, 30, or 50 questions for a focused session, or click individual questions to review them one by one.
Save your results, see per-domain analytics, and get readiness scores — free, for every certification.
Sign Up FreeFree forever · Every certification included