AI0-001 · topic practice

Machine Learning and Deep Learning practice questions

Practise CompTIA AI+ AI0-001 Machine Learning and Deep Learning practice questions — original exam-style scenarios with answer choices, explanations, and analysis of common mistakes.

Courseiva uses original exam-style practice questions designed for learning and revision. The goal is to understand the concepts, recognise exam patterns, and improve through explanations — not memorise copied exam dumps.

Reviewed byJohnson Ajibi· MSc IT Security
20 questionsDomain: Machine Learning and Deep Learning

What the exam tests

What to know about Machine Learning and Deep Learning

Machine Learning and Deep Learning questions test whether you can apply the concept in context, not just recognise a definition.

How the topic appears in realistic exam-style scenarios.

Which detail in the question changes the correct answer.

How to eliminate plausible but wrong options.

How to connect the question back to the wider exam objective.

Watch out for

Common Machine Learning and Deep Learning exam traps

  • Answering from memory before reading the full scenario.
  • Missing a constraint such as cost, availability, security, scope or command context.
  • Choosing a broad answer when the question asks for the most specific fix.
  • Ignoring why the wrong options are tempting.

Practice set

Machine Learning and Deep Learning questions

20 questions · select your answer, then reveal the explanation

A data scientist is building a classification model to detect fraudulent transactions. The dataset is highly imbalanced with only 1% fraudulent cases. Which approach should the scientist use to evaluate model performance most effectively?

A machine learning team is deploying a model that predicts customer churn. They notice that the model's predictions are highly sensitive to small changes in input features, leading to inconsistent outputs. Which technique should the team apply to improve model stability?

A deep learning model for image classification is overfitting the training data. The team has already tried data augmentation and dropout. Which additional technique should they implement to reduce overfitting?

Question 4easymultiple choice
Read the full NAT/PAT explanation →

A company wants to deploy a machine learning model that requires continuous learning as new data arrives. The model must be able to adapt to changing patterns without retraining from scratch. Which approach should be used?

A data engineer is designing a pipeline to train a linear regression model on a dataset with 10 million rows and 50 features. The dataset fits in memory. Which approach should the engineer use to train the model efficiently?

A data scientist is training a convolutional neural network (CNN) for object detection. The training loss decreases rapidly but then plateaus at a high value, and the validation loss starts increasing. Which action should the scientist take to improve the model?

A team is building a recommendation system using collaborative filtering. They have a sparse user-item matrix. Which technique should they use to handle the sparsity and improve recommendations?

Which TWO techniques are commonly used to handle missing data in a machine learning dataset? (Choose TWO.)

Which THREE are common activation functions used in neural networks? (Choose THREE.)

Which TWO are valid techniques to reduce overfitting in a deep neural network? (Choose TWO.)

A data scientist is training a multi-class classifier with 10 classes. The training log shows the above output for the first two epochs. What is the most likely cause?

Exhibit

Refer to the exhibit.

```
Epoch 1/10
 - loss: 2.3026 - accuracy: 0.1000 - val_loss: 2.3026 - val_accuracy: 0.1000
Epoch 2/10
 - loss: 2.3026 - accuracy: 0.1000 - val_loss: 2.3026 - val_accuracy: 0.1000
```

A team is reviewing a neural network model summary. The input layer expects 784 features (e.g., 28x28 images). How many parameters does the first dense layer have?

Exhibit

Refer to the exhibit.

```
Model: "sequential"
_________________________________________________________________
Layer (type)                 Output Shape              Param #
=================================================================
dense (Dense)                (None, 128)               100352
_________________________________________________________________
dense_1 (Dense)              (None, 64)                8256
_________________________________________________________________
dense_2 (Dense)              (None, 10)                650
=================================================================
Total params: 109,258
Trainable params: 109,258
Non-trainable params: 0
_________________________________________________________________
```

A data scientist is training a neural network to classify images of handwritten digits. The model achieves 99% accuracy on training data but only 85% on validation data. Which technique should the scientist apply first to address this issue?

A company is deploying a machine learning model to predict customer churn. The dataset is highly imbalanced (95% non-churn, 5% churn). The model achieves 96% accuracy, but the F1-score for the churn class is only 0.2. Which metric should the team prioritize to evaluate model performance for this business problem?

Question 15hardmultiple choice
Read the full NAT/PAT explanation →

An autonomous vehicle system uses a deep reinforcement learning agent to navigate. The agent's reward function gives +1 for reaching the destination and -0.1 for each time step. After training, the agent learns to circle the block repeatedly without reaching the destination. Which modification is most likely to fix this behavior?

A machine learning engineer is building a spam filter. The dataset contains 10,000 emails, of which 1,000 are spam. The engineer decides to use a Random Forest classifier. Which preprocessing step is most critical to ensure the model generalizes well to new, unseen emails?

Which TWO techniques are commonly used to prevent overfitting in deep neural networks?

Refer to the exhibit. A data scientist is training a binary classifier. Based on the training log, which problem is the model experiencing?

Exhibit

Refer to the exhibit.

```
Epoch 1/10
 - loss: 0.6932 - acc: 0.5123 - val_loss: 0.6981 - val_acc: 0.5012
Epoch 2/10
 - loss: 0.4521 - acc: 0.7845 - val_loss: 0.6890 - val_acc: 0.5123
Epoch 3/10
 - loss: 0.2312 - acc: 0.9234 - val_loss: 0.7123 - val_acc: 0.4987
Epoch 4/10
 - loss: 0.1023 - acc: 0.9789 - val_loss: 0.8567 - val_acc: 0.4856
Epoch 5/10
 - loss: 0.0456 - acc: 0.9923 - val_loss: 1.0234 - val_acc: 0.4765
```
Question 19hardmultiple choice
Read the full NAT/PAT explanation →

A healthcare startup is developing a deep learning model to detect diabetic retinopathy from retinal fundus images. The dataset contains 50,000 images, but only 5% are labeled as positive for the disease. The team uses a convolutional neural network (CNN) with a final sigmoid layer and binary cross-entropy loss. After training for 20 epochs, the model achieves 95% accuracy on the test set, but the recall for the positive class is only 10%. The team suspects the model is biased toward the negative class due to class imbalance. The data is stored in a secure environment, and no additional labeled data can be obtained. The team has access to the following techniques: oversampling the minority class, undersampling the majority class, using class weights in the loss function, applying data augmentation, and using a different architecture. Which course of action is most likely to improve recall for the positive class while maintaining reasonable overall performance?

A retail company uses a gradient boosting model to predict customer lifetime value (CLV). The model currently uses 50 features including purchase history, demographics, and web behavior. The model's RMSE on the test set is 120. The data science team wants to improve the model's accuracy without increasing training time significantly. They have access to additional data: customer support interaction logs (text), social media sentiment (text), and third-party credit scores (numeric). They also have the ability to perform feature engineering, hyperparameter tuning, and ensemble methods. Which approach is most likely to yield the best improvement in predictive performance with minimal increase in training time?

Free account

Track your progress over time

Create a free account to save your results and see which topics improve across sessions.

Focused Machine Learning and Deep Learning sessions

Start a Machine Learning and Deep Learning only practice session

Every question in these sessions is drawn from the Machine Learning and Deep Learning domain — nothing else.

Related practice questions

Related AI0-001 topic practice pages

Move into related areas when this topic feels solid.

Frequently asked questions

What does the AI0-001 exam test about Machine Learning and Deep Learning?
Machine Learning and Deep Learning questions test whether you can apply the concept in context, not just recognise a definition.
How should I use these practice questions?
Select your answer before revealing the explanation. Then read why each option is right or wrong — this active recall approach builds retention far faster than re-reading notes.
Can I practise just Machine Learning and Deep Learning questions in a focused session?
Yes — the session launcher on this page draws every question from the Machine Learning and Deep Learning domain. Use a 10-question session first to gauge your baseline, then move to 20 or 30 once the weak spots are clear.
Where can I practise other AI0-001 topics?
Use the topic links above to move to related areas, or go back to the AI0-001 question bank to see all topics.
Are these real exam questions or dumps?
These are original practice questions written to test the same concepts the AI0-001 exam covers. They are not copied from any real exam or dump site.