Question 1,197 of 1,755
ModelinghardMultiple SelectObjective-mapped

Quick Answer

The answer is training for too many epochs, as this allows the model to memorize noise and irrelevant patterns in the training data rather than learning generalizable features. This occurs because a complex model, such as a deep neural network with high capacity, can easily overfit when exposed to a small dataset, capturing spurious correlations that do not reflect the true underlying distribution. On the AWS Certified Machine Learning Specialty MLS-C01 exam, this concept tests your understanding of the bias-variance tradeoff and model regularization; a common trap is confusing overfitting with underfitting or assuming more epochs always improve accuracy. To avoid this, remember the mnemonic "Epochs Overfit Noise" (EON) — excessive epochs on limited data lead to memorization, not generalization.

MLS-C01 Modeling Practice Question

This MLS-C01 practice question tests your understanding of modeling. The scenario asks you to isolate a root cause — eliminate options that address a different problem before choosing. After answering, compare your reasoning against the explanation and wrong-answer breakdown below. Once you have made your selection, read the full explanation to reinforce the concept and understand why each distractor is designed to mislead on exam day.

Which THREE of the following are common causes of overfitting in machine learning models?

Question 1hardmulti select
Full question →

Answer choices

Why each option matters

Answer the question above first, then reveal the full breakdown to understand why each option is right or wrong.

Correct answer & explanation

Using a complex model like a deep neural network on a small dataset

Option A is correct because a complex model like a deep neural network has high capacity and can easily memorize noise and patterns specific to a small dataset, rather than learning generalizable features. With limited training samples, the model fails to capture the underlying data distribution, leading to poor performance on unseen data.

Key principle: Answer the scenario, not the keyword: identify the specific constraint before choosing the most familiar-sounding option.

Answer analysis

Option-by-option breakdown

For each option: why learners choose it and why it is or isn't the right answer here.

  • Using a complex model like a deep neural network on a small dataset

    Why this is correct

    Complex models on small data overfit.

    Related concept

    Read the scenario before looking for a memorised answer.

  • Model has too many parameters relative to the number of training samples

    Why this is correct

    Too many parameters allow memorization.

    Related concept

    Read the scenario before looking for a memorised answer.

  • Having a large dataset with many samples

    Why it's wrong here

    More data reduces overfitting.

  • Training for too many epochs

    Why this is correct

    Long training can lead to overfitting.

    Related concept

    Read the scenario before looking for a memorised answer.

  • Using regularization techniques

    Why it's wrong here

    Regularization prevents overfitting.

Common exam traps

Common exam trap: answer the scenario, not the keyword

Cisco often tests the misconception that more data or regularization causes overfitting, when in fact both are standard countermeasures; the trap is confusing correlation with causation in model training dynamics.

Detailed technical explanation

How to think about this question

Overfitting occurs when a model learns the training data too well, including its noise and outliers, resulting in high variance. This is often quantified by a large gap between training and validation loss. In practice, techniques like early stopping, cross-validation, and data augmentation are used alongside regularization to mitigate overfitting, especially in deep learning where the number of parameters can exceed the number of samples by orders of magnitude.

KKey Concepts to Remember

  • Read the scenario before looking for a memorised answer.
  • Find the constraint that changes the correct option.
  • Eliminate answers that are true in general but not in this case.

TExam Day Tips

  • Watch for words such as best, first, most likely and least administrative effort.
  • Review why wrong options are wrong, not only why the correct option is correct.

Key takeaway

Answer the scenario, not the keyword: identify the specific constraint before choosing the most familiar-sounding option.

Real-world example

How this comes up in practice

A healthcare organisation deploys an application with a public-facing web tier and a private database tier. The database subnet has no public IP and only accepts connections from the web tier's security group. Questions like this test whether you can design cloud network isolation using VNets/VPCs, subnets, and security group rules.

What to study next

Got this wrong? Here's your next step.

Identify which exam domain this question belongs to, review the core concept, then practise similar questions from the same domain.

Related practice questions

Related MLS-C01 practice-question pages

Use these pages to review the topic behind this question. This is how one missed question becomes focused revision.

Practice this exam

Start a free MLS-C01 practice session

Short sessions build daily habit. Longer sessions build exam-day stamina. Try a timed session to simulate real conditions.

FAQ

Questions learners often ask

What does this MLS-C01 question test?

Modeling — This question tests Modeling — Read the scenario before looking for a memorised answer..

What is the correct answer to this question?

The correct answer is: Using a complex model like a deep neural network on a small dataset — Option A is correct because a complex model like a deep neural network has high capacity and can easily memorize noise and patterns specific to a small dataset, rather than learning generalizable features. With limited training samples, the model fails to capture the underlying data distribution, leading to poor performance on unseen data.

What should I do if I get this MLS-C01 question wrong?

Identify which exam domain this question belongs to, review the core concept, then practise similar questions from the same domain.

What is the key concept behind this question?

Read the scenario before looking for a memorised answer.

About these practice questions

Courseiva creates original exam-style practice questions with explanations and wrong-answer analysis. It does not publish real exam questions, exam dumps, or protected exam content. Learn why practice questions differ from exam dumps →

How Courseiva writes practice questions · Editorial policy

Same concept, more angles

2 more ways this is tested on MLS-C01

These questions test the same concept from different angles. Work through them to make sure you can recognise it however the exam phrases it.

Variation 1. A data scientist is training a neural network for a multi-class classification problem with 100 classes. The model uses a softmax output layer and cross-entropy loss. During training, the loss decreases steadily but the accuracy on the validation set plateaus early. Which of the following is the most likely cause?

hard
  • A.Batch size is too large
  • B.The model is overfitting the training data
  • C.Number of epochs is too small
  • D.Learning rate is too high

Why B: When the validation accuracy plateaus early while training loss continues to decrease, it indicates that the model is memorizing the training data rather than learning generalizable patterns. This is classic overfitting, where the softmax output layer produces high-confidence predictions for training samples but fails to generalize to unseen validation data, causing cross-entropy loss to drop on the training set while validation accuracy stagnates.

Variation 2. A data scientist is training a neural network for a multi-class classification problem. The model is overfitting. Which TWO of the following techniques can help reduce overfitting? (Choose two.)

medium
  • A.Increase the number of hidden layers.
  • B.Add dropout layers after the hidden layers.
  • C.Decrease the learning rate.
  • D.Add L2 regularization to the loss function.
  • E.Reduce the batch size.

Why B: Option B is correct because dropout layers randomly deactivate a fraction of neurons during training, which prevents the network from relying too heavily on any single neuron and forces it to learn more robust features. This reduces co-adaptation among neurons and is a standard regularization technique to combat overfitting in neural networks.

Keep practising

More MLS-C01 practice questions

Last reviewed: Jun 24, 2026

Question Discussion

Share a tip, memory trick, or ask about the reasoning behind this question. Do not post real exam questions, leaked content, braindumps, or copyrighted exam material. Comments are moderated and may be removed without notice.

Loading comments…

Sign in to join the discussion.

This MLS-C01 practice question is part of Courseiva's free Amazon Web Services certification practice question bank. Courseiva provides original exam-style practice questions with explanations, topic-based practice, mock exams, readiness tracking, and study analytics to help learners prepare for the MLS-C01 exam.