A data scientist is developing a classification model to detect fraudulent transactions. The dataset is split into training and test sets. The data scientist repeatedly tunes the model's hyperparameters and evaluates performance on the test set until the test accuracy reaches 95%. However, when the model is deployed on new, unseen data, its accuracy drops to 70%. Which concept best explains this performance degradation?
Answer choices
Why each option matters
Good practice is not just finding the correct option. The wrong answers often show the exact trap the exam wants you to fall into.
Distractor review
Overfitting to the training data
Overfitting to the training data would produce high training accuracy but low test accuracy. In this scenario, however, test accuracy was high (95%); the tuning loop used the test set directly, so the drop only appears on genuinely unseen data.
Distractor review
Data leakage from the training set to the test set
Data leakage typically occurs when information from outside the training set influences the model during training, e.g., future data used to predict the past. This scenario does not describe such leakage.
Best answer
Overfitting to the test set
Correct. The model's hyperparameters were tuned based on test set performance, causing the model to perform well on that specific test set but poorly on new data. This is overfitting to the test set.
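The anti-pattern the scenario describes can be sketched in a few lines. This is an illustrative sketch only; the question names no tooling, so scikit-learn, the synthetic dataset and the decision-tree model are all assumptions made for the example:

```python
# Sketch of the anti-pattern: hyperparameters are chosen by repeatedly
# scoring on the TEST set, so the reported test accuracy is an optimistic,
# biased estimate rather than an honest measure of generalisation.
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=600, n_features=20, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0
)

best_depth, best_score = None, 0.0
for depth in range(1, 15):                      # the "tuning loop"
    model = DecisionTreeClassifier(max_depth=depth, random_state=0)
    model.fit(X_train, y_train)
    score = model.score(X_test, y_test)         # evaluated on the TEST set
    if score > best_score:
        best_depth, best_score = depth, score   # test set now drives selection

print(best_depth, round(best_score, 3))         # this accuracy is inflated
```

Because the test set participated in model selection, it can no longer provide an unbiased estimate of performance on new data, which is exactly why the deployed accuracy drops.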
Distractor review
Underfitting the training data
Underfitting would result in poor performance on both training and test sets, which is not the case here as the model achieved 95% test accuracy.
Common exam trap
Common exam trap: answer the scenario, not the keyword
Many certification questions include familiar terms but test a specific constraint. Read the exact wording before choosing an answer that is generally true but wrong for this case.
Technical deep dive
How to think about this question
This question should be treated as a scenario, not a definition check. Identify the problem, the constraint and the best action. Then compare each option against those facts.
Key Concepts to Remember
- Read the scenario before looking for a memorised answer.
- Find the constraint that changes the correct option.
- Eliminate answers that are true in general but not in this case.
- Use explanations to understand the rule behind the answer.
Exam Day Tips
- Underline the problem statement mentally.
- Watch for words such as best, first, most likely and least administrative effort.
- Review why wrong options are wrong, not only why the correct option is correct.
Related practice questions
Related AI-900 practice-question pages
Use these pages to review the topic behind this question. This is how one missed question becomes focused revision.
More questions from this exam
Keep practising from the same exam bank, or move into a focused topic page if this question exposed a weak area.
Question 1
A developer wants to build a virtual assistant that can understand user intents such as 'Book a flight' or 'Check weather' and extract relevant entities like destination and date. The developer has a small set of labeled example utterances. Which Azure AI Language feature should the developer use?
Question 2
A developer is building a customer support chatbot using Azure OpenAI. The chatbot should never reveal its system instructions or internal configuration. The developer wants to add a rule at the beginning of the conversation to prevent prompt injection attacks. Which technique should they use?
Question 3
A developer is using Azure OpenAI Service to generate product descriptions from technical specifications. The generated descriptions sometimes include plausible-sounding but incorrect details (hallucinations). The developer wants to ensure the model's responses are strictly based on the provided product data and do not add any external or invented information. Which approach should the developer use?
Question 4
A developer is using Azure OpenAI with GPT-4 to build a chatbot that answers legal questions based on a company's internal policy documents. The developer wants the model's responses to be maximally deterministic and factual, avoiding any creative or speculative language. Which parameter should the developer set to the lowest possible value in the API call?
Question 5
A developer is using Azure OpenAI to generate creative product descriptions. The outputs are often repetitive and lack variety. The developer wants to increase the diversity of the generated text while still keeping it coherent. Which parameter should the developer increase?
Question 6
A developer is using Azure OpenAI Service to generate product descriptions. They want the output to be highly focused and deterministic, with less randomness. Which parameter should they decrease?
FAQ
Questions learners often ask
What does this AI-900 question test?
It tests whether you can distinguish overfitting to the test set from overfitting to the training data, underfitting and data leakage, based on where in the workflow the test set was used.
What is the correct answer to this question?
The correct answer is 'Overfitting to the test set'. This scenario describes overfitting to the test set (also known as test-set leakage or 'data snooping'). By repeatedly evaluating on the test set and adjusting hyperparameters to maximise test accuracy, the data scientist allowed the test set to influence model selection. The model therefore became overly specialised to that particular test set and failed to generalise to new data. The proper practice is to tune on a separate validation set and use the test set only for the final evaluation.
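The recommended workflow, a three-way split with the test set touched exactly once, can be sketched as follows. As above, scikit-learn and the synthetic data are assumptions for illustration; the principle is library-agnostic:

```python
# Minimal sketch of the recommended workflow: tune hyperparameters against a
# VALIDATION split, then report accuracy on a test set used once, at the end.
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=600, n_features=20, random_state=0)

# First carve off the test set, then split the remainder into train/validation.
X_tmp, X_test, y_tmp, y_test = train_test_split(X, y, test_size=0.2, random_state=0)
X_train, X_val, y_train, y_val = train_test_split(
    X_tmp, y_tmp, test_size=0.25, random_state=0
)

best_depth, best_val = None, 0.0
for depth in range(1, 15):
    model = DecisionTreeClassifier(max_depth=depth, random_state=0)
    model.fit(X_train, y_train)
    score = model.score(X_val, y_val)            # tune on VALIDATION only
    if score > best_val:
        best_depth, best_val = depth, score

# Final, unbiased evaluation: the test set influences nothing upstream.
final = DecisionTreeClassifier(max_depth=best_depth, random_state=0)
final.fit(X_train, y_train)
test_accuracy = final.score(X_test, y_test)
print(round(test_accuracy, 3))
```

Cross-validation over the training data is a common alternative to a single validation split when data is scarce; either way, the test set stays out of the tuning loop.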
What should I do if I get this AI-900 question wrong?
Then try more questions from the same exam bank and focus on understanding why the wrong options are tempting.