A machine learning engineer notices that a fraud detection model's false positive rate has increased significantly over the past week. The model was retrained two weeks ago with new data. Which attack is MOST likely responsible?
Poisoned data during retraining can cause the model to misclassify legitimate transactions as fraud, raising false positives.
Why this answer
Data poisoning corrupts training data, causing the model to learn incorrect patterns. The retraining with new data introduces the poisoned samples, degrading performance. Adversarial examples are at inference time, model inversion reconstructs data, and prompt injection targets LLMs.