You have deployed a regression model that predicts house prices. Over the past month, the model's predictions have been consistently too high. You suspect data drift in the input features. Which monitoring metric should you prioritize to confirm this?
Trap 1: Monitor prediction drift (prediction distribution)
Prediction drift indicates output change but not which feature drifted.
Trap 2: Monitor feature attribution drift using SHAP values
Feature attribution drift measures impact on predictions, not input distribution.
Trap 3: Monitor residual distribution drift
Residuals measure error, not input drift.
- A
Monitor prediction drift (prediction distribution)
Why wrong: Prediction drift indicates output change but not which feature drifted.
- B
Monitor feature distribution drift using a divergence metric like Jensen-Shannon divergence
Feature drift measures input distribution change.
- C
Monitor feature attribution drift using SHAP values
Why wrong: Feature attribution drift measures impact on predictions, not input distribution.
- D
Monitor residual distribution drift
Why wrong: Residuals measure error, not input drift.