A retail company uses Einstein Prediction Service to forecast customer churn. To improve model accuracy, which data preparation step is most critical?
Trap 1: Select only the top three features based on correlation.
Feature selection is secondary to data quality.
Trap 2: Use a different algorithm like neural networks.
Algorithm choice is less impactful than data quality.
Trap 3: Increase the dataset size by collecting more customer records.
More data without quality can degrade performance.
- A
Select only the top three features based on correlation.
Why wrong: Feature selection is secondary to data quality.
- B
Clean the dataset by handling missing values and outliers.
Proper data cleaning ensures the model learns accurate patterns.
- C
Use a different algorithm like neural networks.
Why wrong: Algorithm choice is less impactful than data quality.
- D
Increase the dataset size by collecting more customer records.
Why wrong: More data without quality can degrade performance.