A team is exploring a dataset with missing values in multiple columns. They want to decide whether to drop rows or impute values. Which approach is most appropriate for exploratory data analysis?
Understanding the missing data pattern is crucial before deciding on imputation or deletion.
Why this answer
Option A is correct because during EDA, it is important to first understand the pattern and extent of missing data before deciding on treatment. Option B is wrong because dropping rows without analysis may discard valuable data. Option C is wrong because imputing without understanding the missing mechanism may introduce bias.
Option D is wrong because EDA does not require using a specific AWS service.