A data scientist wants to train a machine learning model to predict the exact market price of a house based on features such as square footage, number of bedrooms, and location. Which type of machine learning task should be used?
Trap 1: Classification
Classification predicts discrete categories or labels, not continuous numeric values like price.
Trap 2: Clustering
Clustering groups unlabeled data into clusters based on similarity; it does not predict a specific numeric value.
Trap 3: Anomaly Detection
Anomaly Detection identifies data points that differ significantly from the majority, not suitable for predicting a continuous value.
- A
Classification
Why wrong: Classification predicts discrete categories or labels, not continuous numeric values like price.
- B
Regression
Regression predicts a continuous numeric value, which is exactly what is needed for predicting house price.
- C
Clustering
Why wrong: Clustering groups unlabeled data into clusters based on similarity; it does not predict a specific numeric value.
- D
Anomaly Detection
Why wrong: Anomaly Detection identifies data points that differ significantly from the majority, not suitable for predicting a continuous value.