- One of the key aspects of supervised machine learning is model evaluation and validation.
- Two popular measures of evaluation
Coefficient of Determination
- Measures the proportion of variance in the target variable that is explained by the model
- The best possible score for is 1.0 (when )
- Lower values are worse
- is 0.0 when
Mean Squared Error ()
- Measures average squared difference between actual and predicted values
- Lower MSE = better predictions
IMPORTANT
It’s an absolute error metric → the smaller, the better
$$
MSE = \frac{1}{n} \sum (y - \hat y)^2
$$
- Where:
- the term is the total number of observations in the dataset