Evaluating Machine Learning Models

Learn how to evaluate, compare, and optimize machine learning models.

In this lesson, we’ll learn about different evaluation metrics for machine learning classification and regression problems. It’s crucial to choose the right metric for the desired outcome: each metric captures a different objective and must be selected to match the specific use case.

Evaluating a regression model

Regression models predict a continuous output variable. Some commonly used metrics for regression models are described below (their formulas follow the list):

  • Mean squared error (MSE): MSE measures the average squared difference between the predicted and actual values, where a lower value of MSE indicates a better fit.

  • Root-mean-square error (RMSE): RMSE is the square root of MSE, which puts the error back into the same units as the target variable. Like MSE, it measures the average deviation between predicted and actual values, and because the errors are squared, both MSE and RMSE are very sensitive to outliers.

  • Mean absolute error (MAE): MAE measures the average absolute difference between predicted and actual values. It is less sensitive to outliers than MSE.

  • R-squared (R²): R² measures the proportion of the variance in the dependent variable that is predictable from the independent variables. It typically ranges between 0 and 1 (it can be negative when a model fits worse than simply predicting the mean), and a higher value indicates a better-fitting model.

  • Mean absolute percentage error (MAPE): MAPE is an error metric based on percentages. It measures the average absolute percentage difference between predicted and actual values and is used for cases where percentage errors are more important than absolute errors.

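For reference, here are the standard formulas for these metrics, where y_i is the actual value, ŷ_i the predicted value, ȳ the mean of the actual values, and n the number of observations:

```latex
\begin{aligned}
\mathrm{MSE}  &= \frac{1}{n}\sum_{i=1}^{n}\bigl(y_i - \hat{y}_i\bigr)^2, &
\mathrm{RMSE} &= \sqrt{\mathrm{MSE}}, \\
\mathrm{MAE}  &= \frac{1}{n}\sum_{i=1}^{n}\bigl|y_i - \hat{y}_i\bigr|, &
R^2           &= 1 - \frac{\sum_{i=1}^{n}\bigl(y_i - \hat{y}_i\bigr)^2}{\sum_{i=1}^{n}\bigl(y_i - \bar{y}\bigr)^2}, \\
\mathrm{MAPE} &= \frac{100\%}{n}\sum_{i=1}^{n}\left|\frac{y_i - \hat{y}_i}{y_i}\right|.
\end{aligned}
```
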
Let’s say we are predicting the price of a house based on features such as location, square footage, and number of bedrooms and bathrooms. We can use the above metrics to evaluate the performance of our model (a code sketch follows this list). For instance:

  • MSE or RMSE would tell us how far the predicted prices are from the actual prices on average; because large errors are squared, a few badly mispredicted houses (outliers) would noticeably inflate these metrics.

  • MAE would show us the average deviation from the actual price.

  • R² would measure how well the model fits the data.

  • MAPE would calculate the average percentage deviation of predicted prices from actual prices.
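Here is a minimal sketch of computing all five metrics with scikit-learn. The dataset, model, and numbers are assumptions for illustration (synthetic features standing in for location, square footage, bedrooms, and bathrooms), not the lesson’s actual house-price data:

```python
# Compute MSE, RMSE, MAE, R², and MAPE for a simple regression model.
# The "house price" data here is synthetic and purely illustrative.
import numpy as np
from sklearn.datasets import make_regression
from sklearn.linear_model import LinearRegression
from sklearn.metrics import (
    mean_squared_error,
    mean_absolute_error,
    mean_absolute_percentage_error,
    r2_score,
)
from sklearn.model_selection import train_test_split

# Four synthetic features standing in for location, square footage,
# number of bedrooms, and number of bathrooms.
X, y = make_regression(n_samples=500, n_features=4, noise=20.0, random_state=42)
y = y + 300_000  # shift targets so they resemble prices and stay positive for MAPE

X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=42)

model = LinearRegression().fit(X_train, y_train)
y_pred = model.predict(X_test)

mse = mean_squared_error(y_test, y_pred)
rmse = np.sqrt(mse)                                     # same units as the target
mae = mean_absolute_error(y_test, y_pred)
r2 = r2_score(y_test, y_pred)
mape = mean_absolute_percentage_error(y_test, y_pred)   # returned as a fraction

print(f"MSE:  {mse:,.0f}")
print(f"RMSE: {rmse:,.0f}")
print(f"MAE:  {mae:,.0f}")
print(f"R²:   {r2:.3f}")
print(f"MAPE: {mape:.2%}")
```

Note that scikit-learn returns MAPE as a fraction (0.05 means 5%), so it is formatted as a percentage when printed.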

Evaluating a classification model

To evaluate a classification model’s performance, we need to build a confusion matrix. Let’s quickly learn more about that.

Confusion matrix

A confusion matrix tabulates the model’s predictions against the actual classes, showing how often the model correctly classifies each record in the dataset and how often it confuses one class for another.

Imagine we have a model that classifies whether a credit card user will default on their debt. The confusion matrix would show how many times the model correctly identified a defaulter and a nondefaulter, and how many times it made a mistake, labeling a nondefaulter as a defaulter or the reverse. Here’s how such a confusion matrix could be built for 27,500 credit statements.
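The following is a minimal sketch assuming scikit-learn; the labels and error rate are simulated for illustration, not taken from a real credit dataset:

```python
# Build a confusion matrix for a hypothetical default-prediction model.
import numpy as np
from sklearn.metrics import confusion_matrix

rng = np.random.default_rng(42)

# Hypothetical ground truth for 27,500 credit statements:
# 1 = defaulter, 0 = nondefaulter (roughly 20% defaulters).
y_true = rng.binomial(1, 0.2, size=27_500)

# Hypothetical model predictions: mostly correct, with about 10% of the
# labels flipped to simulate the mistakes a real classifier would make.
flip = rng.random(27_500) < 0.1
y_pred = np.where(flip, 1 - y_true, y_true)

# Rows are actual classes, columns are predicted classes:
# [[nondefaulters predicted as nondefaulters, nondefaulters predicted as defaulters],
#  [defaulters predicted as nondefaulters,    defaulters predicted as defaulters   ]]
print(confusion_matrix(y_true, y_pred))
```

Each cell counts one of the four possible outcomes: the diagonal cells are correct classifications, and the off-diagonal cells are the model’s mistakes.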
