Evaluating the performance of machine learning models is essential for determining their effectiveness and suitability for specific tasks.
The following are some commonly used evaluation metrics in machine learning:
Accuracy
Logarithmic loss
Confusion matrix
F1 score
Mean squared error
Mean absolute error
Root mean square error (RMSE)
Accuracy is a fundamental evaluation metric, reflecting the model's ability to classify instances correctly. It measures the ratio of correctly classified instances to the total number of instances.
The formula for calculating accuracy is:

\[ \text{Accuracy} = \frac{\text{Number of correct predictions}}{\text{Total number of predictions}} \]
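As a minimal sketch, accuracy can be computed in plain Python (the function name is illustrative):

```python
def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true labels."""
    correct = sum(1 for t, p in zip(y_true, y_pred) if t == p)
    return correct / len(y_true)

# Example: 3 of 4 predictions are correct.
print(accuracy([1, 0, 1, 1], [1, 0, 0, 1]))  # 0.75
```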
Logarithmic loss (log loss) is used for binary and multiclass classification problems. It penalizes confident but incorrect predicted probabilities, so a lower log loss value indicates better model performance.
For N samples belonging to M classes, the formula for calculating log loss is:

\[ \text{Log Loss} = -\frac{1}{N}\sum_{i=1}^{N}\sum_{j=1}^{M} y_{ij}\,\log(p_{ij}) \]

where \(y_{ij}\) is 1 if sample \(i\) belongs to class \(j\) (and 0 otherwise), and \(p_{ij}\) is the predicted probability that sample \(i\) belongs to class \(j\).

When \(M = 2\) (binary classification), this reduces to:

\[ \text{Log Loss} = -\frac{1}{N}\sum_{i=1}^{N}\left[\, y_i \log(p_i) + (1 - y_i)\log(1 - p_i) \,\right] \]

When \(M > 2\) (multiclass classification), the full double sum over all classes is used.
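A sketch of the binary case in plain Python, with probabilities clipped so that log(0) is never evaluated (the function name and the clipping constant are illustrative choices):

```python
import math

def binary_log_loss(y_true, p_pred, eps=1e-15):
    """Binary log loss; predicted probabilities are clipped to (eps, 1 - eps)."""
    total = 0.0
    for y, p in zip(y_true, p_pred):
        p = min(max(p, eps), 1 - eps)  # avoid log(0)
        total += y * math.log(p) + (1 - y) * math.log(1 - p)
    return -total / len(y_true)

# Confident, correct predictions give a small loss.
print(binary_log_loss([1, 0], [0.9, 0.1]))  # ≈ 0.1054
```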
A confusion matrix is a tabular representation that provides a detailed breakdown of a machine learning model's performance in a classification task. It summarizes the following four outcomes of the model's predictions:
True positive (TP): The model correctly predicted instances as positive when they were actually positive.
True negative (TN): The model correctly predicted instances as negative when they were actually negative.
False positive (FP): The model incorrectly predicted instances as positive when they were actually negative.
False negative (FN): The model incorrectly predicted instances as negative when they were actually positive.
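These four counts can be tallied directly from paired label lists; the sketch below assumes binary labels where 1 is the positive class (the function name is illustrative):

```python
def confusion_counts(y_true, y_pred):
    """Return (TP, TN, FP, FN) for binary labels, treating 1 as positive."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    return tp, tn, fp, fn

# One of each outcome:
print(confusion_counts([1, 1, 0, 0], [1, 0, 1, 0]))  # (1, 1, 1, 1)
```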
The F1 score is a harmonic mean of precision and recall, providing a balanced measure of a model's performance. The F1 score ranges from 0 to 1, with a higher value indicating better model performance.
The formula for calculating the F1 score is as follows:

\[ F_1 = 2 \times \frac{\text{Precision} \times \text{Recall}}{\text{Precision} + \text{Recall}} \]

where \(\text{Precision} = \frac{TP}{TP + FP}\) and \(\text{Recall} = \frac{TP}{TP + FN}\).
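A minimal sketch computing the F1 score from the confusion-matrix counts (the function name is illustrative; it assumes at least one positive prediction and one positive instance, so the denominators are nonzero):

```python
def f1_score(tp, fp, fn):
    """Harmonic mean of precision and recall, from confusion-matrix counts."""
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)

# precision = 8/10 = 0.8, recall = 8/10 = 0.8, so F1 = 0.8.
print(f1_score(tp=8, fp=2, fn=2))  # 0.8
```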
The mean squared error (MSE) is a commonly used evaluation metric in regression tasks. It measures the average squared difference between the predicted and actual values. A lower MSE value indicates better model performance, with zero representing a perfect match between predicted and actual values.
The formula for mean squared error is:

\[ \text{MSE} = \frac{1}{n}\sum_{i=1}^{n} (y_i - \hat{y}_i)^2 \]

where \(y_i\) is the actual value, \(\hat{y}_i\) is the predicted value, and \(n\) is the number of samples.
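A one-line sketch in plain Python (the function name is illustrative):

```python
def mse(y_true, y_pred):
    """Average squared difference between actual and predicted values."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)

# Errors of 1 and -1: (1 + 1) / 2 = 1.0.
print(mse([3, 5], [2, 6]))  # 1.0
```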
Mean absolute error (MAE) is the average of the absolute differences between the predicted and actual values. It provides a straightforward measure of the model's performance regarding the magnitude of errors.
The formula for mean absolute error is:

\[ \text{MAE} = \frac{1}{n}\sum_{i=1}^{n} \lvert y_i - \hat{y}_i \rvert \]
MAE is widely used in various regression applications and can help assess and compare the accuracy of different models, aiding in model selection and performance evaluation.
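A corresponding sketch in plain Python (the function name is illustrative); note that, unlike MSE, each error contributes in proportion to its magnitude rather than its square, so MAE is less sensitive to outliers:

```python
def mae(y_true, y_pred):
    """Average absolute difference between actual and predicted values."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)

# Absolute errors of 1, 1, and 3: 5 / 3 ≈ 1.667.
print(mae([3, 5, 7], [2, 6, 10]))
```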
Root mean square error (RMSE) is an evaluation metric commonly used in regression tasks to measure the average magnitude of errors between predicted and actual values. It measures how well the model's predictions match the true values.
The formula for root mean square error is:

\[ \text{RMSE} = \sqrt{\frac{1}{n}\sum_{i=1}^{n} (y_i - \hat{y}_i)^2} \]
RMSE is similar to mean squared error (MSE), but the key difference is that RMSE is expressed in the same units as the original data. By taking the square root, RMSE brings the metric back to the scale of the dependent variable, making it easier to interpret and compare with the actual values.
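A sketch in plain Python showing that RMSE is simply the square root of MSE (the function name is illustrative):

```python
import math

def rmse(y_true, y_pred):
    """Square root of the mean squared error."""
    n = len(y_true)
    return math.sqrt(sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / n)

# Squared errors 1 and 9: mean is 5, so RMSE = sqrt(5) ≈ 2.236.
print(rmse([2, 4], [1, 7]))
```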
Evaluating the performance of machine learning models is essential for determining their effectiveness and suitability for specific tasks. By understanding these metrics and their appropriate applications, data scientists can make informed decisions and optimize their models for better performance.
Which evaluation metric measures the ratio of correctly classified instances to the total number of instances?
Logarithmic loss
Accuracy
F1 score