Ridge Regression

Understand the need for regularization in linear regression.

Chapter Goals:

  • Learn about regularization in linear regression
  • Learn about hyperparameter tuning using cross-validation
  • Implement a cross-validated ridge regression model in scikit-learn

While ordinary least squares regression is a good way to fit a linear model to a dataset, it works best when the dataset's features are independent, i.e. uncorrelated. When many of the features are linearly correlated, e.g. if a dataset has multiple features depicting the same price in different currencies, the least squares coefficients become highly sensitive to noise in the data.
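To see this sensitivity concretely, here is a minimal sketch (the data and feature names are made up for illustration) that fits ordinary least squares on two nearly identical "price" features. Across trials that differ only in tiny feature noise, the fitted coefficients swing wildly even though the underlying signal never changes:

```python
# Sketch: correlated features make least squares coefficients unstable.
# Assumes NumPy and scikit-learn are installed; data is synthetic.
import numpy as np
from sklearn.linear_model import LinearRegression

rng = np.random.default_rng(0)
n = 100
price_usd = rng.uniform(10, 100, size=n)
price_eur = price_usd * 0.9  # same price in another currency: perfectly correlated
y = 2.0 * price_usd + rng.normal(scale=1.0, size=n)  # true signal plus noise

for trial in range(3):
    feature_noise = rng.normal(scale=0.01, size=n)
    X = np.column_stack([price_usd, price_eur + feature_noise])  # nearly collinear
    coefs = LinearRegression().fit(X, y).coef_
    print(coefs)  # coefficients vary drastically from trial to trial
```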

Because real-life data tends to be noisy and often contains linearly correlated features, we combat this sensitivity by performing regularization. Recall that for ordinary least squares regression, the goal is to find the weights (coefficients) for the linear model that minimize the sum of squared residuals:

$$\sum_{i=1}^{n} (\mathbf{x}_i \cdot w - y_i)^2$$
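Ridge regression regularizes this objective by adding an L2 penalty on the weights, scaled by a hyperparameter $\alpha \geq 0$:

$$\sum_{i=1}^{n} (\mathbf{x}_i \cdot w - y_i)^2 + \alpha \|w\|_2^2$$

Larger values of $\alpha$ shrink the weights more aggressively toward zero, which stabilizes the model when features are correlated. Since the best $\alpha$ depends on the dataset, it is typically chosen by cross-validation.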

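As a sketch of the chapter's third goal, the snippet below uses scikit-learn's RidgeCV on synthetic, purely illustrative data; the candidate alpha values are assumptions chosen for demonstration:

```python
# Sketch: ridge regression with alpha chosen by cross-validation.
# Assumes NumPy and scikit-learn are installed; data is synthetic.
import numpy as np
from sklearn.linear_model import RidgeCV

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 3))
y = X @ np.array([1.5, -2.0, 0.5]) + rng.normal(scale=0.1, size=100)

# RidgeCV evaluates each candidate alpha with cross-validation
# (efficient leave-one-out by default) and keeps the best one.
model = RidgeCV(alphas=[0.1, 1.0, 10.0])
model.fit(X, y)
print(model.alpha_)  # the alpha selected by cross-validation
print(model.coef_)   # the fitted weights
```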