Ridge and Lasso Regression
Learn about ridge and lasso regression, their comparison, and the importance of their contours’ intersection with MSE.
Without regularization, we only have to optimize for training loss, but with regularization, we need to optimize for both training loss (data) and the regularization function (smoothness).
Ridge regression
When the model is linear in parameters, the loss function is squared loss, and the regularization function is L2 norm, then the regression problem is known as ridge regression.
Linear model
Let be a training data set for regression with a single real target, that is, . When the number of features is , that is, ...