The Limitations of Linear Regression

Understand the limitations of basic linear regression in machine learning, including difficulties with parameter tweaking and precision. Learn why simple methods fail at scale and how gradient descent offers a faster, more accurate solution for optimizing multiple parameters simultaneously.

In the previous chapter, we wrote a piece of code that learns. However, if computer scientists reviewed that code, they would find it lacking. In particular, they would object to the train() function. According to a stern computer scientist, this code might work for our simple example, but it would not scale to real-world problems.

In this chapter, we’ll address those concerns in two ways.

First, we won’t get our code reviewed by a computer scientist.
Second, we’ll analyze the shortcomings of the current train() implementation and solve them with one of machine learning’s key ideas, an algorithm called gradient descent.

Like our current train() code, gradient descent is a way to find the minimum of the loss function, but it’s faster, more precise, and more general than the code from the previous chapter. Gradient descent is not just useful for our tiny program. In fact, we cannot go very far in ML without gradient descent. In different forms, this algorithm will accompany us to the end of this chapter. Let’s start with the problem that gradient descent is meant to solve.

Our algorithm

Our program can successfully forecast pizza sales, but why stop there? Maybe we could use the same code to forecast other things, such as the stock market.

However, if we try to apply our linear regression program to a different problem, we will bump into an obstacle. Our code is based on a simple line-shaped model with two parameters: the weight $w$ and the bias $b$ ...
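To make the line-shaped model concrete, here is a minimal sketch of what a model with a weight and a bias might look like in Python. The function names predict() and loss(), the use of NumPy, the mean squared error, and the sample data are all assumptions made for illustration; they are not necessarily the code from the previous chapter.

```python
import numpy as np

def predict(X, w, b):
    # Line-shaped model: the weight w sets the slope, the bias b shifts the line.
    return X * w + b

def loss(X, Y, w, b):
    # Assumed loss: mean squared error between predictions and labels.
    return np.average((predict(X, w, b) - Y) ** 2)

# Hypothetical data that lies exactly on the line y = 2x + 1.
X = np.array([1.0, 2.0, 3.0])
Y = np.array([3.0, 5.0, 7.0])

print(loss(X, Y, w=2.0, b=1.0))  # the matching line fits this data perfectly
print(loss(X, Y, w=0.0, b=0.0))  # a poor guess gives a larger loss
```

With only $w$ and $b$ to adjust, the whole model is described by two numbers, and that is the constraint this chapter examines.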
