Neural Networks

Learn about neural networks, a popular model for both regression and classification.

Why neural networks?

Neural networks have become very popular in recent years. This is due to several factors:

  • Universal approximation theorem: For any continuous function on a compact domain, there exists a neural network that approximates it to arbitrary precision (a sketch of the statement follows this list).
  • Big Data: Neural networks are data-hungry; they need large amounts of data to estimate their parameters reliably. The choice of neural networks has become easier to justify with the availability of Big Data: larger, more complex, and rapidly growing amounts of information from new sources.
  • Software and hardware: More mature software packages (TensorFlow, PyTorch, MXNet), as well as specialized hardware for efficient and customized computation, make the choice of neural networks more practical.
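To make the first point concrete, a common form of the result (in the spirit of Cybenko's 1989 theorem for sigmoidal activations) says that any continuous function $f$ on a compact set can be approximated to arbitrary precision by a finite sum of simple units:

$$\hat{f}(\bold{x}) = \sum_{k=1}^{N} v_k\, \sigma\!\left(\bold{w}_k^T \bold{x} + b_k\right),$$

provided the number of units $N$ is allowed to be large enough. The theorem guarantees that such a network exists; it says nothing about how to find its parameters, which is what training is for.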

A neural network is a combination of several units, each of which is similar to a logistic regression unit. The difference, however, is that these units can have any non-linear function on top of $\bold{x}^T\bold{w}$ and aren’t restricted to the sigmoid. These units are known as neurons or perceptrons, and the non-linear functions are known as activation functions.

A typical logistic regression unit with 2 inputs
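As a minimal sketch of such a unit, the NumPy snippet below computes $\sigma(\bold{x}^T\bold{w} + b)$ for a two-input neuron and then swaps the sigmoid for a ReLU activation. The input, weight, and bias values are illustrative assumptions, not values from this lesson.

```python
import numpy as np

def sigmoid(z):
    # Logistic activation, as used in a logistic regression unit
    return 1.0 / (1.0 + np.exp(-z))

def relu(z):
    # A popular alternative activation function
    return np.maximum(0.0, z)

def neuron(x, w, b, activation=sigmoid):
    # A single unit: affine combination of the inputs followed by a non-linearity
    return activation(np.dot(x, w) + b)

x = np.array([0.5, -1.2])          # two inputs, as in the figure above
w = np.array([0.8, 0.3])           # illustrative weights
b = 0.1                            # bias term
print(neuron(x, w, b, sigmoid))    # logistic-regression-style unit
print(neuron(x, w, b, relu))       # same unit with a different activation
```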

What’s a neural network?

In a neural network, we combine different neurons, each having a possibly different set of parameters $\bold{w}$ and a possibly different activation function. The figure below is a typical example of a neural network. The first layer consists of inputs to the network and is known as the input layer. The input layer has labels $x_i$. Unlike other layers, the input layer doesn’t have neurons. So, no computation happens in the input layer other than copying the input to the layer. The last layer consists of the outputs and is known as the output layer. The output layer has labels $\hat{y}_j$. All layers between the input layer and the output layer are known as hidden layers. The hidden layers have labels of the form $a_k^l$, where $k$ is the neuron index in layer $l$. Both the hidden layer and the output layer are computational, that is, they consist of neurons that are computational units.

A typical neural network with an input layer having five features, an output layer having three outputs, and four hidden layers
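The sketch below builds a small network of this shape in NumPy, with five input features, four hidden layers, and three outputs, mirroring the figure above. The hidden-layer widths, random weights, and ReLU activation are assumptions for illustration only; the input layer does no computation, it simply supplies the vector consumed by the first hidden layer.

```python
import numpy as np

rng = np.random.default_rng(0)

def relu(z):
    return np.maximum(0.0, z)

# Layer sizes following the figure: 5 inputs, 4 hidden layers, 3 outputs.
# The hidden-layer widths (4 neurons each) are an illustrative assumption.
layer_sizes = [5, 4, 4, 4, 4, 3]

# One weight matrix and one bias vector per computational layer
# (hidden layers and the output layer); the input layer has none.
weights = [rng.normal(size=(n_in, n_out))
           for n_in, n_out in zip(layer_sizes[:-1], layer_sizes[1:])]
biases = [np.zeros(n_out) for n_out in layer_sizes[1:]]

def forward(x):
    # The input layer only copies the input; every later layer is computational.
    a = x
    for W, b in zip(weights, biases):
        # Each neuron sees the outputs of all neurons in the previous layer.
        a = relu(a @ W + b)
    return a  # the outputs of the output layer

x = rng.normal(size=5)   # five input features
print(forward(x))        # three outputs
```

In practice the output layer would typically use a task-specific activation (for example, softmax for classification), but a single non-linearity keeps the sketch short.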

Forward pass

In a neural network, the input to every neuron is a vector that consists of the outputs of all the neurons in the previous layer. Each neuron outputs a real number. If layer $l$ has $n_l$ ...