
Solution Review: Backpropagation - 3-Layered Neural Network

Explore the backpropagation process in a three-layer neural network used for letter classification. Understand how errors are propagated backward through each layer to compute gradients for updating weights and biases, enhancing model training and accuracy.


Solution

Python 3.5
import numpy as np

def sigmoid(z):
    """Compute sigmoid values for each set of scores in z"""
    return 1 / (1 + np.exp(-z))

def softmax(x):
    """Compute row-wise softmax values for each set of scores in x"""
    return np.exp(x) / np.sum(np.exp(x), axis=1, keepdims=True)
def forward_propagation(x, w1, w2, w3, b1, b2, b3):
    """
    Computes the forward propagation operation for the 3-layered
    neural network and returns the output at the 2 hidden layers
    and the output layer
    """
    net_h1 = np.dot(x, w1) + b1       # net input of the first hidden layer
    out_h1 = sigmoid(net_h1)          # applying the sigmoid activation to the first hidden layer
    net_h2 = np.dot(out_h1, w2) + b2  # net input of the second hidden layer
    out_h2 = sigmoid(net_h2)          # applying the sigmoid activation to the second hidden layer
    net_y = np.dot(out_h2, w3) + b3   # net input of the output layer
    out_y = softmax(net_y)            # applying the softmax activation to the output layer
    return out_h1, out_h2, out_y
def backpropagation(y, out_y, out_h2, out_h1, w3, w2, x):
    """
    Computes the backpropagation operation for the
    3-layered neural network and returns the gradients
    of weights and biases
    """
    # Backpropagating the error from the output layer to the second hidden layer
    l3_error = out_y - y                           # error at layer 3
    dW3 = np.dot(out_h2.T, l3_error)               # gradient of the weights at layer 3
    db3 = np.sum(l3_error, axis=0, keepdims=True)  # gradient of the bias at layer 3

    # Backpropagating the error from the second hidden layer to the first hidden layer
    dh2 = np.dot(l3_error, w3.T)                   # error propagated back through w3
    l2_error = dh2 * out_h2 * (1 - out_h2)         # error at layer 2 (sigmoid derivative)
    dW2 = np.dot(out_h1.T, l2_error)               # gradient of the weights at layer 2
    db2 = np.sum(l2_error, axis=0, keepdims=True)  # gradient of the bias at layer 2

    # Backpropagating the error from the first hidden layer to the input layer
    dh1 = np.dot(l2_error, w2.T)                   # error propagated back through w2
    l1_error = dh1 * out_h1 * (1 - out_h1)         # error at layer 1 (sigmoid derivative)
    dW1 = np.dot(x.T, l1_error)                    # gradient of the weights at layer 1
    db1 = np.sum(l1_error, axis=0, keepdims=True)  # gradient of the bias at layer 1
    return dW1, dW2, dW3, db1, db2, db3
# Creating the data set
# A
a = [0, 0, 1, 1, 0, 0,
     0, 1, 0, 0, 1, 0,
     1, 1, 1, 1, 1, 1,
     1, 0, 0, 0, 0, 1,
     1, 0, 0, 0, 0, 1]
# B
b = [0, 1, 1, 1, 1, 0,
     0, 1, 0, 0, 1, 0,
     0, 1, 1, 1, 1, 0,
     0, 1, 0, 0, 1, 0,
     0, 1, 1, 1, 1, 0]
# C
c = [0, 1, 1, 1, 1, 0,
     0, 1, 0, 0, 0, 0,
     0, 1, 0, 0, 0, 0,
     0, 1, 0, 0, 0, 0,
     0, 1, 1, 1, 1, 0]
# Creating the labels
y = [[1, 0, 0],
     [0, 1, 0],
     [0, 0, 1]]
# Converting the data and labels into NumPy arrays
x = np.array([a, b, c])
y = np.array(y)
np.random.seed(42) # fixing the seed so the same random values are generated on every run
n_x = 30 # number of nodes in the input layer
n_h1 = 5 # number of nodes in the first hidden layer
n_h2 = 4 # number of nodes in the second hidden layer
n_y = 3 # number of nodes in the output layer
w1 = np.random.randn(n_x, n_h1) # weights of the first hidden layer
w2 = np.random.randn(n_h1, n_h2) # weights of the second hidden layer
w3 = np.random.randn(n_h2, n_y) # weights of the output layer
b1 = np.zeros((1, n_h1)) # bias of the first hidden layer
b2 = np.zeros((1, n_h2)) # bias of the second hidden layer
b3 = np.zeros((1, n_y)) # bias of the output layer
out_h1, out_h2, out_y = forward_propagation(x, w1, w2, w3, b1, b2, b3)
dW1, dW2, dW3, db1, db2, db3 = backpropagation(y, out_y, out_h2, out_h1, w3, w2, x)
print("dW1:\n",dW1, "\ndw2:\n", dW2, "\ndW3:\n", dW3, "\ndb1:\n", db1, "\ndb2:\n", db2, "\ndb3:\n", db3)

Explanation

πŸ“ Note: The dimensions of the parameters after a particular operation are specified in the parenthesis for a better understanding of the batch update operation.

backpropagation function:

The function takes as parameters the target output y, the predicted output out_y, the hidden layer outputs out_h1 and out_h2, the weights w2 and w3, and the input x.

Backpropagating the error from the output layer to the second hidden layer:

  • It computes the error at the output layer: with a softmax output and a cross-entropy loss, this error is simply the difference between the predicted output out_y and the target output y (a numerical check of this simplification appears after this list).
    l3_error(3, 3) = out_y(3, 3) - y(3, 3)
    
    
  • It calculates the gradients of the weights and bias at layer 3 from this error:
    dW3(4, 3) = out_h2.T(4, 3) · l3_error(3, 3)
    db3(1, 3) = sum of l3_error(3, 3) over the batch axis
...
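
As the quick check promised above (a sketch under stated assumptions: eps, the cross_entropy helper, and the tolerance are all illustrative, not part of the lesson), the simplified error out_y - y can be compared against a finite-difference estimate of the derivative of the cross-entropy loss with respect to the output layer's net input net_y:

# Finite-difference check that d(cross-entropy)/d(net_y) equals out_y - y.
# eps, cross_entropy, and the tolerance are assumptions for this check only.
eps = 1e-6
net_y = np.dot(out_h2, w3) + b3

def cross_entropy(net):
    p = softmax(net)
    return -np.sum(y * np.log(p))

numeric = np.zeros_like(net_y)
for i in range(net_y.shape[0]):
    for j in range(net_y.shape[1]):
        bumped = net_y.copy()
        bumped[i, j] += eps
        numeric[i, j] = (cross_entropy(bumped) - cross_entropy(net_y)) / eps

analytic = softmax(net_y) - y   # the l3_error used in the code above
print(np.allclose(numeric, analytic, atol=1e-4))   # expected: True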