The Error Slope between the Input and Hidden Layers

Learn how to find the error slope for the weights between the input and hidden layers.

The error slope between the input and hidden layers

That expression we just looked at is for refining the weights between the hidden and output layers. Now we need to finish the job and find a similar error slope for the weights between the input and hidden layers.

We could do loads of algebra again, but we don’t have to. We can simply reuse the physical interpretation we just worked through and rebuild an expression for the new set of weights we’re interested in:

  • The first part, which was the $(\text{target} - \text{actual})$ error, now becomes the recombined backpropagated error out of the hidden nodes, just like we saw before. Let’s call that $e_j$ (see the sketch after this list).

  • The sigmoid parts can stay the same, but the sum expressions inside refer to the preceding layer. So the sum is over all the inputs, moderated by the weights, into a hidden node $j$. We can call this $i_j$.

  • The last part is now the output of the first layer of nodes $o_i$, which happen to be the input signals.
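
For the first bullet, this is roughly how the recombined backpropagated error might be computed with numpy; the layer sizes, weight values, and variable names here are illustrative assumptions, not fixed by the text:

```python
import numpy as np

# Illustrative 3-hidden, 3-output slice of a network; all names and values are assumptions
np.random.seed(0)
w_hidden_output = np.random.normal(0.0, 0.5, (3, 3))   # weights from hidden to output layer

# (target - actual) errors at the output layer, as a column vector
output_errors = np.array([[0.8], [-0.1], [0.3]])

# e_j: the output errors split by weight and recombined at each hidden node
hidden_errors = np.dot(w_hidden_output.T, output_errors)
print(hidden_errors)   # one backpropagated error value per hidden node
```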

This nifty way of avoiding work is simply taking advantage of the symmetry in the problem to construct a new expression. We say it’s simple, but it is a very powerful technique, wielded by some of the smartest mathematicians and scientists.

So, the second part of the final answer we’ve been striving towards is the slope of the error function for the weights between the input and hidden layers:

\frac{\partial E}{\partial w_{ij}} = -(e_j)\cdot \text{sigmoid}\left(\sum_i w_{ij}\cdot o_i \right) \left(1-\text{sigmoid}\left(\sum_i w_{ij} \cdot o_i\right)\right)\cdot o_i
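
To make this slope concrete, here is a minimal numpy sketch that evaluates it for one small layer and then takes a gradient-descent step; the variable names, layer sizes, and example numbers are illustrative assumptions rather than part of the text:

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

# Illustrative values (column vectors); the names are assumptions
inputs = np.array([[0.9], [0.1], [0.8]])                  # o_i: outputs of the input layer
w_input_hidden = np.random.normal(0.0, 0.5, (3, 3))       # w_ij: weights into the hidden layer
hidden_errors = np.array([[0.4], [-0.2], [0.5]])          # e_j: backpropagated as in the sketch above

# sigmoid(sum_i w_ij * o_i): the hidden-layer outputs
hidden_outputs = sigmoid(np.dot(w_input_hidden, inputs))

# dE/dw_ij = -(e_j) * sigmoid(...) * (1 - sigmoid(...)) * o_i, as a matrix of slopes
error_slope = -np.dot(hidden_errors * hidden_outputs * (1.0 - hidden_outputs),
                      np.transpose(inputs))

# Gradient descent: step against the slope, scaled by a learning rate
learning_rate = 0.3
w_input_hidden -= learning_rate * error_slope
```

Because the slope already carries the minus sign, subtracting it (scaled by the learning rate) moves each weight downhill along the error surface.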
