What is the exploding gradient problem?

Neural networks are used in many applications, such as computer vision, natural language processing, and speech processing, and their performance has improved considerably over the past decade. However, a network performs well on the test dataset only if it is trained properly, and many beginners run into problems during training.

In gradient-based learning algorithms, we use gradients of the loss to learn the weights of a neural network. During backpropagation, the chain rule multiplies the local gradients of the layers closer to the output with those of the layers closer to the input, like a chain reaction. The resulting gradients are then used to update the weights.

If the individual gradients are large, their product grows enormously as it is propagated backward through the layers. The weight updates then become huge, the model is unable to learn, and its behavior becomes unstable. This problem is called the exploding gradient problem.
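To make the chain-rule picture concrete, here is a rough sketch for an $n$-layer network, writing $L$ for the loss and $a_i$ for the output of layer $i$ (these symbols are used only for illustration). The gradient with respect to an early weight $w_1$ is a product of many per-layer factors:

$\frac{\partial L}{\partial w_1} = \frac{\partial L}{\partial a_n} \cdot \frac{\partial a_n}{\partial a_{n-1}} \cdots \frac{\partial a_2}{\partial a_1} \cdot \frac{\partial a_1}{\partial w_1}$

If many of these factors are greater than 1, the product grows roughly exponentially with the number of layers.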

Example

Let's consider an overly simplified example. Suppose we have a 20-layer neural network with only one neuron in each layer.

20-Layer Neural Network

Here, $x$ is our 1-dimensional input to the neural network. It generates $\hat{y}$ as the predicted output. The weights are represented by $w_i$. Each layer uses ReLU as its activation function, and $a_i$ denotes the output of layer $i$.

The output of the first neuron is calculated as follows:

$a_1 = \text{ReLU}(w_1 \cdot x)$

This output will be fed as an input to the neuron in the second layer, so that $a_2 = \text{ReLU}(w_2 \cdot a_1)$, and in general $a_i = \text{ReLU}(w_i \cdot a_{i-1})$, and so on.

As we can observe, the outputs of the neurons are chained together. If we tried to express the final output, or its gradient with respect to the very first weight, the expression would be a long nested chain that is tedious to expand and hard to keep track of.

When the gradient with respect to a weight is large, the corresponding change in that weight is also large. For the sake of understanding, let's say that all the weights have the same value ($2.34$) and the input value is $1$. Since the input and weights are positive, ReLU passes every value through unchanged, and the resulting output is:

$\hat{y} = (2.34)^{20} \times 1 \approx 2.4 \times 10^7$

This is a huge value. Imagine how this would scale for a 50-layer network.
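A minimal sketch of this calculation in Python, assuming the same toy setup (20 layers, one neuron per layer, every weight equal to 2.34, input 1), shows how quickly the values blow up:

```python
import numpy as np

def relu(z):
    return np.maximum(0.0, z)

# Toy setup from the example above (illustrative values, not a real model).
num_layers = 20
w = 2.34   # every weight set to the same value
a = 1.0    # the 1-dimensional input x

for i in range(1, num_layers + 1):
    a = relu(w * a)   # a_i = ReLU(w_i * a_{i-1})
    print(f"layer {i:2d}: activation = {a:.3e}")

# The final activation is roughly 2.4e7; with weights greater than 1, both
# activations and gradients grow exponentially with the number of layers.
```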

How to detect an exploding gradient

There are some key signs that can help identify whether or not the gradients are exploding. These are as follows:

  • The model is not performing well on the training data.

  • There are large changes in the learning loss (unstable learning).

  • The loss becomes NaN during training (see the monitoring sketch after this list).
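As an illustration of the last two signs, a common way to monitor for exploding gradients is to log the global gradient norm and check for NaN loss values after each training step. Below is a minimal PyTorch-style sketch; the `model` and `loss` objects and the `gradient_diagnostics` helper are hypothetical names used for illustration, and the threshold is arbitrary:

```python
import math
import torch

def gradient_diagnostics(model, loss):
    """Return the global gradient norm and whether the loss is NaN.

    Assumes loss.backward() has already been called so .grad is populated.
    """
    total_sq = 0.0
    for p in model.parameters():
        if p.grad is not None:
            total_sq += p.grad.detach().norm(2).item() ** 2
    grad_norm = math.sqrt(total_sq)
    loss_is_nan = bool(torch.isnan(loss).item())
    return grad_norm, loss_is_nan

# Inside a training loop (model, loss, optimizer are placeholders):
#   loss.backward()
#   grad_norm, loss_is_nan = gradient_diagnostics(model, loss)
#   if loss_is_nan or grad_norm > 1e3:   # threshold chosen for illustration
#       print(f"Warning: possible exploding gradients (norm={grad_norm:.2e})")
```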

Solutions

Some of the suggested solutions to tackle the exploding gradient problem are given below:

  • Use batch normalization: Batch normalization allows us to normalize the output of a layer into a specific range, usually between -1 and 1.

  • Use fewer layers

  • Carefully initialize weights

  • Use gradient clipping (see the sketch after this list)
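As a sketch of the last point, gradient clipping rescales the gradients whenever their combined norm exceeds a chosen threshold, so weight updates stay bounded. The example below uses PyTorch's `torch.nn.utils.clip_grad_norm_`; the `model`, `optimizer`, `loss_fn`, and `max_norm` values are placeholders for illustration:

```python
import torch

def training_step(model, optimizer, loss_fn, x, y, max_norm=1.0):
    """One training step with gradient clipping by global norm."""
    optimizer.zero_grad()
    loss = loss_fn(model(x), y)
    loss.backward()

    # Rescale all gradients so their combined norm does not exceed max_norm.
    torch.nn.utils.clip_grad_norm_(model.parameters(), max_norm=max_norm)

    optimizer.step()
    return loss.item()
```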

