...

/

Introduction: Understanding Long Short-Term Memory Networks

Introduction: Understanding Long Short-Term Memory Networks

Get an overview of long short-term memory networks.

How long short-term memory networks work

In this chapter, we’ll discuss the fundamentals behind a more advanced RNN variant known as long short-term memory networks (LSTMs). Here, we’ll focus on understanding the theory behind LSTMs so we can discuss their implementation later. LSTMs are widely used in many sequential tasks (including stock market prediction, language modeling, and machine translation) and have proven to perform better than older sequential models (for example, standard RNNs), especially given the availability of large amounts of data. LSTMs are designed to avoid the problem of the vanishing gradient. ...