Improving Sequential Models: Beam Search
Discover how to improve sequential model text generation using beam search. Learn to predict multiple steps ahead with LSTMs by evaluating several candidate sequences simultaneously. Understand the implementation details and the benefits of beam search over greedy sampling to produce more coherent and varied text.
We'll cover the following...
As we saw earlier, the generated text can be improved. Now let’s see if beam search might help to improve the performance. The standard way to predict from a language model is by predicting one step at a time and using the prediction from the previous time step as the new input. In beam search, we predict several steps ahead before picking an input.
This enables us to pick output sequences that may not look as attractive if taken individually but are better when considered as a sequence. The way beam search works is by, at a given time, predicting