Building a Better Optimiser
Learn about the optimization procedure and how to apply it to neural networks.
The optimization procedure is used to minimize the error function (in examples like the ones we have discussed so far), effectively “learning” the parameters of the network by selecting those that yield the lowest error. Referring to our discussion of backpropagation, this problem has two components:
How to initialize the weights: Historically, many applications simply drew random weights within some range and relied on backpropagation to reach at least a local minimum of the loss function from that random starting point.
How to find a local minimum of the loss: In basic backpropagation, we used gradient descent with a fixed learning rate and first-derivative updates to traverse the potential solution space of weight matrices; however, there is good reason to believe there might be more efficient ways to find a local minimum.
In fact, both of these have turned out to be key considerations in the progress of deep learning research; the sketch below shows the baseline approach they both start from.
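As a rough illustration (not code from this lesson), the following sketch combines the two ingredients above: random weight initialization and fixed-learning-rate, first-derivative gradient descent. The toy regression data, layer size, and variable names are illustrative assumptions only.

```python
# A minimal sketch: random initialization + fixed-learning-rate gradient descent.
import numpy as np

rng = np.random.default_rng(0)

# 1. Initialize the weights randomly within a small range.
W = rng.uniform(-0.5, 0.5, size=(3, 1))   # 3 inputs -> 1 output
b = np.zeros(1)

# Toy regression data (illustrative only).
X = rng.normal(size=(100, 3))
y = X @ np.array([[1.0], [-2.0], [0.5]]) + 0.1 * rng.normal(size=(100, 1))

# 2. Traverse the weight space using a fixed learning rate and
#    first-derivative (gradient) updates.
learning_rate = 0.1
for step in range(200):
    error = X @ W + b - y                  # forward pass and residual
    loss = np.mean(error ** 2)             # mean-squared error
    grad_W = 2 * X.T @ error / len(X)      # gradient of the loss w.r.t. W
    grad_b = 2 * error.mean(axis=0)        # gradient of the loss w.r.t. b
    W -= learning_rate * grad_W            # fixed-step gradient descent
    b -= learning_rate * grad_b
```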
Gradient descent to ADAM
The original version of gradient descent proposed in 1986 for training neural networks averaged the loss over the entire dataset before taking the gradient and updating the weights. Obviously, this is quite slow and makes distributing the model difficult, since we can’t split the input data across model replicas; if we use replicas, each one needs access to the whole dataset.
In contrast, stochastic gradient descent (SGD) computes gradient updates after seeing only a single sample or a small mini-batch, making each individual update far cheaper and allowing the weights to improve long before a full pass over the dataset is complete.
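To make the contrast concrete, here is a sketch of the two update schedules on the same single-layer, mean-squared-error setup used above; the data, the `gradients` helper, and the batch size of 10 are illustrative choices rather than details from the lesson.

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 3))
y = X @ np.array([[1.0], [-2.0], [0.5]])
W, b = rng.uniform(-0.5, 0.5, size=(3, 1)), np.zeros(1)
learning_rate = 0.1

def gradients(W, b, X_batch, y_batch):
    """Mean-squared-error gradients for a single linear layer."""
    error = X_batch @ W + b - y_batch
    return 2 * X_batch.T @ error / len(X_batch), 2 * error.mean(axis=0)

# Full-batch gradient descent: the loss (and gradient) is averaged over the
# entire dataset, so one pass over the data produces a single update.
grad_W, grad_b = gradients(W, b, X, y)
W -= learning_rate * grad_W
b -= learning_rate * grad_b

# Stochastic (mini-batch) gradient descent: many cheaper updates per pass,
# each computed from a small slice of the data.
batch_size = 10
for start in range(0, len(X), batch_size):
    grad_W, grad_b = gradients(W, b,
                               X[start:start + batch_size],
                               y[start:start + batch_size])
    W -= learning_rate * grad_W
    b -= learning_rate * grad_b
```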
However, SGD can be slow to converge, leading researchers to propose alternatives that accelerate the search for a minimum. As seen in the original backpropagation algorithm, one idea is to use a form of exponentially weighted momentum that remembers prior steps and keeps moving in promising directions. Variants have been proposed, such as Nesterov momentum, which adds a term to increase this acceleration by evaluating the gradient at the point the accumulated momentum is about to carry the weights to, rather than at the current weights.
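The following sketch writes the two momentum ideas as simple update rules on a parameter vector; the `grad_fn` callback, the decay factor `beta`, and the learning rate are illustrative placeholders, not values from the lesson.

```python
import numpy as np

def momentum_step(w, velocity, grad_fn, learning_rate=0.01, beta=0.9):
    """Classical momentum: keep an exponentially weighted average of past
    gradient steps and continue moving in that accumulated direction."""
    velocity = beta * velocity - learning_rate * grad_fn(w)
    return w + velocity, velocity

def nesterov_step(w, velocity, grad_fn, learning_rate=0.01, beta=0.9):
    """Nesterov momentum: evaluate the gradient at the "look-ahead" point
    the momentum step is about to reach, then apply the correction."""
    lookahead = w + beta * velocity
    velocity = beta * velocity - learning_rate * grad_fn(lookahead)
    return w + velocity, velocity

# Example: minimize f(w) = ||w||^2, whose gradient is 2w.
w = np.array([5.0, -3.0])
velocity = np.zeros_like(w)
for _ in range(100):
    w, velocity = nesterov_step(w, velocity, lambda w: 2 * w)
print(w)  # close to the minimum at the origin
```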