Improved GANs—Deep Convolutional GAN

Learn about an improved version of GANs, the deep convolutional GAN, and how it can be implemented.

We'll cover the following...

Implementation
Vector arithmetic

The vanilla GAN proved the potential of adversarial networks. The ease of setting up the models and the quality of their output sparked considerable interest in the field, which led to a great deal of research on improving the GAN paradigm.

The deep convolutional GAN (DCGAN), published in 2016 by Radford et al., introduced several key contributions to improve GAN outputs, apart from its focus on convolutional layers, which are discussed in the original GAN paper (Radford, Alec, Luke Metz, and Soumith Chintala. 2015. “Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks.” arXiv. https://arxiv.org/abs/1511.06434). The 2016 paper emphasized using deeper architectures instead. The following figure shows the generator architecture for a DCGAN, as proposed by the authors. The generator takes the noise vector as input and passes it through a repeating setup of up-sampling layers, convolutional layers (shown as CONV 1, CONV 2, CONV 3, and CONV 4), and batch normalization layers, which stabilize the training.

[Figure: Generator architecture for DCGAN, as proposed by Radford et al.]
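
The repeating up-sample, convolve, and batch-normalize pattern described above can be sketched in Keras roughly as follows. This is a minimal illustration, not the architecture from the paper or from this course: the function name `build_generator`, the 28×28 grayscale output size, the 100-dimensional noise vector, the filter counts, and the use of only two up-sampling blocks are all assumptions chosen to keep the example small.

```python
import tensorflow as tf
from tensorflow.keras import layers


def build_generator(noise_dim=100):
    """Map a noise vector to a 28x28x1 image (sizes are illustrative assumptions)."""
    return tf.keras.Sequential(
        [
            tf.keras.Input(shape=(noise_dim,)),
            # Project the noise vector and reshape it into a small feature map.
            layers.Dense(7 * 7 * 128),
            layers.Reshape((7, 7, 128)),
            layers.BatchNormalization(),
            layers.LeakyReLU(),
            # Block 1: up-sample 7x7 -> 14x14, then convolve and normalize.
            layers.UpSampling2D(size=2),
            layers.Conv2D(64, kernel_size=3, padding="same"),
            layers.BatchNormalization(),
            layers.LeakyReLU(),
            # Block 2: up-sample 14x14 -> 28x28, then convolve and normalize.
            layers.UpSampling2D(size=2),
            layers.Conv2D(32, kernel_size=3, padding="same"),
            layers.BatchNormalization(),
            layers.LeakyReLU(),
            # Final convolution produces one channel with values in [-1, 1].
            layers.Conv2D(1, kernel_size=3, padding="same", activation="tanh"),
        ],
        name="dcgan_generator_sketch",
    )


generator = build_generator()
noise = tf.random.normal((4, 100))          # a batch of 4 noise vectors
images = generator(noise, training=False)   # inference mode for batch norm
print(images.shape)                         # (4, 28, 28, 1)
```

Each block doubles the spatial resolution, so adding blocks (or starting from a larger initial feature map) is one way to reach larger outputs.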

Until the introduction of DCGANs, the output image resolution was quite limited. The Laplacian pyramid GAN (LAPGAN) was proposed to generate high-quality images, but its output also suffered from a certain fuzziness. The DCGAN paper also made use of another important invention, the batch normalization layer. Batch normalization was introduced after the original GAN paper and proved useful in stabilizing the overall training by normalizing the input to each unit to have zero mean and unit variance. To get higher-resolution images, DCGAN used strides greater than 1 when moving the convolutional filters.
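
To make the "zero mean and unit variance" idea concrete, here is a small NumPy sketch of the training-time batch normalization computation. The function name `batch_norm` and the sample data are assumptions for illustration. `gamma` and `beta` stand in for the learnable scale and shift that a real layer would train, and the running statistics a framework layer keeps for inference are left out.

```python
import numpy as np


def batch_norm(x, gamma=1.0, beta=0.0, eps=1e-5):
    """Normalize each feature (column) of x over the batch dimension."""
    mean = x.mean(axis=0)                     # per-feature batch mean
    var = x.var(axis=0)                       # per-feature batch variance
    x_hat = (x - mean) / np.sqrt(var + eps)   # zero mean, unit variance
    return gamma * x_hat + beta               # optional scale and shift


rng = np.random.default_rng(0)
# 64 samples with 4 features, deliberately off-center and widely spread.
activations = rng.normal(loc=5.0, scale=3.0, size=(64, 4))
normalized = batch_norm(activations)

print(normalized.mean(axis=0))  # each entry is close to 0
print(normalized.std(axis=0))   # each entry is close to 1
```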

Implementation

Let’s start by preparing the discriminator model. CNN-based binary classifiers are simple models. One modification we make here is to use strides greater than $1$ ...
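
As a rough illustration of what a stride greater than 1 does to a feature map, the sketch below computes the output size of a convolution along one spatial dimension. It follows the usual "same" and "valid" padding conventions. The helper name `conv_output_size` and the 28-pixel input are assumptions, not part of the lesson's code.

```python
import math


def conv_output_size(size, kernel, stride, padding="same"):
    """Output length of a convolution along one spatial dimension."""
    if padding == "same":
        # Input is padded so that only the stride reduces the size.
        return math.ceil(size / stride)
    if padding == "valid":
        # No padding: the kernel must fit entirely inside the input.
        return (size - kernel) // stride + 1
    raise ValueError(f"unknown padding: {padding!r}")


# Stride 1 keeps the resolution; stride 2 halves it at every layer.
print(conv_output_size(28, kernel=3, stride=1))  # 28
size = 28
for layer in range(2):
    size = conv_output_size(size, kernel=3, stride=2)
    print(f"after strided conv {layer + 1}: {size}x{size}")
# after strided conv 1: 14x14
# after strided conv 2: 7x7
```

With a stride of 2, each convolution roughly halves the height and width, so the strided convolutions themselves can handle the down-sampling in a discriminator.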
