Generative Image Inpainting

Explore the concept of generative image inpainting.

We know that GANs, if trained properly, are capable of learning the latent distribution of the data and using that information to create new samples. This extraordinary ability makes GANs perfect for applications such as image inpainting, that is, filling in the missing parts of an image with plausible pixels.

Generative image inpainting

In this section, we will learn how to train a GAN model to perform image inpainting based on the generative image inpainting paper (Yu, Jiahui, Zhe Lin, Jimei Yang, Xiaohui Shen, Xin Lu, and Thomas S. Huang. "Generative image inpainting with contextual attention." In Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 5505-5514. 2018), although an updated version of the paper (https://github.com/JiahuiYu/generative_inpainting) has also been published. Before we start working on image inpainting with GANs, there are a few fundamental concepts to understand, as they are crucial to comprehending the method.

Efficient convolution from im2col to nn.Unfold

If you have previously been curious enough to try implementing convolutional neural networks on your own (either with Python or C/C++), you must know that the most painful part of the work is the backpropagation of gradients, and the most time-consuming part is the convolutions (assuming that it is a plain CNN implementation such as LeNet).

There are several ways to perform the convolution in our code (apart from directly using deep learning tools such as PyTorch):

  1. Calculate the convolution directly as per definition, which is usually the slowest way.

  2. Use the Fast Fourier Transform (FFT), an algorithm that computes the discrete Fourier transform of a sequence or its inverse. The FFT approach is not ideal for CNNs, since the kernels are often far too small compared to the images.

  3. Treat the convolution as matrix multiplication (in other words, General Matrix Multiply, or GeMM) using im2col. This is the most common method, used by numerous software libraries and tools, and is a lot faster.

  4. Use the Winograd method, in which the input and kernel are sampled at a given set of points using transform matrices. It is faster than GeMM under certain circumstances.
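To make the FFT approach (method 2) concrete, here is a minimal NumPy sketch, not taken from the paper; the function name `conv2d_fft` is our own. It zero-pads both arrays to the size of the full convolution, multiplies their spectra, transforms back, and crops the "valid" region so that the output size matches the direct implementation:

```python
import numpy as np

def conv2d_fft(x, w):
    # Pad both arrays to the size of the full convolution, multiply
    # their spectra (convolution theorem), transform back, and crop
    # the "valid" region of size (rows-kh+1, cols-kw+1)
    rows, cols = x.shape
    kh, kw = w.shape
    full = (rows + kh - 1, cols + kw - 1)
    spec = np.fft.rfft2(x, s=full) * np.fft.rfft2(w, s=full)
    out = np.fft.irfft2(spec, s=full)
    return out[kh - 1:rows, kw - 1:cols]
```

Note that no kernel flip is needed here: pointwise multiplication in the frequency domain already corresponds to a true convolution, not a cross-correlation.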

In this section, we will only talk about the first three methods. To learn more about the Winograd method, feel free to check out this project (https://github.com/andravin/wincnn) and this paper (Lavin, Andrew, and Scott Gray. "Fast algorithms for convolutional neural networks." In Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 4013-4021. 2016).
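Before we dive in, here is a minimal NumPy sketch of the im2col idea behind method 3; the helper names `im2col` and `conv2d_gemm` are our own, hypothetical choices. Every kernel-sized patch of the image is unrolled into one column of a matrix, so the whole convolution collapses into a single matrix product:

```python
import numpy as np

def im2col(x, kh, kw):
    # Unroll every kh x kw patch of x into one column of a matrix
    out_h = x.shape[0] - kh + 1
    out_w = x.shape[1] - kw + 1
    cols = np.empty((kh * kw, out_h * out_w))
    for i in range(out_h):
        for j in range(out_w):
            cols[:, i * out_w + j] = x[i:i + kh, j:j + kw].ravel()
    return cols

def conv2d_gemm(x, w):
    kh, kw = w.shape
    # Flip the kernel (true convolution), flatten it, and reduce the
    # whole convolution to a single matrix-vector product (GeMM)
    w = np.flip(np.flip(w, 0), 1).ravel()
    out = w @ im2col(x, kh, kw)
    return out.reshape(x.shape[0] - kh + 1, x.shape[1] - kw + 1)
```

PyTorch exposes the same patch extraction as `torch.nn.Unfold` (or `torch.nn.functional.unfold`), which is where the title of this section comes from: the framework hands you the unrolled patches, and the convolution itself becomes a batched matrix multiplication.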


Python code for 2D convolution

Here, we will use Python code for 2D convolution with different methods. Let's follow these steps:

  1. Directly calculate the convolution. Note that all of the following convolution implementations have a stride size of 1 and a padding size of 0, which means that the output size is image_size - kernel_size + 1:

import numpy as np

def conv2d_direct(x, w):
    # Flip the kernel so that we compute a true convolution
    # rather than a cross-correlation
    w = np.flip(np.flip(w, 0), 1)
    rows = x.shape[0]
    cols = x.shape[1]
    kh = w.shape[0]
    kw = w.shape[1]
    rst = np.zeros((rows-kh+1, cols-kw+1))
    for i in range(rst.shape[0]):
        for j in range(rst.shape[1]):
            tmp = 0.
            for ki in range(kh):
                for kj in range(kw):
                    tmp += x[i+ki][j+kj] * w[ki][kj]
            rst[i][j] = tmp
    return rst

As we said before, directly calculating the convolution as per its definition is extremely slow. Here is the elapsed time when convolving a 512 x 512 image with a ...