Text-to-Image Synthesis with GANs

Explore the concepts of word embedding and text-to-image synthesis.

We'll cover the following...

Quick introduction to word embedding
Translating text to image with zero-shot transfer learning
- Zero-shot learning

Almost every GAN model generates synthesized data by establishing a mapping from some form of input data to the output data. To generate an image from a sentence that describes it, we therefore first need to understand how to represent sentences as vectors.

Quick introduction to word embedding

It is fairly easy to define a way of turning the words in a sentence into vectors. We can simply assign a distinct value to every possible word (for example, let 001 represent "I", 002 represent "eat", and 003 represent "apple"), so that a sentence is uniquely represented by a vector (for example, "I eat apple" becomes [001, 002, 003]). This is essentially how words are represented in computers. However, languages are far more complicated and flexible than bare numbers. Without knowing the meaning of words (for example, whether a word is a noun or a verb, positive or negative), it is nearly impossible to establish ...
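The index-based representation described above can be sketched in a few lines of plain Python. This is only an illustration, not code from the course: the names vocab, encode, and decode are hypothetical, and the three-word vocabulary is the toy example from the text.

```python
# Toy vocabulary from the text: each word gets an arbitrary integer ID.
# (Hypothetical names; a real vocabulary would be built from a corpus.)
vocab = {"I": 1, "eat": 2, "apple": 3}


def encode(sentence, vocab):
    """Map a whitespace-separated sentence to a list of integer IDs."""
    return [vocab[word] for word in sentence.split()]


def decode(ids, vocab):
    """Map a list of integer IDs back to a sentence."""
    inverse = {idx: word for word, idx in vocab.items()}
    return " ".join(inverse[i] for i in ids)


encoded = encode("I eat apple", vocab)
print(encoded)                 # [1, 2, 3]
print(decode(encoded, vocab))  # I eat apple
```

The IDs identify each word uniquely, but they are arbitrary: the fact that "eat" (2) and "apple" (3) are numerically adjacent says nothing about how the two words relate in meaning, which is the limitation the paragraph above points to.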
