...

Generative Models

Learn about the generative models and their types.

We'll cover the following...

Overview
Types of generative models
Applications of generative models
Challenges and limitations

Press + to interact

In the following section, we’ll discuss the various types of generative models and their respective applications.

Types of generative models

There are various generative models, and the choice to use a specific type depends on the nature of an application. Numerous generative models are in use today, and the number continues to grow as researchers keep improving the existing models. Let’s discuss some of the most widely used types of generative models.

Probabilistic models

These models are a basic form of generative models. As the name suggests, probabilistic models capture the probability distribution of the provided data and use it to generate new samples. These models can be based on simple distributions, such as Gaussian distributions, or complex models, like mixture models. Hidden Markov models (HMM) and Gaussian mixture models (GMM) are types of probabilistic models. These models are used in time series modeling, speech recognition, and natural language processing.

Press + to interact

Variational autoencoders (VAEs)

Variational autoencoders (VAEs) are a type of generative model that combines the elements of probabilistic modeling with the autoencoders. This type of model takes an input image and transforms it into a lower-dimensional representation known as latent space. In a VAE, the latent space refers to a low-dimensional representation where the encoder network maps the input data (e.g., images) into a probability distribution, typically a Gaussian distribution, represented by mean (μ) and variance (σ^2) parameters. This step reduces the complexity of the image while retaining its essential features. VAEs learn to encode input images into latent space and decode them back to their original space with sampling from a probability distribution. This allows them to generate new data points by sampling from the latent space. These VAEs are widely used in data compression, anomaly detection, and image generation.

Press + to interact

Generative adversarial networks (GANs)

Generative adversarial networks (GANs) comprise two neural networks: a generator and a discriminator. The objective of the generator part is to create data similar to the provided dataset, while the role of the discriminator is to differentiate between the real and generated datasets. This adversarial process leads to the generator improving its ability to create highly realistic data. GANs are widely used in image-to-image translation, video, and image generation.

Press + to interact

Applications of generative models

Generative models have a wide range of applications in numerous domains. Let’s discuss some of them:

Text generation: Transformers and RNNs generate text, from creative writing to providing chatbot responses.
Image generation: GANs can generate high-definition (HD) images, from artworks and landscapes to human faces.
Anomaly detection: The process of modeling normal data distribution allows the generative models to detect outliers or anomalies in data.
Data augmentation: To improve the performance of machine learning models, generative models can be used to create additional training data.
Improved resolution: GANs enhance video and image quality by providing higher-resolution versions.
Drug discovery: Generative models can accelerate the pharmaceutical drug discovery process.
Art generation: Generative models can help artists and creators generate novel artworks and explore exciting creative possibilities.

Press + to interact

These applications highlight the versatility and significance of generative models in various fields. The applications come with several challenges and limitations in the following section.

Challenges and limitations

While generative models provide enormous advantages, they still face challenges and limitations. Some of them are as follows:

Evaluation: Accessing the quality of generated data is still a significant task. It’s difficult to capture the quality of generated data with common metrics.
Mode collapse: In GANs, mode collapse refers to a situation where the generator produces a limited set of similar or identical samples rather than a diverse output range. This can severely degrade the quality and diversity of the generated data.
Data requirements: In some situations, generative models require large datasets for training purposes, but the provided data is not enough or available.
Training instability: Hyperparameter tuning is an important aspect of generative model training. It can create challenges if hyperparameters are not set properly.

Press + to interact

Modes	OpenAI	Meta AI	Google	Others
Text	GPT-4	LLaMA	PaLM 2, Bard	Falcon, Dolly, Alpaca
Image	CLIP, DALL-E 2	ImageBind, SAM	Imagen	Stable Diffusion, Midjourney
Audio	Whisper	Voicebox	MusicLM	-

Course Overview

Getting Started

Image Creation from Scratch

Wrapping Up

Appendix

Data Augmentation for ML Datasets

Generative Models

Overview