Discovering New Drugs

Explore research on drug discovery and protein structure prediction using generative modeling.

We'll cover the following...

Searching chemical space with generative molecular graph networks
Folding proteins with generative models
- What is protein folding?
Predicting physical properties

Generative AI is making a large impact on biotechnology research. We’ll focus on two major areas of interest: drug discovery and protein structure prediction.

Drug discovery involves the exploration and development of new pharmaceutical compounds to combat various diseases and medical conditions (Kirkpatrick, Peter, and Clare Ellis. 2004. “Chemical Space.” Nature 432 (7019): 823–23. https://doi.org/10.1038/432823a). Protein structure prediction, by contrast, focuses on the computational modeling of three-dimensional protein structures based on amino acid sequences.

Searching chemical space with generative molecular graph networks

At its core, a medicine (be it drugstore aspirin or an antibiotic prescribed by a doctor) is a chemical graph consisting of nodes (atoms) and edges (bonds), as shown in the figure “Chemical graph”. Like the textual data handled by generative language models, graphs have the special property of not being fixed in length. There are many ways to encode a graph, including a binary representation based on numeric codes for the individual fragments (also shown in the figure “Chemical graph”) and “SMILES” strings, which are linearized text representations of molecular structures.
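To make the graph view concrete, here is a minimal sketch in plain Python. The `Molecule` class and `adjacency_matrix` helper are hypothetical names introduced only for illustration; they are not part of any cheminformatics library. Hydrogen atoms are left implicit, as is conventional in SMILES, and the atom and bond lists are written out by hand rather than parsed from the SMILES string.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Molecule:
    """Illustrative container: a molecule as a graph plus its SMILES string."""
    name: str
    smiles: str                          # linearized text encoding
    atoms: List[str]                     # nodes: element symbols (heavy atoms only)
    bonds: List[Tuple[int, int, int]]    # edges: (atom index, atom index, bond order)


def adjacency_matrix(mol: Molecule) -> List[List[int]]:
    """Build a symmetric matrix whose entries hold the bond order (0 = no bond)."""
    n = len(mol.atoms)
    matrix = [[0] * n for _ in range(n)]
    for i, j, order in mol.bonds:
        matrix[i][j] = order
        matrix[j][i] = order
    return matrix


# Two small molecules, entered by hand for illustration.
ethanol = Molecule("ethanol", "CCO", ["C", "C", "O"], [(0, 1, 1), (1, 2, 1)])
acetic_acid = Molecule(
    "acetic acid", "CC(=O)O", ["C", "C", "O", "O"],
    [(0, 1, 1), (1, 2, 2), (1, 3, 1)],   # the C=O double bond has order 2
)

for mol in (ethanol, acetic_acid):
    print(mol.name, mol.smiles, f"{len(mol.atoms)} atoms, {len(mol.bonds)} bonds")
    for row in adjacency_matrix(mol):
        print("  ", row)
```

The two molecules produce matrices of different sizes (3×3 and 4×4), which illustrates why a model over chemical graphs, like a model over text, has to handle inputs of variable length.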

The number of potential features in a chemical graph is quite large; in fact, the number of potential chemical structures in the same size and property range as known drugs has been estimated¹ at $10^{60}$, even larger than the number of research papers on generative models. For reference, the number of atoms in the observable universe is between $10^{78}$ and $10^{82}$ (Villanueva, John Carl. 2009. “How Many Atoms Are There in the Universe?” Universe Today. July 31, 2009. https://www.universetoday.com/36302/atoms-in-.).

One can appreciate, then, that a large challenge of drug discovery—finding new drugs for existing and emerging diseases—is the sheer size of the potential space one might need to search. Experimental approaches for drug screening—testing thousands, millions, or even billions of compounds in high-throughput experiments to find a chemical needle in a haystack with potential therapeutic properties—have been used for decades. However, the development of computational methods such as machine learning has opened the door for “virtual screening” on a far larger scale.
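A rough back-of-the-envelope calculation shows the scale of the problem. The sketch below assumes a chemical space of roughly $10^{60}$ drug-like structures, takes "billions" to mean one billion screened compounds, and uses a purely hypothetical virtual-screening throughput of one billion compounds per second; none of these rates come from a real screening system.

```python
from fractions import Fraction

CHEMICAL_SPACE_SIZE = 10**60        # assumed order-of-magnitude size of drug-like space
SCREENED_COMPOUNDS = 10**9          # "billions" of compounds, taken here as one billion
ASSUMED_RATE_PER_SECOND = 10**9     # hypothetical virtual-screening throughput
SECONDS_PER_YEAR = 365.25 * 24 * 3600

# Exact fraction of the space covered by one billion-compound screen.
fraction_covered = Fraction(SCREENED_COMPOUNDS, CHEMICAL_SPACE_SIZE)

# Time to enumerate the whole space at the assumed rate.
seconds_needed = CHEMICAL_SPACE_SIZE // ASSUMED_RATE_PER_SECOND
years_needed = seconds_needed / SECONDS_PER_YEAR

print(f"Fraction of space covered: {float(fraction_covered):.0e}")
print(f"Years to enumerate everything: {years_needed:.2e}")
```

Even under this generous assumption, exhaustive enumeration is out of reach, which is why generative approaches aim to propose promising candidates directly instead of scoring every possible structure.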
