The MNIST Database

Learn about the MNIST database, which we'll use as our primary datasets.

In developing the DBN model, we’ll use a dataset that we have discussed before—the MNIST database, which contains digital images of hand-drawn digitsLeCun, Yann; Léon Bottou; Yoshua Bengio; Patrick Haffner (1998). Gradient-Based Learning Applied to Document Recognition. Proceedings of the IEEE. 86 (11): 2278–2324 from 00 to 99. This database is a combination of two sets of earlier images from the National Institute of Standards and Technology (NIST): Special Database 11 (digits written by US high school students) and Special Database 2LeCun, Yann; Corinna Cortes; Christopher J.C. Burges. MNIST handwritten digit database, Yann LeCun, Corinna Cortes, and Chris Burges (written by US Census Bureau employees), the sum of which is split into 60,000 training images and 10,000 test images.

The original images in the dataset were all black and white, while the modified dataset normalized them to fit into a 2020 x ...