Haiku
This lesson will introduce Haiku, a high-level neural network library that provides an object-oriented interface.
Haiku is a tool
For building neural networks
Think: “Sonnet for JAX”
Introduction
If we check the official documentation of Haiku, we will find the above literal haiku.
Like Keras or Sonnet for TensorFlow, Haiku is a high-level library that provides an object-oriented interface for building neural networks in JAX.
As a starter, let’s use a small MLP.
import haiku as hk
import jax
import jax.numpy as jnp

def forward(x):
    mlp = hk.nets.MLP([4, 3, 3])
    return mlp(x)

x = jnp.ones((5, 5))

# The line below will throw an error
y = forward(x)
If we run the code above, it will not execute and will instead throw the following error:
ValueError: All hk.Modules must be initialized inside an hk.transform.
Transforms
If you recall, throughout the course, JAX transformations operate only on pure functions. In contrast, Haiku is object-oriented.
Luckily, Haiku’s simple hk.transform helps us by converting an impure Haiku function into a pair of pure functions: init() and apply().
- Initialize: init() takes a PRNGKey and an input matrix (usually of ones or zeros, since its only purpose is to supply the input shape) and returns the randomly initialized parameters. This is yet another application of JAX’s PRNG.
- Apply: apply() takes the initialized parameters, a PRNGKey, and the input matrix. Since the PRNGKey has little use in most forward passes (dropout being a notable exception), we can simply pass None. The input matrix must be passed again so that apply() can actually run the forward pass on it.
def feedForward(x):
    mlp = hk.nets.MLP([4, 3, 3])
    return mlp(x)

transformedForward = hk.transform(feedForward)

key = jax.random.PRNGKey(0)
X = jnp.ones((5, 5))

initialized_X = transformedForward.init(key, X)
Y = transformedForward.apply(initialized_X, None, X)

# You can check the outputs by uncommenting the lines below
# print("----")
# print(initialized_X)
# print("----X and Y-----")
# print(X)
# print(Y)
We can analyze the neural network’s parameter structure using pytrees.
def forward(x):
    mlp = hk.nets.MLP([40, 30, 30, 12])
    return mlp(x)

forward = hk.transform(forward)

key = jax.random.PRNGKey(0)
X = jnp.ones((5, 5))

initialized_X = forward.init(key, X)
Y = forward.apply(initialized_X, None, X)

print(jax.tree_map(jnp.shape, initialized_X))
Having seen the basic mechanisms of Haiku, we’ll explore its vast function library a bit more.
Haiku has support for almost every neural network module:
Linear
For linear modules, we have these three functions:
- Linear()
- Bias()
- nets.MLP()
Linear layer
The Linear() function adds a linear layer with the following parameters:
- Output size, which is an integer.
- Whether to use bias or not, which is set to True by default.
By default, Linear() uses the same initializer for weights as the one used in batch normalization’s original paper.
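To see these parameters in action, here is a minimal sketch that reuses the transform pattern from above to wrap a single Linear() layer; the output size of 3, the with_bias=False setting, and the linearForward name are arbitrary choices for illustration.

import jax
import jax.numpy as jnp
import haiku as hk

def linearForward(x):
    # A single linear layer: integer output size, no bias term
    layer = hk.Linear(output_size=3, with_bias=False)
    return layer(x)

linearForward = hk.transform(linearForward)

key = jax.random.PRNGKey(42)
X = jnp.ones((5, 5))

params = linearForward.init(key, X)
Y = linearForward.apply(params, None, X)

print(jax.tree_map(jnp.shape, params))  # only a weight matrix, since with_bias=False
print(Y.shape)  # (5, 3)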
Bias
If we wanted to add a bias separately, we would use the Bias() function. All its parameters are optional and are mentioned below as a reference:
- output_size specifies the output size.
- bias_dims is a parameter for the bias vector dimensions.
- b_init specifies the algorithm for bias initialization.
- name allows us to specify a name for the Haiku module.
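To make these parameters concrete, here is a small sketch of a standalone Bias() module; the constant initializer and the bias_demo name are arbitrary choices for illustration.

import jax
import jax.numpy as jnp
import haiku as hk

def biasedForward(x):
    # A standalone bias module with a custom initializer and name
    bias = hk.Bias(b_init=hk.initializers.Constant(1.0), name="bias_demo")
    return bias(x)

biasedForward = hk.transform(biasedForward)

key = jax.random.PRNGKey(0)
X = jnp.zeros((2, 4))

params = biasedForward.init(key, X)
Y = biasedForward.apply(params, None, X)

print(jax.tree_map(jnp.shape, params))  # the bias vector, named after the module
print(Y)  # zeros plus the bias, so every entry is 1.0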
Multi-layer perceptron (MLP)
As we saw earlier, we can skip building individual layers and create an MLP directly with the haiku.nets.MLP() function. Its parameters are:
- output_sizes specifies the output size of each layer as a sequence of integers.