Running GAIL on PyBullet Gym
Learn how to implement GAIL and build the actor-critic network.
Mujoco is a popular physics simulator used for developing reinforcement learning benchmarks, for example by the research group OpenAI; however, it is closed source and requires a license for use.
Running GAIL
For our experiments, we'll use PyBullet Gymperium, a drop-in replacement for Mujoco that allows us to run a physics simulator and import agents trained in Mujoco environments.
See the OpenAI Baselines repository for reference implementations of many of the reinforcement learning algorithms used with these benchmarks.
To show how this simulated environment works, let’s create a “hopper,” one of the many virtual agents you can instantiate with the library:
import gym
import pybulletgym

env = gym.make('HopperMuJoCoEnv-v0')
observation = env.reset()
print("Observation vector of the walker:\n", observation)
The output of the code above is an array giving the current observation of the walker (an 11-dimensional vector).
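If you want to confirm these dimensions yourself, you can query the environment's spaces directly. This is a minimal check assuming the same env object created above:

# Inspect the observation and action spaces of the hopper environment
print("Observation space:", env.observation_space.shape)  # (11,) for the hopper
print("Action space:", env.action_space.shape)            # torques applied to the hopper's joints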
Adding the call env.render("human") will create a window showing the "hopper," a simple single-footed figure that moves in a simulated 3D environment (shown in the figure below):
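Note that in some versions of PyBullet Gymperium the visualization window only appears if rendering is requested before the first reset; a short sketch of that ordering (an assumption about the library version, using the same env object) is:

# In some pybullet-gym versions the GUI must be requested before reset()
env.render("human")        # open the visualization window
observation = env.reset()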
We can run a few iterations of the hopper in its raw, "untrained" form, to get a sense of how it looks. In this simulation, we take up to 1,000 steps and visualize it using a pop-up window:
env.reset()
for t in range(1000):
    action = env.action_space.sample()
    _, _, done, _ = env.step(action)
    env.render("human")
    if done:
        break
We first clear the environment with reset(). Then, for up to 1,000 timesteps, we sample the action space (for example, a random set of torques for the hopper's joints), pass the sampled action to step() to get an updated reward and observation, and render the result until the movement completes.
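Since GAIL ultimately consumes trajectories of (observation, action) pairs, it helps to see how a rollout like the one above could be recorded. The following is a sketch under that assumption; the names obs_list and act_list are our own, not part of the library:

import numpy as np

# Roll out one episode with a random policy and record (observation, action) pairs
obs_list, act_list = [], []
obs = env.reset()
for t in range(1000):
    action = env.action_space.sample()   # random policy, for illustration only
    obs_list.append(obs)
    act_list.append(action)
    obs, reward, done, _ = env.step(action)
    if done:
        break

observations = np.array(obs_list)        # shape: (timesteps, 11)
actions = np.array(act_list)             # shape: (timesteps, action_dim)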
This demonstration comes from a completely untrained hopper. For our GAIL implementation, we’ll need a hopper that has been successfully trained to walk as a sample of “expert” trajectories for the algorithm. For this purpose, we’ll download a set of hopper data from the OpenAI site.
The download contains a set of NumPy archives, such as deterministic.trpo.Hopper.0.00.npz, holding samples of data from reinforcement learning agents trained using the Trust Region Policy Optimization (TRPO) algorithm used in step 4 of the
If we load this data, we can also ...
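As a sketch of what that loading step might look like, assume the archive deterministic.trpo.Hopper.0.00.npz has been downloaded into the working directory; the exact array names inside depend on the file, so we list them before using them:

import numpy as np

# Load the expert trajectory archive and inspect its contents
expert_data = np.load("deterministic.trpo.Hopper.0.00.npz", allow_pickle=True)
print("Arrays in the archive:", expert_data.files)

# Print the shape of each stored array (for example, expert observations and actions)
for name in expert_data.files:
    print(name, expert_data[name].shape)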