Setup

Learn to set up the environment for data transformation.

We'll cover the following...

Overview of the setup

First, we need to load our snapshot of the original data. In the case of pandas, we load the original data. However, for PySpark, we use the snapshot.

Here’s the list of imports we would need to work with pandas and PySpark: ...