Convert and Encode Data Types
Understand how to convert and encode data types appropriately.
Convert data types
There are four main ways we can convert the data type of DataFrame columns:
We can use the
astype()
function to enforce apandas
dtype.We can use the
to_
functions liketo_datetime()
,to_timedelta()
, andto_numeric()
for manual dtype conversion.We can use the
apply()
function to apply the custom Python function to columns.We can use the
convert_dtypes()
function for automatic dtype inference and conversion.
Lets illustrate these different ways by using a dataset of students’ demographics and academic performance.
# View students datasetprint(df)print('='*55)# Inspect data typesprint(df.dtypes)
There are a series of changes we need to make to this dataset:
Convert
student_id
from float to the integer type.Convert
gender
from object (aka string) to category type.Convert
enroll_year
from object to integer type.Convert
birthdate
from object ...
Get hands-on with 1400+ tech skills courses.