Grouping
Learn how DataFrames can be grouped based on particular columns.
We'll cover the following...
Chapter Goals:
- Learn how to group DataFrames by columns
- Write code to retrieve home run statistics through DataFrame grouping
A. Grouping by column
When dealing with large amounts of data, it is usually a good idea to group the data by common categories. For example, we could group a large dataset of MLB player statistics by year, so we can deal with each year's data separately.
With pandas DataFrames, we can perform dataset grouping with the groupby
function. A common usage of the function is to group a DataFrame by values from a particular column, e.g. a column representing years.
The code below shows how to use the groupby
function, ...