Yearly Median Review
Learn to aggregate data in pandas and PySpark.
We'll cover the following
Calculate yearly median review in Pandas
To calculate any kind of aggregation, both pandas and PySpark API provide the groupby
and agg
methods which return a DataFrame. First, we have to group the data by year and month. Then we have to calculate the final median score in two steps, as shown in the following example:
Get hands-on with 1200+ tech skills courses.