Challenge: Data Transformation
Let's solve programming challenges related to data transformation in PySpark.
Task
Perform summary statistics on the review_text
and vote
columns.
Steps
- Impute
NaN
values. - Show statistics of
vote
. - Show statistics of
review_text
.
Get hands-on with 1300+ tech skills courses.