Pandas DataFrame Operations - Read, View and Extract Information
Learning Pandas with IMDB-Movies Dataset
Time for some real fun! We have learned how to create DataFrames already. Now we are going to explore the many operations that can be performed on them.
To make this step more engaging and fun, we are going to work with the IMDB Movies Dataset. The IMDB dataset is a publicly available dataset that contains information about 14,762 movies. Each row consists of a movie and for each movie we have information like title, year of release, director, number of awards, rating, duration etc. Sounds fun to explore, right? Let’s put our data scientist’s hat on, and dive into the world of the movies! 🔎🎥
Important DataFrame Operations
We are going to go through the most important DataFrame operations for a data scientist to know, one by one:
1. Reading Data From CSVs
The dataset for these lessons is here on Kaggle. Once we have downloaded the data, we can load it using the DataFrame creation method we mentioned in the previous lesson.
📌 Note: Once you have gone through these “IMDB-lessons”, I highly recommend you download this dataset and play with it. It is really important to get your hands dirty; don’t just read through these lessons!
📌You can also find the Juptyter Notebook with all the code for these “IMDB-lessons” on my Git ...