Exploratory Data Analysis

Get familiar with EDA, statistical properties, and visualizing underlying data relationships.

Exploratory data analysis (EDA) combines several steps: data cleaning, visualization, descriptive statistics, and hypothesis testing. The goal is to understand the relationships within the data before we start modeling.
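To make these steps concrete, here is a minimal, dependency-free sketch of three of them (cleaning, descriptive statistics, and a simple hypothesis-test statistic) on a toy table. The records, the field names `group` and `value`, and the helper names `summarize` and `welch_t` are all invented for illustration; a real analysis would typically use a data-frame library and add plots for the visualization step.

```python
import math
import statistics

# Toy records, invented for illustration only (not from any real dataset).
rows = [
    {"group": "A", "value": 10.0},
    {"group": "A", "value": 12.0},
    {"group": "A", "value": None},  # a missing value that cleaning should remove
    {"group": "A", "value": 11.0},
    {"group": "B", "value": 15.0},
    {"group": "B", "value": 17.0},
    {"group": "B", "value": 16.0},
    {"group": "B", "value": 16.0},
]

# Step 1, data cleaning: drop rows whose value is missing.
clean = [r for r in rows if r["value"] is not None]

# Step 2, descriptive statistics: count, mean, and sample standard deviation per group.
def summarize(values):
    return {
        "n": len(values),
        "mean": statistics.mean(values),
        "stdev": statistics.stdev(values),
    }

groups = {}
for r in clean:
    groups.setdefault(r["group"], []).append(r["value"])
summary = {name: summarize(vals) for name, vals in groups.items()}

# Step 3, hypothesis testing: Welch's t statistic for a difference in group means.
# Only the statistic is computed here; turning it into a p-value needs a
# t-distribution, which the standard library does not provide.
def welch_t(a, b):
    se_a = statistics.variance(a) / len(a)
    se_b = statistics.variance(b) / len(b)
    return (statistics.mean(a) - statistics.mean(b)) / math.sqrt(se_a + se_b)

t_stat = welch_t(groups["A"], groups["B"])

print(summary)
print(round(t_stat, 3))
```

A large absolute value of the statistic suggests the group means differ by more than sampling noise would explain, which is the kind of signal EDA is meant to surface before any modeling.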

In this lesson, we’ll perform EDA on the Lending Club loans dataset, a collection of records for loans facilitated through Lending Club, a peer-to-peer marketplace that connects borrowers with investors. Each record describes a loan (amount, interest rate, term, purpose, and status) and its borrower (homeownership, employment status, and income), along with other relevant details. Let’s start by looking at a few sample records.
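A first look at a dataset like this might go as follows. This is a minimal sketch that assumes pandas is installed; the column names (`loan_amnt`, `int_rate`, `term`, `purpose`, `home_ownership`, `annual_inc`, `loan_status`) and all three rows are made up for illustration and are not taken from the actual file. With a real copy of the data, you would pass its path to `pd.read_csv` instead of the inline text.

```python
import io

import pandas as pd

# Hypothetical sample: column names and values are invented for illustration.
csv_text = """loan_amnt,int_rate,term,purpose,home_ownership,annual_inc,loan_status
10000,11.5,36 months,debt_consolidation,RENT,52000,Fully Paid
24000,15.2,60 months,credit_card,MORTGAGE,88000,Current
5000,7.9,36 months,car,OWN,,Fully Paid
"""

# Load the sample; the empty annual_inc field is parsed as a missing value (NaN).
loans = pd.read_csv(io.StringIO(csv_text))

# Preview the first rows and the inferred column types.
print(loans.head())
print(loans.dtypes)

# Count missing values per column to see what needs cleaning.
missing_per_column = loans.isna().sum()
print(missing_per_column)

# Summary statistics for the numeric columns.
numeric_summary = loans.describe()
print(numeric_summary)
```

Checking the shape, column types, and missing-value counts first tells us which columns need cleaning before we compute statistics or draw plots.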
