Introduction to the Dataset for the Course

Understand the problem statement and the loan approval dataset.

Problem statement

A company known as Dream Housing Finance offers a wide variety of home loans. They maintain a presence in all of the urban, semi-urban, and rural regions of the country. The process begins with the customer submitting an application for a home loan, and it is followed by the company’s efforts to cross-check the information provided in the application and then verify the customer’s eligibility for the loan.

The company wants to be able to automatically determine, in real-time, if a customer is eligible for the loan they’ve applied for based on the information they provide in their online loan applications.

They have provided a dataset to automate this process, which will identify the customer segments that are qualified for loan amounts. This will allow them to specifically target these customers.

The loan approval dataset

In this course, we’ll utilize the loan dataset, which is a binary classification dataset consisting of loan details and the status of different customers. The aim is to develop an ML model that predicts if a customer’s request for a loan can be approved or not.

Binary classification is a type of supervised learning in ML where the goal is to classify input data into one of two possible categories. The categories are typically represented as:

  • 0 and 1

  • True and false

  • Positive and negative

Note: These categories can also be presented in different ways.

The ML algorithm is trained on a labeled dataset, where each data point is linked to the correct category label. The objective of the ML algorithm is to learn a decision boundary that separates the two classes. Once the ML model is trained, it can be used to predict the category of new, unseen data points.

Binary classification is used in many different ways, such as to detect spam, fraud, and medical diagnoses.

Here are a few sample rows of the loan approval dataset:

Get hands-on with 1200+ tech skills courses.