An Introductory Guide to Data Science and Machine Learning/

...

DBSCAN Clustering and Customer Segmentation

Density-based clustering is also one of the most widely used clustering algorithms which helps in detecting the outliers as in data. In this lesson, you can discover more about it.

We'll cover the following...

- DBSCAN clustering
  - Working of DBSCAN clustering
- - Hyperparameters
- - Advantages
  - Disadvantages
- Other variations
- Customer segmentation problem

DBSCAN clustering

The acronym DBSCAN stands for Density Based spatial clustering of Applications with Noise. It works on the analogy that clusters are the areas of high density separated by the areas of low density. Due to its property of considering clusters as areas of high density separated from areas of low density, it can deal with clusters of any shape unlike K-means clustering which assumes clusters are spherical, equally dense, and not contaminated by outliers.

It marks points as outliers or noise that lie alone in low-density regions (whose nearest neighbors are too far away). It also makes the assumption that there is noise in the dataset. Clusters in density-based clustering satisfy the following properties:

All points in a cluster are mutually-density connected.
If a point is density reachable from some point of the cluster, it is also the part of the cluster.

Working of DBSCAN clustering

DBSCAN works in the following way.

It starts by identifying core samples or points in the dataset. A Core sample or point is the one that has at least min_samples or MinPts points around it within a distance of eps $\epsilon$ ...

What is Data Science ?

Applications of Data Science

Overview of Libraries

Probability and Statistics

Machine Learning Part-1

Machine Learning Part-2

Machine Learning Part-3

Deep Learning

Machine Learning Tools and Libraries

Big Data Tools and Technologies

Where to go next ?

DBSCAN Clustering and Customer Segmentation

DBSCAN clustering

Working of DBSCAN clustering