Definition: Data clustering

Clustering is a classic data mining technique based on machine learning that divides ​groups of abstract objects into classes of similar objects.

Clustering helps to split data into several subsets. Each of these clusters consists of data objects with high inter-similarity and low intra-similarity.

Clustering methods can be classified into the following categories:

  • Partitioning method
  • Hierarchical method
  • Density-based method
  • Grid-based method
  • Model-based method
  • Constraint-based method
Clustering
Clustering

Applications

Clustering helps to identify customers who have a certain customer record and similar conduct (i.e., buying patterns). In many applications (i.e., market research, pattern recognition, data and image processing), the clustering analysis is used in large numbers. Clustering can also help advertisers to find different groups within their customer base. For example, customer groups can be defined by buying patterns.

Free Resources

Copyright ©2024 Educative, Inc. All rights reserved