Similarity and Dissimilarity Measures
We'll cover the following...
Similarity or dissimilarity measures are core components of clustering algorithms that cluster similar data points into the same clusters. In contrast, dissimilar or distant data points are placed into different clusters. Although the choice of a similarity/dissimilarity measure is task-dependent, it’s good to know the common ones.
Note: The measures involve two data points, say and , in .
Minkowski distance
The Minkowski distance between points and is defined as follows:
...