Introduce Similarity
Understand the concept of similarity and its relation to distance, both of which are fundamental to entity resolution.
Entity resolution is about identifying records that belong to the same real-world entity. We compare candidate pairs of records and decide, for each pair, whether it is a match or a non-match. In other words, we have to solve a binary classification problem.
Features for binary classification
Let’s introduce feature engineering in the context of entity resolution.
We feed the model vectors of numeric values.
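To make the idea of a numeric vector concrete, here is a minimal sketch that turns one candidate pair into a feature vector with one similarity score per attribute. It assumes string-valued attributes and uses `SequenceMatcher` from Python's standard `difflib` module as a stand-in similarity measure; the lesson may use a different one. The helper names `string_similarity` and `pair_features`, as well as the two sample records, are hypothetical and exist only for illustration.

```python
from difflib import SequenceMatcher
from typing import Dict, List, Sequence


def string_similarity(a: str, b: str) -> float:
    """Return a score in [0, 1]; 1.0 means the two strings are identical.

    Assumption: difflib's ratio is used here only as a simple stand-in
    for whatever similarity measure is actually chosen.
    """
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def pair_features(
    record_a: Dict[str, str],
    record_b: Dict[str, str],
    attributes: Sequence[str],
) -> List[float]:
    """Build the feature vector for a candidate pair: one score per attribute."""
    return [string_similarity(record_a[attr], record_b[attr]) for attr in attributes]


# Hypothetical records, invented for illustration only.
left = {"name": "Jon Smith", "city": "Berlin"}
right = {"name": "John Smith", "city": "Berlin"}

features = pair_features(left, right, ["name", "city"])
print(features)  # two scores: the name score is below 1.0, the city score is exactly 1.0
```

A binary classifier would then receive one such vector per candidate pair and predict match or no-match from it.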