Introduce Similarity
Understand the concept of similarity and its relation to distance which is fundamental for entity resolution.
Entity resolution is about identifying records that belong to the same real-world entity. We compare candidate pairs of records and decide if it is a match or no-match for each. In other words, we have to solve a binary classification problem.
Features for binary classification
Let’s introduce feature engineering in the context of entity resolution. Let
We feed the model with vectors of numeric values
Get hands-on with 1400+ tech skills courses.