An Introduction to Entity Resolution in Python/

...

Match vs. No-Match

Combine individual scores into a match vs. no-match prediction policy using plausible rules.

We'll cover the following...

Matching names and addresses
From matches to cross-references
Key takeaway

Press + to interact

Python 3.8

rule_1 = scores['customer_name_c_score'].ge(0.8) & scores['street_c_score'].ge(0.8)
rule_2 = scores['customer_name_c_score'].ge(0.9) & scores['street_c_score'].ge(0.5) & scores['city_c_score'].ge(0.8)
rule_3 = scores['customer_name_p_score'].ge(0.9) & scores['street_p_score'].ge(0.9) & scores['city_p_score'].ge(0.9)
rule_4 = scores['phone_c_score'].eq(1.)
# Match if any individual rule is true, else no match:
predicted_matches = scores.loc[rule_1 | rule_2 | rule_3 | rule_4].index
print(predicted_matches[:3])  # Print 1st three matches as an example

Introduction to Entity Resolution and Applications

A Quickstart Guide Using the RecordLinkage Package

Preprocessing

Indexing

Feature Engineering

Pairwise Matching

Clustering

Integration

Entity Resolution Fundamentals

Matching Products Across Two Online Shops

Conclusion

Appendix

Auto-Tagging System for Content Categorization

Match vs. No-Match