An Introduction to Entity Resolution in Python

Explore entity resolution in Python, including use cases, semantic preprocessing, graph clustering, and weak supervision. Boost business value with hands-on coding and strategic decisions.
5.0
63 Lessons
8h
Join 2.8 million developers at
A typical business stores data across multiple systems, including ERPs for operations, a CRM for marketing, files, notebooks, and BI apps for other purposes. Records of the same customer (entity) exist in multiple places, likely not in sync across nor unique within sources. This inconsistent situation generates an opportunity for us to drive business value by cross-referencing and deduplicating records with entity resolution. This course covers business acumen and hands-on coding. It starts with several business cases and a quick introduction to entity resolution in Python. Then, it explores semantic-preserving preprocessing, similarity feature engineering, graph clustering, weak supervision, confident learning, and integration. As a developer, you’ll increase your company’s business value by developing and deploying entity resolution pipelines. As a decision-maker, you’ll know which solution best suits your business cases and how to negotiate the best value for your money.
A typical business stores data across multiple systems, including ERPs for operations, a CRM for marketing, files, notebooks, an...Show More

WHAT YOU'LL LEARN

The ability to deduplicate records using Python
Familiarity with an entity resolution framework and business cases
An understanding of semantic similarity and search
Experience with classification in the context of entity resolution
Hands-on experience in data-centric AI using weak supervision and confident learning
The ability to deduplicate records using Python

Show more

Learning Roadmap

Your Personalized Roadmap is ready!
Your roadmap is tailored to your weekly
schedule - adjust it anytime.
Your roadmap is tailored to your weekly schedule - adjust it anytime.
You can customize your roadmap further or retake assessment from here
Certificate of Completion
Showcase your accomplishment by sharing your certificate of completion.
Author NameAn Introduction to EntityResolution in Python

Course Author:

Developed by MAANG Engineers
Every Educative lesson is designed by a team of ex-MAANG software engineers and PhD computer science educators, and developed in consultation with developers and data scientists working at Meta, Google, and more. Our mission is to get you hands-on with the necessary skills to stay ahead in a constantly changing industry. No video, no fluff. Just interactive, project-based learning with personalized feedback that adapts to your goals and experience.

Trusted by 2.8 million developers working at companies

Fuel Your Tech Career with Smarter Learning

Built for 10x Developers
Get job-ready by lessons designed by industry professionals
Roadmaps Built Just for You
One-size-fits-all courses are a thing of the past
Keeping you state-of-the-art
Future proof yourself with our catalog
Meet PAL - Your AI Coach
Get Personalized feedback from your personalized learning agent
Built to Simulate the MAANG Experience
AI Mock Interviews & Quizzes with targeted guidance

Free Resources

FOR TEAMS

Interested in this course for your business or team?

Unlock this course (and 1,000+ more) for your entire org with DevPath