Advanced
63 Lessons
8h
Certificate of Completion
Takeaway Skills
The ability to deduplicate records using Python
Familiarity with an entity resolution framework and business cases
An understanding of semantic similarity and search
Experience with classification in the context of entity resolution
Hands-on experience in data-centric AI using weak supervision and confident learning
Course Overview
A typical business stores data across multiple systems, including ERPs for operations, a CRM for marketing, files, notebooks, and BI apps for other purposes. Records of the same customer (entity) exist in multiple places, likely not in sync across nor unique within sources. This inconsistent situation generates an opportunity for us to drive business value by cross-referencing and deduplicating records with entity resolution. This course covers business acumen and hands-on coding. It starts with several bu...
Course Content
Introduction to Entity Resolution and Applications
A Quickstart Guide Using the RecordLinkage Package
Preprocessing
Indexing
Feature Engineering
Pairwise Matching
12 Lessons
Clustering
6 Lessons
Integration
8 Lessons
Entity Resolution Fundamentals
Assessment
Matching Products Across Two Online Shops
Project
Conclusion
1 Lesson
Appendix
3 Lessons
How You'll Learn
You don’t get better at swimming by watching others. Coding is no different. Practice as you learn with live code environments inside your browser.
Videos are holding you back. Educative‘s interactive, text-based lessons accelerate learning — no setup, downloads, or alt-tabbing required.
Learn faster and smarter with adaptive AI tools embedded in every Educative course.
Built-in assessments let you test your skills. Completion certificates let you show them off.
Recommended Courses
BEFORE STARTING THIS COURSE
AFTER FINISHING THIS COURSE