Manual Review and Labeling

Learn how to enhance the review experience with the help of clustering.

Clustering is a critical step in every entity resolution pipeline. Most importantly, it resolves conflicts from pairwise matching and enables us to build a cross-reference table.
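
As a rough illustration of how clustering can turn pairwise matches into a cross-reference table, here is a minimal sketch. It assumes connected components (via union-find) as the clustering method, which is only one simple option and not necessarily the one used in this course. The function name `cluster_matches`, the record IDs, and the matched pairs are all hypothetical.

```python
def cluster_matches(record_ids, matched_pairs):
    """Group records into clusters with union-find (connected components).

    Returns a cross-reference table as a dict: record ID -> cluster ID.
    Assumption: any chain of pairwise matches puts records in one cluster.
    """
    parent = {rid: rid for rid in record_ids}

    def find(x):
        # Walk up to the root, shortening the path along the way.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for a, b in matched_pairs:
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            # Keep the smaller ID as the root so cluster IDs are deterministic.
            parent[max(root_a, root_b)] = min(root_a, root_b)

    return {rid: find(rid) for rid in record_ids}


# Hypothetical data: the matcher linked a1~b1 and b1~a2 but never a1~a2.
record_ids = ["a1", "a2", "b1", "b2", "c1"]
matched_pairs = [("a1", "b1"), ("b1", "a2")]

xref = cluster_matches(record_ids, matched_pairs)
for rid, cluster_id in sorted(xref.items()):
    print(rid, "->", cluster_id)
```

In this toy input the pairwise results are inconsistent: `a1` and `a2` both match `b1` but were never matched to each other. The connected-components choice resolves that by placing all three in one cluster, while `b2` and `c1` remain singletons. Other clustering methods may resolve such conflicts differently, for example by splitting weakly connected groups.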

We can stop after clustering if we are satisfied with the resolution quality, or we can start another training cycle with the help of some manual review, which is the topic of this lesson.

Humans in the loop

The following figure shows one of many possible entity resolution workflows, with two (optional) spots for humans in the loop.

