Machine learning-based record linkage

Record linkage can be modeled as a machine learning problem and solved with either unsupervised or supervised methods. When we have only the features of the tuples we want to de-duplicate and no ground-truth labels, an unsupervised learning method such as K-means is employed.

Let us look at the unsupervised approach.
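
As a rough illustration of how K-means might be applied to de-duplication, the sketch below compares every pair of records field by field and clusters the resulting similarity vectors into two groups. Everything in it is an assumption made for illustration: the toy records, the helper names (`similarity`, `pair_features`, `two_means`), the use of `difflib.SequenceMatcher` as the similarity measure, and the choice to start the two centroids at the all-zeros and all-ones corners so that the second cluster can be read as the "match" cluster. It is not the book's own implementation.

```python
from difflib import SequenceMatcher
from itertools import combinations

# Hypothetical toy records (name, city); not taken from the original text.
records = [
    ("John Smith", "New York"),
    ("Jon Smith", "New York"),
    ("Maria Garcia", "Boston"),
    ("Maria Garcia", "Bostn"),
    ("Wei Chen", "Seattle"),
]

def similarity(a, b):
    """String similarity in [0, 1], computed with the standard library."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()

def pair_features(r1, r2):
    """One similarity score per field for a candidate pair of records."""
    return [similarity(x, y) for x, y in zip(r1, r2)]

def sq_dist(u, v):
    """Squared Euclidean distance between two feature vectors."""
    return sum((a - b) ** 2 for a, b in zip(u, v))

def two_means(points, iterations=20):
    """K-means with k=2; centroids start at the all-zeros and all-ones corners."""
    dim = len(points[0])
    centroids = [[0.0] * dim, [1.0] * dim]
    labels = [0] * len(points)
    for _ in range(iterations):
        # Assignment step: each point joins its nearest centroid.
        labels = [min((0, 1), key=lambda k: sq_dist(p, centroids[k])) for p in points]
        # Update step: move each centroid to the mean of its members.
        new_centroids = []
        for k in (0, 1):
            members = [p for p, lab in zip(points, labels) if lab == k]
            if members:
                new_centroids.append([sum(col) / len(members) for col in zip(*members)])
            else:
                new_centroids.append(centroids[k])  # keep an empty cluster in place
        if new_centroids == centroids:
            break  # converged: centroids no longer move
        centroids = new_centroids
    return labels, centroids

# Build one feature vector per pair of records (no ground truth needed).
pairs = list(combinations(range(len(records)), 2))
features = [pair_features(records[i], records[j]) for i, j in pairs]
labels, centroids = two_means(features)

# Cluster 1 started at the all-ones corner, so it is treated as the "match" cluster.
matches = [pair for pair, lab in zip(pairs, labels) if lab == 1]
print(matches)
```

On this toy data the two near-duplicate pairs, records 0 and 1 and records 2 and 3, land in the match cluster. Comparing all pairs is quadratic in the number of records, so a real pipeline would likely restrict the candidate pairs before clustering.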
