Splink: Fast, accurate and scalable record linkage
Posted by:
Robin Linacre - Data scientist, Ministry of Justice, Posted on:
-
Categories:
Data Engineering, Data science, Python
A common data quality problem is to have multiple different records that refer to the same entity but no unique identifier that ties these entities together. For example, customer data may have been entered multiple times by accident, or …