Robin Linacre - Data scientist, Ministry of Justice

Splink: Fast, accurate and scalable record linkage

Categories: Data Engineering, Data science, Python
Some of the graphical outputs of Splink

A common data quality problem is having multiple records that refer to the same entity, with no unique identifier that ties those records together. For example, customer data may have been entered multiple times by accident, or …

Help us start a data revolution for government

Laptop screen with a post-it note saying ‘Join the conversation!’

We want to start a data revolution focusing on service delivery and improving human experience. Find out what we think government needs to do and how you can help.