Splink: Fast, accurate and scalable record linkage

A common data quality problem is to have multiple different records that refer to the same entity but no unique identifier that ties these entities together. For example, customer...
A common data quality problem is to have multiple different records that refer to the same entity but no unique identifier that ties these entities together. For example, customer...
...free tools that I use to convert postcode data into useful maps. My example shows sample data of football stadiums in Great Britain. Mapping Sheets This Google Sheets add-on allows...
...metrics’ or collecting data just because you can, focus on what you’re actually going to do with it. A simple performance framework could consist of: what’s the objective? what’s happening...
The Companies House Service is a new, unified web service that brings together the company search and filing functionality that was previously split across disparate, legacy web services. This free...
...supplier would essentially mean starting from scratch, rather than having a code base that they own and can have someone else improve. The assessment panel recommends that the entire team...
...keep the raw data on a spreadsheet with a higher set of permissions and just pull in what you need onto the dashboard Sheet. Adding the code The process consists...
...simple request, were it not for the sheer number of pages that existed at that that time. You can see what it looked like on the National Archives site: ...
...to new data. The choices made by the analysts can be made open, because they are embodied in the code that can be made open. This includes details that might...
...if they want to learn. This blog gives free resources that the GDS Data Science team like (there are also many other good resources online). Introducing Data Science For a...
...a sustainable service that’s free for assisted digital users. This requirement is only unnecessary if there is demonstrable proof from user research that there are no users with assisted digital...