Python
As the 21st century progresses, using data effectively has become a priority for many organisations, including the Office for National Statistics (ONS). The ONS's unique focus, however, goes beyond just utilising data effectively. The organisations ultimate goal is to create …
A common data quality problem is to have multiple different records that refer to the same entity but no unique identifier that ties these entities together. For example, customer data may have been entered multiple times by accident, or …
Learn how to create good practice accessible spreadsheets as part of Reproducible Analytical Pipelines using gptables. Learn about how we developed in the open.
...by mobile phone network across the railways. In conversations with people on different mobile phone networks I had never twigged that the degree of signal loss I experience is not...
Representing text as vectors We can represent the text on each GOV.UK page as a semantic vector. This is denoted by a list of numbers, which conveys information about the...
A year and a half ago, two GDS Designers asked me, “Can you show us how Highway Code content on the GOV.UK site is performing?” This would have been a simple request, were it not for the sheer number of pages …