Reproducible Analytical Pipelines

...the code, and ensuring that the code does what we expect it to, but because we have written an R package, it’s also very easy for us to institute tests...
...the code, and ensuring that the code does what we expect it to, but because we have written an R package, it’s also very easy for us to institute tests...
...developers, the team cannot have confidence that the code delivered to web browsers works for service users on the devices and browsers they use, or in a way that is...
...NLP is used to interpret unstructured text data, such as free-text notes or survey feedback. It can help us look for similarities and uncover patterns in what people have written,...
...surprising: 53% of the time, manual coders disagreed about how a survey should be coded. We can see this in the chart below, which shows how often volunteer coders applied...
A common data quality problem is to have multiple different records that refer to the same entity but no unique identifier that ties these entities together. For example, customer...
...free tools that I use to convert postcode data into useful maps. My example shows sample data of football stadiums in Great Britain. Mapping Sheets This Google Sheets add-on allows...
...metrics’ or collecting data just because you can, focus on what you’re actually going to do with it. A simple performance framework could consist of: what’s the objective? what’s happening...
The Companies House Service is a new, unified web service that brings together the company search and filing functionality that was previously split across disparate, legacy web services. This free...
...supplier would essentially mean starting from scratch, rather than having a code base that they own and can have someone else improve. The assessment panel recommends that the entire team...
...keep the raw data on a spreadsheet with a higher set of permissions and just pull in what you need onto the dashboard Sheet. Adding the code The process consists...