Data Engineering

Secure by Design: how automation is strengthening data assurance across government

Digital illustration of hyper-fast data processing in a server room, showing a glowing blue and gold double helix of data funneling into a central processing cube, surrounded by holographic screens displaying real-time analytics.

Discover how GDS is using automation and secure design to improve how the government collects and analyses security assurance data.

Making the Algorithmic Transparency Recording Standard (ATRS) mandatory across government

Bounding boxes are commonly used in AI research to signify where a computer vision algorithm has detected an object in an image. Here the artist has played with this aesthetic: The bounding boxes are 3D-printed frames positioned in the physical environment around objects. Sometimes the objects stick out of their frame.

The use of the Algorithmic Transparency Recording Standard (ATRS) became mandatory for central government in 2024. Read about how the GDS Data and AI ethics team have rolled out the mandate across government and how they have updated the ATRS to reflect learnings from this process.

Using Data Science for Next-Gen Statistics

Rap sticker on a laptop

As the 21st century progresses, using data effectively has become a priority for many organisations, including the Office for National Statistics (ONS). The ONS's unique focus, however, goes beyond just utilising data effectively. The organisation's ultimate goal is to create …

Splink: Fast, accurate and scalable record linkage

Categories: Data Engineering, Data science, Python
Some of the graphical outputs of Splink

A common data quality problem is having multiple records that refer to the same entity but no unique identifier that ties them together. For example, customer data may have been entered multiple times by accident, or …
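
To make the problem concrete, here is a minimal sketch of spotting likely duplicate records when no shared identifier exists. It does not use Splink's API or its probabilistic linkage approach; it is a simple illustration using only Python's standard library. The records, field names, the `likely_duplicates` helper and the 0.85 threshold are all hypothetical choices made for this example.

```python
from difflib import SequenceMatcher
from itertools import combinations

# Hypothetical customer records with no shared unique identifier.
records = [
    {"id": 1, "name": "John Smith", "postcode": "SW1A 2AA"},
    {"id": 2, "name": "Jon Smith", "postcode": "SW1A 2AA"},
    {"id": 3, "name": "Mary Jones", "postcode": "M1 1AE"},
]


def similarity(a: str, b: str) -> float:
    """Return a similarity ratio between 0 and 1 for two strings, ignoring case."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def likely_duplicates(rows, threshold=0.85):
    """Return pairs of record ids with matching postcodes and similar names."""
    pairs = []
    # Compare every pair of records once (fine for tiny inputs, quadratic in general).
    for left, right in combinations(rows, 2):
        same_postcode = left["postcode"] == right["postcode"]
        if same_postcode and similarity(left["name"], right["name"]) >= threshold:
            pairs.append((left["id"], right["id"]))
    return pairs


matches = likely_duplicates(records)
print(matches)
```

Comparing every pair of records does not scale beyond small datasets, and a single fixed threshold is a blunt instrument. Those limitations are part of the motivation for dedicated record linkage tools.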