How to Access and Use Ministry of Justice Datasets for Academic Research
18 March 2024
This showcase invited Ministry of Justice analysts to provide an overview of the datasets available to researchers as part of its Data First programme. A summary of the event can be found below.

The showcase was in collaboration between the Ministry of Justice (MoJ) and the UCL Centre for Education and Criminal Justice (ECJ) and supported by UCL Public Policy and UCL Grand Challenge of Justice and Equality.
Data First is a ground-breaking data linkage programme led by the MoJ and funded by ADR UK (Administrative Data Research UK) to link and enable access to administrative data from across the justice system and beyond for research. MoJ analysts provided an overview of the datasets the department makes available to researchers. The programme includes linked data from the magistrates’ courts, Crown Court, prison and probation services, civil courts, and family courts, as well as linked Family Courts-Cafcass and Census 2021 datasets and Ministry of Justice-Department for Education datasets, which joins the Police National Computer (PNC) to the National Pupil Database (NPD). The Offender Assessment System (OASys) dataset will be available soon.
Using a probabilistic matching algorithm platform called Splink, it enables analysis of the users’ journeys, interactions, and pathways across the system and with a range of other public services, including education and employment. There are over 110 million records linked across the justice system.
- The Ministry of Justice Cross-Justice linked dataset joins data from criminal courts, prison and probation services, and civil and family courts. This linkage enables research on which individuals reappear in various parts of the justice system, questions about the overlaps and intersections between users of services in different justice jurisdictions, and their journeys and pathways.
- The Data First Family Court–Cafcass and Census linked dataset contains information on adoption, divorce, Family Law Act, public law, and private law cases and their legal outcomes in England and Wales. Available via the SAIL Databank, the de-identified linked dataset permits unprecedented research of the family and household characteristics of family court users.
- The Ministry of Justice and Department for Education linked dataset allows analysis of the links between childhood characteristics, educational outcomes, and (re)offending. The dataset includes data from prisons, courts, the Police National Computer, the National Pupil Database, looked-after children, and children in need.
The MoJ Areas of Research Interest (2020) outlined the high-level areas of research interest for MoJ, intending to support ongoing and new engagement with the external research community. Data First can help us address some of the research priorities outlined in the ARI, but it connects with academics to fill in the evidence gaps.
Researchers who wish to receive access to Data First datasets, either through the ONS SRS or SAIL Databank, will need to be accredited researchers and complete several application forms linked in the attached presentation. The Data First team can support researchers in the application process or discuss the feasibility of projects.
The audience received a practical demonstration of the data. Also, the audience heard from two academic researchers who used the linked data within their research:
- Dr Georgina Mathlin (Data First Evaluation Fellowship) shared her project: “Using linked administrative data to evaluate policy interventions in the justice system” and a related blog post.
- Dr Tim McSweeney (ADR UK Data First Fellowship) discussed his project: “A national data linkage study to assess the extent, nature, and outcomes of ‘serious and organised crime’ prosecuted before the criminal courts in England and Wales.” More information about his work can be found here.
NEW for 2025
We welcomed back the Ministry of Justice's Data First team to provide a fresh overview of its available datasets and how to access them. With recent data refreshes expanding their linked datasets to include the Civil and Family courts, linking these to the criminal justice system datasets for the first time, the scope of the questions that can be addressed using Data First continues to widen. This included the Ministry of Justice Offender Assessment System (OASys) dataset, which provides data on offender assessments recorded for service users in custody and in the community system in England and Wales from 01/01/2011.
MoJ analysts have produced a new statistical report, "Profile of Repeat Offending by Children and Young People in England and Wales", using their linked datasets to explore repeat offending of young children and young people in England and Wales, an area of substantial policy interest. By linking criminal courts and offender assessment data, this analysis provided a completely fresh look at their offending patterns and how the characteristics and needs of young people who frequently offend differ from those who do not.