UCL Institute of Health Informatics


One Data Science: A-Level Student Access Scheme

One Data Science: Inspiring and empowering young people to learn about coding and health data science and opening up opportunities to pursue higher education in this area.


The UCL Institute of Health Informatics specialises in using electronic health records for research to develop a better understanding of health and disease. We are inviting twenty A-Level students (or equivalent) with an interest in health data science to learn about health data science and coding. You will attend three 2-hour sessions led by experienced health data scientists and Ph.D. students at UCL. In these sessions you will work with other students on a small data science project and gain hands-on experience in coding, data analysis, and data visualisation. The sessions will be run entirely online over Zoom or MS Teams.


Complete the application form

Application deadline: Wednesday 8 June 2022

Programme Details

Week 1 – Introduction to data science and programming 

We will start by introducing data science and some of its use cases, followed by programming languages and learning resources. Students will be introduced to the RStudio Cloud environment and will learn how to install R packages. Working in breakout rooms, students will download a dataset, import it into R, and explore it by generating summary tables.

Week 2 – Getting started with a data science project 

By the second week, students will already have some experience with R. This week’s session introduces the data analysis workflow and shows students how to get started with a data science project in R, using sample datasets that we provide.
Topics to be covered are:
1) Data wrangling to transform the original dataset into other formats
2) Creating new variables from existing variables
3) Generating descriptive statistics
4) Data visualisation using ggplot2

Week 3 – Introduction to basic statistics 

Students will be introduced to common statistical models used by businesses and in the healthcare sector. Students will run statistical models such as linear regression or logistic regression and learn to interpret the results. Students will also submit a visualisation generated using ggplot2 for the Data Visualisation Competition. There are three prizes to be won.