Introduction to Data Linkage

Date:

20/03/2015

Organised by:

University of Southampton/ADRC-E & Farr Institute

Presenter:

Katie Harron

Level:

Entry (no or almost no prior knowledge)

Contact:

adrce@soton.ac.uk

Map:

NW1 2DA

Venue:

Farr Institute, 222 Euston Road, London

Description:

Course places are limited and registration by 13 Mar 2015 is strongly recommended.

Course No. ADRCE-Training012 Harron

This course is jointly organised by ADRC-E/University of Southampton and Farr Institute.

Summary of Course:

This short course gives participants a practical introduction to data linkage and is aimed at researchers who intend either to use data linkage themselves or to analyse linked data. It covers examples of the uses of data linkage, data preparation, methods for linkage (including deterministic and probabilistic approaches) and issues for the analysis of linked data. The main focus of the course is health data, although the concepts apply to many other areas. The course includes a practical exercise in linking data, so that participants can put theory into practice.

 

Course Contents:

The course covers:

  • Overview of data linkage (data linkage systems, benefits of data linkage, types of projects)
  • Overview of linkage methods (deterministic and probabilistic)
  • The linkage process (data preparation, blocking, classification)
  • Performing probabilistic linkage
  • Evaluating linkage quality (types of error, analysis of linked data)
  • Reporting analysis of linked data
  • Practical session using LinkPlus
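
As a rough illustration of the difference between the deterministic and probabilistic methods listed above, the following minimal Python sketch compares two records. It is not taken from the course materials and does not use LinkPlus: the field names, the example records and the m- and u-probabilities are all hypothetical values chosen for illustration, and the scoring follows the general log-ratio match-weight idea rather than any specific tool's implementation.

```python
import math

# Hypothetical agreement probabilities for each identifier (illustrative only):
# m = P(fields agree | records belong to the same person)
# u = P(fields agree | records belong to different people)
M_U = {
    "surname": (0.95, 0.01),
    "dob": (0.98, 0.003),
    "postcode": (0.90, 0.02),
}


def deterministic_match(a, b, keys=("surname", "dob", "postcode")):
    """Deterministic rule: link only if every identifier agrees exactly."""
    return all(a[k] == b[k] for k in keys)


def match_weight(a, b):
    """Probabilistic score: sum of log2 agreement/disagreement weights."""
    total = 0.0
    for field, (m, u) in M_U.items():
        if a[field] == b[field]:
            total += math.log2(m / u)              # agreement adds evidence for a match
        else:
            total += math.log2((1 - m) / (1 - u))  # disagreement subtracts evidence
    return total


# Two made-up records that differ only in the last character of the postcode.
rec_a = {"surname": "smith", "dob": "2010-04-01", "postcode": "NW1 2DA"}
rec_b = {"surname": "smith", "dob": "2010-04-01", "postcode": "NW1 2DB"}

print(deterministic_match(rec_a, rec_b))  # exact-match rule rejects the pair
print(round(match_weight(rec_a, rec_b), 2))  # weight stays positive despite one disagreement
```

In this sketch the deterministic rule rejects the pair because of a single differing field, while the probabilistic weight remains positive; in practice a threshold on the weight would be chosen to classify pairs as links or non-links.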

 

Learning Outcomes:

By the end of the course participants will:

  • Understand the background and theory of data linkage methods
  • Perform deterministic and probabilistic linkage
  • Evaluate the success of data linkage
  • Appropriately report analysis based on linked data

 

Computer Software and Workshop:

This event includes computer workshops.

The workshops use Excel and LinkPlus software (freely available from http://www.cdc.gov/cancer/npcr/tools/registryplus/lp_tech_info.htm).

 

Presenter

Katie Harron is a Sir Henry Wellcome Postdoctoral Fellow at the London School of Hygiene and Tropical Medicine. Katie is a statistician with research interests in using linkage of administrative data and electronic health records for health research. Her current research involves establishing how nationally representative, population-level administrative data can be used to identify and measure determinants of variation in service use and educational outcomes for children born preterm, by linking information on maternal, household and social environments. Katie completed her PhD in Statistics at University College London. She is a co-editor of a Wiley-commissioned book, “Methodological developments in data linkage”, with Prof Harvey Goldstein and Prof Chris Dibben.

 

Target Audience:

The course is aimed at researchers who need to gain an understanding of data linkage techniques. It provides an introduction to data linkage theory and methods for those who might be using linked data in their own work. Participants may be academic researchers in the social and health sciences, or may work in government, survey agencies, official statistics, charities or the private sector.

 

Course Programme:

The course will start with registration and coffee at 9.45am, with formal teaching starting at 10.00am and finishing at 4.15pm. Afterwards there will be an opportunity for participants to ask questions about the course and to discuss with the instructor how to link their own datasets (you can bring your own data to the course if you wish).

(This is a draft programme, subject to minor changes.)

Session 1: 10.00am-11.15am

  • Overview
  • Linkage methods

 

Session 2: 11.30am-12.45pm

  • Linkage process
  • Probabilistic linkage

 

Session 3: 1.45pm-3.00pm

  • Practical session

 

Session 4: 3.15pm-4.15pm

  • Linkage evaluation
  • Reporting analyses of linked data

 

Following Session 4, there will be an opportunity to discuss how to link your own datasets.

 

Pre-requisites:

The course does not assume any prior knowledge of data linkage. Some experience of using Excel will be useful for the practical session.

 

Course Materials:

Participants will receive written course notes.

Cost:

1. £30 - For UK-registered postgraduate students
2. £60 - For staff at UK academic institutions, RCUK-funded researchers, registered charity organisations and public sector workers
3. £350 - For all other participants
4. Free place - For ADRC-E and ADRN/ADS staff
All fees include event materials, lunch, and morning and afternoon tea. They do not include travel and accommodation costs.

Website and registration:

Region:

Greater London

Keywords:

Longitudinal Research, Analysis of administrative data, Quality in Quantitative Research, Data linkage

