Data linkage in R

Open to
Government analysts
Training category
Analytical, Data science, Data linkage
Type of training
12 hours
Data Science Campus Faculty
Data Science Campus Faculty

Data linkage is the process of joining multiple datasets together and linking records. It ensures that the resources spent collecting data are most effectively used by increasing the ways each dataset can be used for various research needs.

This course covers the practical application of linking data in R. A similar course is available for those who prefer working in Python.

Learning outcomes

On this course you will learn how to perform tasks including:

  • pre-linkage and preparing data
  • exact matching
  • rule-based matching
  • score-based matching
  • Fellegi-Sunter probabilistic matching
  • post-linkage and quality evaluation

How to book

Please use your Learning Hub account to access the course online. If you do not have a Learning Hub account, please contact


If you would like more information about this course, please email 

Related courses

Introduction to R

Introduction to data linkage

Data linkage in Python