Data linkage in Python
- Open to
- Government analysts
- Training category
- Analytical, Data science
- Type of training
- Online
- Length
- 6 hours
- Organiser
- Data Science Campus Faculty
- Provider
- Data Science Campus Faculty
- Location
- Online
Data linkage is the process of joining multiple datasets together and linking records. It increases the number of ways that each dataset can be used for various research needs. This means that the resources spent collecting data are most effectively used.
This course covers the practical application of linking data in Python. A similar course will be available for those who prefer working in R.
Learning outcomes
On this course you will learn how to perform tasks including:
- pre-linkage and preparing data
- exact matching
- rule-based matching
- Score-based matching
- Fellegi-Sunter probabilistic matching
- post-linkage and quality evaluation
How to book
Please use your Learning Hub account to access the course online. If you do not have a Learning Hub account, please contact Data.Science.Campus.Faculty@ons.gov.uk.
Contact
If you would like more information about this course, please email Data.Science.Campus.Faculty@ons.gov.uk.