Prof. Peter Christen. Tutorial: Record Linkage – Introduction, Recent Advances, and Privacy Issues. Part 1. - Description
Summary: The aim is to make this tutorial as accessible as possible to a wide-ranging audience from various backgrounds. The content will focus on concepts and techniques rather than on the details of algorithms. A basic understanding of databases, algorithms, and probabilities will be beneficial but is not required.

The tutorial will be based on the book “Data Matching – Concepts and techniques for Record Linkage, Entity Resolution and Duplicate Detection” (Springer, 2012), written by the presenter.

Tutorial outline:
Part 1: Record linkage introduction, a short history of record linkage, applications, and the record linkage process (overview of the main steps).
Part 2: Detailed discussion of all steps of the record linkage process (data cleaning and standardisation, indexing/blocking, field and record comparisons, classification, and evaluation), and the core techniques used in these steps.
Part 3: Advanced record linkage techniques with a focus on linking databases containing personal information (such as those occurring in health and national census data), including collective, group, and graph linking techniques, as well as advanced indexing techniques that enable large-scale record linkage and, time permitting, linking temporal and dynamic data and real-time record linkage.
Part 4: Major concepts, protocols, and challenges in privacy-preserving record linkage, which aims to link databases across organisations without revealing any private or confidential information.
Prof. Peter Christen. Tutorial: Record Linkage – Introduction, Recent Advances, and Privacy Issues. Part 1. - Biography
Biography: Peter Christen is a professor at the Research School of Computer Science at the Australian National University. He received his Diploma in Computer Science Engineering from ETH Zurich in 1995 and his PhD in Computer Science from the University of Basel in 1999. His research interests are in data mining and record linkage, with a focus on machine learning and privacy-preserving techniques for record linkage. He has published over 140 articles in these areas, as well as the book Data Matching, published by Springer in 2012. For more details see: http://cs.anu.edu.au/people/Peter.Christen/

Charini Nanayakkara is a PhD student at the Australian National University (ANU), where her research focuses on record linkage techniques for complex historical birth, marriage, death, and census data. She received her BSc (Hons) degree in Computer Science from the University of Colombo School of Computing, Sri Lanka, in 2016. Before joining the ANU as a PhD student in March 2018, she worked as a software engineer at WSO2 Lanka Pvt. Ltd for two years. Charini’s research is part of the Digitising Scotland project (https://www.lscs.ac.uk/projects/digitising-scotland/). Her publications can be found at: https://scholar.google.com.au/citations?user=bQwdgp0AAAAJ&hl=en

Thilina Ranbaduge is a research fellow at the Australian National University (ANU) Research School of Computer Science. His research interests are in data mining, and in multidatabase and privacy-preserving record linkage. He received his PhD in Computer Science from the ANU in 2018 and completed his PG.Dip and BSc (Hons) at the University of Moratuwa, Sri Lanka, in 2013 and 2009 respectively.