Matching Data Sets at Multiple Sites

This was an EPSRC-funded project studying the matching of data across multiple data sets. The project finished in April 2005. The aims of the project were:

  1. To develop and publicise a recently created framework for data analysis problems which match individuals across data files.
  2. To find novel uses for data analysis of this type.
  3. To develop a method for estimating the number of matches of individuals across multiple data sets in the presence of errors.
  4. To develop a method for estimating the reliability of the estimates produced in point three above.
  5. To develop, document and publish free software to implement the methods developed in the points above.

The research on this project was carried out at the Department of Mathematics of the University of York. The main researcher was Richard G. Clegg, and the project formed part of the work of the Networks and Nonlinear Dynamics Group.


Information provided by Richard G. Clegg on 4/2/2005.