The Marvellous Mechanical Matching Engine --- Help Page

This page describes how to run the Mmme software. For descriptions of the algorithms it uses consult papers on the subject available at http://www.richardclegg.org/pubs/.

Data format expected

The data is expected to be in plain text files with each line of the file being an observation. Only the first "string" on a line is taken to be an observation with the rest considered as comments on the data. Lines beginning with a # are ignored as comments. An example file is shown below:

# Data gathered on HIGH STREET 12/4/03 12:43 -- 14:43
ABC123X 12:43  TAXI
DGA543Z 12:44  HGV
OAS12Y  12:44
.
.
.
Note that only the first column (up to a space character) will be matched.

File Menu

This is for loading and saving of configuration data. The data saved is the error rates and probabilities for matching. The commands, Open, Save and Save As perform the expected actions on configuration files which are plain text files usually ending in .mme containing data about the error rates and matching probabilities.

Data Menu

This allows users to add new files of observations. Add allows the user to choose a new file to make a match with --- the file must have the format discussed above. Remove, removes a file from a potential match and Purge removes all such files. Individual files can be checked and unchecked in this menu to get a selective match between just a few files. If two or more files have been read in the the Match selection does the work of performing a match. Raw matches are output followed by matches corrected for false matches (this should be lower than the raw matches) and matches corrected for errors in recording (this should be higher than the previous corrected figure).

Options Menu

This menu allows you to set the error rates and probabilities of false matches. The probabilities menu allows you to set p(2), p(3) up to p(8). These probabilities can be set individually or to default values. The York default values are callibrated for data sets observed in the city of York (UK) in 2000 and 2001. The exponential fall off button allows you to set the probabilities to fall-off by a factor of p(2) from the last edited probability. The default setting is to have p(2)= 0.0001, p(3) the square of this, p(4) the cube and so on.

The Error rate menu allows you to set the individual rate of erroneous recording at each site as a fraction.

For more information about this project go to http://www.richardclegg.org/matching/