Methods Inf Med 1991; 30(02): 117-123
DOI: 10.1055/s-0038-1634828
Medical Records
Schattauer GmbH

Record Linkage Strategies

Part I: Estimating Information and Evaluating Approaches
L. L. Roos
1   Manitoba Centre for Health Policy and Evaluation and Faculty of Medicine, University of Manitoba, Canada
,
A. Wajda
1   Manitoba Centre for Health Policy and Evaluation and Faculty of Medicine, University of Manitoba, Canada
› Author Affiliations
The authors gratefully acknowledge the help of the Manitoba Health Services Commission. This research was supported by National Health Research and Development Project No. 6607-1197-44, by Career Scientist Award No. 6607-1314-48, and by the Canadian Institute for Advanced Research. Interpretations and viewpoints contained in this paper are the authors’ own and do not necessarily epresent the opinion of the Manitoba ealth Services Commission, the Government of Manitoba, or Health and Welfare Canada. The authors also wish to thank Ruth Brazauskas, Kerry Meagher and Phyllis Jivan for help with the preparation of this manuscript.
Further Information

Publication History

Publication Date:
07 February 2018 (online)

Abstract

Record linkage techniques can help identify the same patient for matching diverse files (hospital discharge abstracts, insurance claims, registries, Vital Statistics data) which contain similar identifiers. Prior knowledge of whether a linkage is feasible is important to prevent wasted effort (additional data collection or data manipulation), which decreases the cost-effectiveness of the linkage. Using examples generated by linking the Manitoba Health Services Commission data with Vital Statistics files, a method of estimating the information in each data set is presented first. Further, the feasibility of several different record linkage strategies is described and tested, given varying amounts of information. At the margin, relatively small amounts of information (having just one more variable to match with) can make a great difference. Probabilistic linkage’s great advantage was found in those situations where only a moderate amount of extra information was available.

By using the above techniques when working with one or both files in a proposed record linkage project, a much more informed judgement can now be made as to whether a linkage will or will not work. In facilitating record linkage, flexibility of both software and the strategy for matching is very important.

 
  • REFERENCES

  • 1 Newcombe HB. Handbook of Record Linkage. New York, NY: Oxford University Press; 1988
  • 2 Smith ME. Record linkage: present status and methodology. J Clin Comp 1984; 13: 52-69.
  • 3 Roos LL, Wajda A, Nicol IP. The art and science of record linkage: methods that work with few identifiers. Comput Bioi Med 1986; 16: 45-57.
  • 4 Shannon HS, lamieson E, Walsh C. et al. Comparison of individual follow-up and computerized record linkage using the Canadian Mortality Data Base. Can J Public Health 1989; 80: 54-7.
  • 5 Buehler lW, Berkelman RL, Curran lW. Reporting of AIDS: tracking HIV morbidity and mortality. JAMA 1989; 262: 2896-7.
  • 6 Young TK, Roos NP, Hammarstrand KM. Estimating the burden of diabetes in a total population using health insurance claims data: a pilot study. Can Med Assoc J. (to be published).
  • 7 Cohen MM, Duncan PG. Physical status score and trends in anesthetic complications. J Clin Epidemiol 1988; 41: 83-90.
  • 8 Roos LL, Sharp SM, Wajda A. Assessing data quality: a computerized approach. Soc Sci Med 1989; 28: 175-82.
  • 9 Newcombe HB, Fair ME, Lalonde P. Discriminating powers of partial agreements of names for linking personal records (Part I: The logical basis). Meth Inform Med 1989; 28: 86-9l.
  • 10 Newcombe HB, Fair ME, Lalonde P. Discriminating powers of partial agreements of names for linking personal records (Part II: The empirical test). Meth Inform Med 1989; 28: 92-6.
  • 11 Newcombe HB. Abbatt 10. Probabilistic Record Linkage in Epidemiology: Principles Employed. Ottawa, Ontario: Eldorado Resources Limited; 1983
  • 12 Roos LL, Nicol IP, Roos NP. Using large- scale data banks - productivity and quality control. In: Bennett EM, Trute B. eds. Mental Health Information Systems: Problems and Prospects. New York: Edwin Mellen Press; 1984: 81-98.
  • 13 Smith ME, Newcombe HB. Automated follow-up facilities in Canada for monitoring delayed health effects. Am J Public Health 1980; 70: 1261-8.
  • 14 Howe GR, Lindsay I. A generalized iterative record linkage computer system for use in medical follow-up studies. Com put Biomed Res 1981; 14: 327-40.
  • 15 Newcombe HB, Smith ME, Howe GR, Mingay I, Strugnell A, Abbatt JD. Reliability of computerized versus manual death searches in a study of the health of Eldorado uranium workers. Comput Bioi Med 1983; 13: 157-69.
  • 16 Carmines EG, Zeller RA. Reliability and Validity Assessment. Beverly Hills, CA: Sage Publications; 1979
  • 17 Detsky AS. Using economic analysis to determine the resource consequences of choices made in planning clinical trials. J Chron Dis 1985; 38: 753-65.
  • 18 Arellano MG, Petersen GR, Petitti DB. et al. The California Automated Mortality Linkage System (CAMLIS). Am J Public Health 1984; 74: 1324-30.
  • 19 Greenes RA, Shortliffe EH. Medical informatics: an emerging academic discipline and institutional priority. JAMA 1990; 263: 1114-20.