JAMIA 2006;13:334-343 doi:10.1197/jamia.M1823
  • Original Investigation
  • Research Paper

A System for Automated Lexical Mapping

  1. Jennifer Y Sun,
  2. Yao Sun
  • Affiliations of the authors: Newborn Medicine Informatics Program, Children's Hospital, Boston, MA
  • Correspondence and reprints: Jennifer Y. Sun, MD, MS, 57 Blossomcrest Road, Lexington, MA 02421-7103; e-mail: <jennifer.sun{at}childrens.harvard.edu>
  • Received 8 March 2005
  • Accepted 1 February 2006

Abstract

Objective To automate the mapping of disparate databases to standardized medical vocabularies.

Background Merging of clinical systems and medical databases, or aggregation of information from disparate databases, frequently requires a process whereby vocabularies are compared and similar concepts are mapped.

Design Using a normalization phase followed by a novel alignment stage inspired by DNA sequence alignment methods, automated lexical mapping can map terms from various databases to standard vocabularies such as the UMLS (Unified Medical Language System) and LOINC (Logical Observation Identifier Names and Codes).
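
The abstract does not give the details of the normalization or alignment steps, so the following is only a minimal sketch of the general idea, not the authors' implementation. It assumes normalization means lowercasing, stripping punctuation, and sorting word tokens, and it substitutes a standard Needleman-Wunsch global alignment score over characters for the paper's alignment stage. The function names `normalize`, `alignment_score`, and `best_match`, and the scoring weights, are hypothetical.

```python
import re

def normalize(term):
    """Assumed normalization: lowercase, drop punctuation, sort word tokens."""
    tokens = re.sub(r"[^a-z0-9]+", " ", term.lower()).split()
    return " ".join(sorted(tokens))

def alignment_score(a, b, match=1, mismatch=-1, gap=-1):
    """Needleman-Wunsch global alignment score between two strings.

    The weights are illustrative assumptions, not values from the paper.
    """
    # prev[j] holds the best score for aligning the processed prefix of a with b[:j].
    prev = [j * gap for j in range(len(b) + 1)]
    for i in range(1, len(a) + 1):
        curr = [i * gap]
        for j in range(1, len(b) + 1):
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            curr.append(max(diag, prev[j] + gap, curr[j - 1] + gap))
        prev = curr
    return prev[-1]

def best_match(term, vocabulary):
    """Return the vocabulary entry whose normalized form aligns best with term."""
    query = normalize(term)
    return max(vocabulary, key=lambda v: alignment_score(query, normalize(v)))
```

For instance, under these assumptions `best_match("Serum Glucose", ["Glucose, serum", "Sodium, serum"])` selects `"Glucose, serum"`, because both strings normalize to the same token sequence. A real system would also need a score threshold below which a term is reported as unmapped.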

Measurements This automated lexical mapping was evaluated using three real-world laboratory databases from different health care institutions. The authors report the sensitivity, specificity, percentage correct (true positives plus true negatives divided by total number of terms), and true positive and true negative rates as measures of system performance.
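
As a rough illustration of how these measures relate, the sketch below computes them from the four outcome counts of a mapping run. Sensitivity and specificity use their standard definitions, and percentage correct follows the definition given above. The function name `mapping_metrics` and the counts in the usage note are hypothetical and are not taken from the study.

```python
def mapping_metrics(tp, fp, tn, fn):
    """Compute evaluation measures from true/false positive/negative counts."""
    total = tp + fp + tn + fn
    return {
        # Fraction of terms with a true equivalent that were mapped correctly.
        "sensitivity": tp / (tp + fn) if (tp + fn) else None,
        # Fraction of terms without an equivalent that were correctly left unmapped.
        "specificity": tn / (tn + fp) if (tn + fp) else None,
        # True positives plus true negatives divided by total number of terms.
        "percent_correct": 100 * (tp + tn) / total if total else None,
    }
```

With made-up counts of 40 true positives, 10 false positives, 30 true negatives, and 20 false negatives, the percentage correct is 70 and the specificity is 0.75.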

Results The alignment algorithm was able to map 57% to 78% (average of 63% over all runs and databases) of equivalent concepts through lexical mapping alone. True positive rates ranged from 18% to 70%; true negative rates ranged from 5% to 52%.

Conclusion Lexical mapping can facilitate the integration of data from diverse sources and decrease the time and cost required for manual mapping and integration of clinical systems and medical databases.

Footnotes

    The Journal of the American Medical Informatics Association is published for the American Medical Informatics Association by BMJ Publishing Group Ltd.