rss
JAMIA 1994;1:161-174 doi:10.1136/jamia.1994.95236146
  • Original Investigation
  • Research Paper

A General Natural-language Text Processor for Clinical Radiology

  1. Carol Friedman,
  2. Philip O Alderson,
  3. John H M Austin,
  4. James J Cimino,
  5. Stephen B Johnson
  1. Affiliation of the authors: Columbia University (CF,JJC, SBJ) and Columbia Presbyterian Medical Center (POA,JHMA), New York, NY
  1. Correspondence and Reprints: Carol Friedman, PhD, Queens College of the City University of New York, Computer Science Department, Flushing, NY 11367-1597.

    Abstract

    Objective Development of a general natural-language processor that identifies clinical information in narrative reports and maps that information into a structured representation containing clinical terms.

    Design The natural-language processor provides three phases of processing, all of which are driven by different knowledge sources. The first phase performs the parsing. It identifies the structure of the text through use of a grammar that defines semantic patterns and a target form. The second phase, regularization, standardizes the terms in the initial target structure via a compositional mapping of multi- word phrases. The third phase, encoding, maps the terms to a controlled vocabulary. Radiology is the test domain for the processor and the target structure is a formal model for representing clinical information in that domain.

    Measurements The impression sections of 230 radiology reports were encoded by the processor. Results of an automated query of the resultant database for the occurrences of four diseases were compared with the analysis of a panel of three physicians to determine recall and precision.

    Results Without training specific to the four diseases, recall and precision of the system(combined effect of the processor and query generator) were 70% and 87%. Training of the query component increased recall to 85% without changing precision.

    Footnotes

    • Supported in part by Grant Number R29 LM05397 form the National Library of Medicine and Grant Number 6-61483 from the Research Foundation of CUNY.

    Access policy for JAMIA

    All content published in JAMIA is deposited with PubMedCentral by the publisher but with varying embargo times. Authors/funders may pay an Unlocked fee of $2,000 to make the article free on the JAMIA website and PMC immediately on publication. Research funded by government and other recognised agencies is deposited with a 12 month embargo. All other content is deposited with a 36 month embargo.

    AMIA members log in here to access the full text of JAMIA.

    Register for free content

    Individuals may register for a free 30 day online trial to all content.

    The Journal of the American Medical Informatics Association is published for the American Medical Informatics Association by BMJ Publishing Group Ltd.