rss
J Am Med Inform Assoc 2009;16:328-337 doi:10.1197/jamia.M3028
  • Original Investigation
  • Research Paper

Active Computerized Pharmacovigilance Using Natural Language Processing, Statistics, and Electronic Health Records: A Feasibility Study

  1. Xiaoyan Wanga,
  2. George Hripcsaka,
  3. Marianthi Markatoub,
  4. Carol Friedmana
  1. aDepartment of Biomedical Informatics, Columbia University, New York, NY
  2. bDepartment of Biostatistics, Columbia University, New York, NY
  1. Correspondence: Carol Friedman, PhD, Department of Biomedical Informatics, Columbia University, 622 West 168th Street, VC5, New York, NY 10032; e-mail: <friedman{at}dbmi.columbia.edu>
  • Received 9 October 2008
  • Accepted 31 January 2009

Abstract

Objective It is vital to detect the full safety profile of a drug throughout its market life. Current pharmacovigilance systems still have substantial limitations, however. The objective of our work is to demonstrate the feasibility of using natural language processing (NLP), the comprehensive Electronic Health Record (EHR), and association statistics for pharmacovigilance purposes.

Design Narrative discharge summaries were collected from the Clinical Information System at New York Presbyterian Hospital (NYPH). MedLEE, an NLP system, was applied to the collection to identify medication events and entities which could be potential adverse drug events (ADEs). Co-occurrence statistics with adjusted volume tests were used to detect associations between the two types of entities, to calculate the strengths of the associations, and to determine their cutoff thresholds. Seven drugs/drug classes (ibuprofen, morphine, warfarin, bupropion, paroxetine, rosiglitazone, ACE inhibitors) with known ADEs were selected to evaluate the system.

Results One hundred thirty-two potential ADEs were found to be associated with the 7 drugs. Overall recall and precision were 0.75 and 0.31 for known ADEs respectively. Importantly, qualitative evaluation using historic roll back design suggested that novel ADEs could be detected using our system.

Conclusions This study provides a framework for the development of active, high-throughput and prospective systems which could potentially unveil drug safety profiles throughout their entire market life. Our results demonstrate that the framework is feasible although there are some challenging issues. To the best of our knowledge, this is the first study using comprehensive unstructured data from the EHR for pharmacovigilance.

Footnotes

    Access policy for JAMIA

    All content published in JAMIA is deposited with PubMed Central by the publisher with a 12 month embargo. Authors/funders may pay an Unlocked fee of $2,000 to make the article free on the JAMIA website and PMC immediately on publication.

    All content older than 12 months is freely available on this website.

    AMIA members can log in with their JAMIA user name (email address) and password or via the AMIA website.