rss
JAMIA 2008;15:349-356 doi:10.1197/jamia.M2592
  • Original Investigation
  • Research Paper

Estimating Consumer Familiarity with Health Terminology: A Context-based Approach

  1. Qing Zeng-Treitlera,
  2. Sergey Goryacheva,
  3. Tony Tseb,
  4. Alla Keselmanb,c,
  5. Aziz Boxwalaa
  1. aDecision Systems Group, Brigham and Women's Hospital, Harvard Medical School, Boston, MA
  2. bLister Hill National Center for Biomedical Communications, National Library of Medicine, National Institutes of Health, Department of Health and Human Services, Bethesda, MD
  3. cAquilent, Inc., Laurel, MD
  1. Correspondence: Qing Zeng-Treitler, PhD, Decision Systems Group, Department of Radiology, Brigham and Women's Hospital, Harvard Medical School, 900 Commonwealth Avenue, 3rd Floor, Boston, MA 02215 (e-mail: <qzeng{at}dsg.bwh.harvard.edu>)
  • Received 14 August 2007
  • Accepted 8 February 2008

Abstract

Objectives Effective health communication is often hindered by a “vocabulary gap” between language familiar to consumers and jargon used in medical practice and research. To present health information to consumers in a comprehensible fashion, we need to develop a mechanism to quantify health terms as being more likely or less likely to be understood by typical members of the lay public. Prior research has used approaches including syllable count, easy word list, and frequency count, all of which have significant limitations.

Design In this article, we present a new method that predicts consumer familiarity using contextual information. The method was applied to a large query log data set and validated using results from two previously conducted consumer surveys.

Measurements We measured the correlation between the survey result and the context-based prediction, syllable count, frequency count, and log normalized frequency count.

Results The correlation coefficient between the context-based prediction and the survey result was 0.773 (p < 0.001), which was higher than the correlation coefficients between the survey result and the syllable count, frequency count, and log normalized frequency count (p ≤ 0.012).

Conclusions The context-based approach provides a good alternative to the existing term familiarity assessment methods.

Footnotes

  • This work is supported by National Institutes of Health grants R01 LM07222 and R01 DK75837, and by the Intramural Research Program of the National Institutes of Health, National Library of Medicine.

Access policy for JAMIA

All content published in JAMIA is deposited with PubMedCentral by the publisher but with varying embargo times. Authors/funders may pay an Unlocked fee of $2,000 to make the article free on the JAMIA website and PMC immediately on publication. Research funded by government and other recognised agencies is deposited with a 12 month embargo. All other content is deposited with a 36 month embargo.

The Journal of the American Medical Informatics Association is published for the American Medical Informatics Association by BMJ Publishing Group Ltd.