rss
J Am Med Inform Assoc 14:651-661 doi:10.1197/jamia.M2215
  • Original Investigation

A Document Clustering and Ranking System for Exploring MEDLINE Citations

Table 4

“Breast cancer” Data set Computing Time for Each Phase in Our System

# of documents Computing Time (in seconds)
Text pre-processing 77,784 ∼45
Text clustering (using CLUTO) 77,784 14.28
Keyword Extraction
Cluster A 8,212 0.77
Cluster B 6,159 0.66
Cluster C 13,122 1.057
Cluster D 16,292 0.97
Cluster E 21,005 1.25
Cluster F 12,994 1.09
MeSH term Extraction
Cluster A 8,212 0.41
Cluster B 6,159 0.33
Cluster C 13,122 0.58
Cluster D 16,292 0.66
Cluster E 21,005 0.81
Cluster F 12,994 0.59
Document ranking
Cluster A 8,212 0.20
Cluster B 6,159 0.11
Cluster C 13,122 0.27
Cluster D 16,292 0.30
Cluster E 21,005 0.44
Cluster F 12,994 0.28
Total 70.06

Access policy for JAMIA

All content published in JAMIA is deposited with PubMed Central by the publisher with a 12 month embargo. Authors/funders may pay an Unlocked fee of $2,000 to make the article free on the JAMIA website and PMC immediately on publication.

All content older than 12 months is freely available on this website.

AMIA members can log in with their JAMIA user name (email address) and password or via the AMIA website.