J Am Med Inform Assoc 13:497-507 doi:10.1197/jamia.M2085
  • Focus on Automated Categorization Technique

Quantitative Assessment of Dictionary-based Protein Named Entity Tagging

Table 4

Coverage assessment for BioThesaurus using the test set of the BioCreAtive workshop. The percentage in the coverage column is the proportion of terms in the BioCreAtive text that are present in BioThesaurus. The percentages in the three ambiguity columns are the inverses of the ambiguity values; they correspond to the precision of a baseline system that randomly picks one associated entity for each ambiguous term.

                                  Coverage of Genes in the   Ambiguity of Matched Terms in BioThesaurus
Organisms Mentioned in Abstracts  Evaluation Set Present     Including Systematic  Ignoring Systematic  Limited to Specific
                                  in BioThesaurus            Ambiguity             Ambiguity            Organism
Yeast                             557/595 (93.6%)            55.7 (1.8%)           23.8 (4.2%)          1.23 (81.3%)
Mouse                             346/378 (91.5%)            28.0 (3.6%)           13.4 (7.5%)          1.13 (88.5%)
Fly                               359/370 (98.1%)            50.7 (2.0%)           17.7 (5.6%)          7.34 (13.6%)
Total                             1,262/1,343 (94.0%)        46.9 (2.1%)           19.1 (5.3%)          3.02 (33.1%)
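
The arithmetic behind the parenthesized percentages can be illustrated with a minimal sketch. It assumes, as the caption states, that coverage is the fraction of evaluation-set terms found in BioThesaurus and that the baseline precision is the inverse of the average ambiguity. The function names `coverage` and `random_baseline_precision` are hypothetical and do not come from the article; only the Yeast row is used as input.

```python
def coverage(matched: int, total: int) -> float:
    """Fraction of evaluation-set terms that are present in the dictionary."""
    if total <= 0:
        raise ValueError("total must be positive")
    return matched / total


def random_baseline_precision(ambiguity: float) -> float:
    """Expected precision of a system that picks one candidate entity
    uniformly at random, given the average number of candidates per term."""
    if ambiguity < 1:
        raise ValueError("ambiguity is an average candidate count, so it must be >= 1")
    return 1.0 / ambiguity


# Yeast row of the table: matched/total terms and the three ambiguity values.
yeast_matched, yeast_total = 557, 595
yeast_ambiguity = {
    "including systematic ambiguity": 55.7,
    "ignoring systematic ambiguity": 23.8,
    "limited to specific organism": 1.23,
}

print(f"coverage: {coverage(yeast_matched, yeast_total):.1%}")  # coverage: 93.6%
for setting, value in yeast_ambiguity.items():
    print(f"{setting}: {random_baseline_precision(value):.1%}")
```

Under these assumptions, restricting candidates to the organism under discussion reduces the Yeast ambiguity from 55.7 to 1.23, which raises the random-pick precision from about 1.8% to about 81.3%.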
