“Breast cancer” Data set Computing Time for Each Phase in Our System
| # of documents | Computing Time (in seconds) | |
| Text pre-processing | 77,784 | ∼45 |
| Text clustering (using CLUTO) | 77,784 | 14.28 |
| Keyword Extraction | ||
| Cluster A | 8,212 | 0.77 |
| Cluster B | 6,159 | 0.66 |
| Cluster C | 13,122 | 1.057 |
| Cluster D | 16,292 | 0.97 |
| Cluster E | 21,005 | 1.25 |
| Cluster F | 12,994 | 1.09 |
| MeSH term Extraction | ||
| Cluster A | 8,212 | 0.41 |
| Cluster B | 6,159 | 0.33 |
| Cluster C | 13,122 | 0.58 |
| Cluster D | 16,292 | 0.66 |
| Cluster E | 21,005 | 0.81 |
| Cluster F | 12,994 | 0.59 |
| Document ranking | ||
| Cluster A | 8,212 | 0.20 |
| Cluster B | 6,159 | 0.11 |
| Cluster C | 13,122 | 0.27 |
| Cluster D | 16,292 | 0.30 |
| Cluster E | 21,005 | 0.44 |
| Cluster F | 12,994 | 0.28 |
| Total | 70.06 |









