Comparison and validation of genomic predictors for anticancer drug sensitivity
- Simon Papillon-Cavanagh1,
- Nicolas De Jay1,
- Nehme Hachem1,
- Catharina Olsen2,
- Gianluca Bontempi2,
- Hugo J W L Aerts3,4,
- John Quackenbush4,
- Benjamin Haibe-Kains1
- 1Bioinformatics and Computational Genomics Laboratory, Institut de recherches cliniques de Montréal, University of Montreal, Montreal, Quebec, Canada
- 2Machine Learning Group, Université Libre de Bruxelles, Bruxelles, Belgium
- 3Department of Radiation Oncology, Dana-Farber Cancer Institute, Harvard University, Boston, Massachusetts, USA
- 4Department or Biostatistics and Computational Biology, Dana-Farber Cancer Institute, Harvard University, Boston, Massachusetts, USA
- Correspondence to Dr Benjamin Haibe-Kains, Bioinformatics and Computational Genomics Laboratory, Institut de recherches cliniques de Montréal, 110 Avenue des Pins Ouest, Montreal, Quebec, Canada H2W 1R7;
- Received 25 October 2012
- Revised 25 October 2012
- Accepted 5 January 2013
- Published Online First 26 January 2013
Background An enduring challenge in personalized medicine lies in selecting the right drug for each individual patient. While testing of drugs on patients in large trials is the only way to assess their clinical efficacy and toxicity, we dramatically lack resources to test the hundreds of drugs currently under development. Therefore the use of preclinical model systems has been intensively investigated as this approach enables response to hundreds of drugs to be tested in multiple cell lines in parallel.
Methods Two large-scale pharmacogenomic studies recently screened multiple anticancer drugs on over 1000 cell lines. We propose to combine these datasets to build and robustly validate genomic predictors of drug response. We compared five different approaches for building predictors of increasing complexity. We assessed their performance in cross-validation and in two large validation sets, one containing the same cell lines present in the training set and another dataset composed of cell lines that have never been used during the training phase.
Results Sixteen drugs were found in common between the datasets. We were able to validate multivariate predictors for three out of the 16 tested drugs, namely irinotecan, PD-0325901, and PLX4720. Moreover, we observed that response to 17-AAG, an inhibitor of Hsp90, could be efficiently predicted by the expression level of a single gene, NQO1.
Conclusion These results suggest that genomic predictors could be robustly validated for specific drugs. If successfully validated in patients’ tumor cells, and subsequently in clinical trials, they could act as companion tests for the corresponding drugs and play an important role in personalized medicine.