rss
JAMIA 2008;15:138-149 doi:10.1197/jamia.M2522
  • Focus on caBIG Models
  • Model Formulation

caGrid 1.0: An Enterprise Grid Infrastructure for Biomedical Research

  1. Scott Ostera,
  2. Stephen Langellaa,
  3. Shannon Hastingsa,
  4. David Ervina,
  5. Ravi Maddurib,
  6. Joshua Phillipsd,
  7. Tahsin Kurca,
  8. Frank Siebenlistb,
  9. Peter Covitzc,
  10. Krishnakant Shanbhagc,
  11. Ian Fosterb,
  12. Joel Saltza
  1. aDepartment of Biomedical Informatics, The Ohio State University, Columbus, OH
  2. bMathematics and Computer Science Division, Argonne National Laboratory, Argonne, IL
  3. cNational Cancer Institute Center for Bioinformatics, Rockville, MD
  4. dSemanticBits, Reston, VA
  1. Correspondence and reprints: Tahsin Kurc, Biomedical Informatics Department, Ohio State University, 3184 Graves Hall, 333 West 10th Ave, Columbus, OH, 43210 (e-mail: <kurc{at}bmi.osu.edu>)
  • Received 30 May 2007
  • Accepted 7 December 2007

Abstract

Objective To develop software infrastructure that will provide support for discovery, characterization, integrated access, and management of diverse and disparate collections of information sources, analysis methods, and applications in biomedical research.

Design An enterprise Grid software infrastructure, called caGrid version 1.0 (caGrid 1.0), has been developed as the core Grid architecture of the NCI-sponsored cancer Biomedical Informatics Grid (caBIG) program. It is designed to support a wide range of use cases in basic, translational, and clinical research, including 1) discovery, 2) integrated and large-scale data analysis, and 3) coordinated study.

Measurements The caGrid is built as a Grid software infrastructure and leverages Grid computing technologies and the Web Services Resource Framework standards. It provides a set of core services, toolkits for the development and deployment of new community provided services, and application programming interfaces for building client applications.

Results The caGrid 1.0 was released to the caBIG community in December 2006. It is built on open source components and caGrid source code is publicly and freely available under a liberal open source license. The core software, associated tools, and documentation can be downloaded from the following URL: https://cabig.nci.nih.gov/workspaces/Architecture/caGrid.

Conclusions While caGrid 1.0 is designed to address use cases in cancer research, the requirements associated with discovery, analysis and integration of large scale data, and coordinated studies are common in other biomedical fields. In this respect, caGrid 1.0 is the realization of a framework that can benefit the entire biomedical community.

Footnotes

  • The caBIG In-vivo Imaging Domain Workspace is developing common data elements and terminology for annotation and image markup. With the availability of such community accepted data elements and vocabulary, the developers in our example could choose to reuse those data elements for better interoperability with other imaging applications in the in-vivo imaging domain.

Access policy for JAMIA

All content published in JAMIA is deposited with PubMedCentral by the publisher but with varying embargo times. Authors/funders may pay an Unlocked fee of $2,000 to make the article free on the JAMIA website and PMC immediately on publication. Research funded by government and other recognised agencies is deposited with a 12 month embargo. All other content is deposited with a 36 month embargo.

The Journal of the American Medical Informatics Association is published for the American Medical Informatics Association by BMJ Publishing Group Ltd.