JAMIA 2001;8:17-33 doi:10.1136/jamia.2001.0080017
  • Focus on Neuroinformatics
  • Model Formulation

Common Data Model for Neuroscience Data and Data Model Exchange

  1. Daniel Gardner,
  2. Kevin H Knuth,
  3. Michael Abato,
  4. Steven M Erde,
  5. Thomas White,
  6. Robert DeBellis,
  7. Esther P Gardner
  1. Affiliations of the authors: Weill Medical College of Cornell University (DG, KHK, MA, SME, TW, RD) and New York University School of Medicine (EPG), New York, New York
  1. Correspondence and reprints: Daniel Gardner, PhD, Department of Physiology and Biophysics, Weill Medical College of Cornell University, 1300 York Avenue, New York, NY 10021; e-mail: dan@aplysia.med.cornell.edu
  • Received 19 June 2000
  • Accepted 15 September 2000

Abstract

Objective Generalizing the data models underlying two prototype neurophysiology databases, the authors describe and propose the Common Data Model (CDM) as a framework for federating a broad spectrum of disparate neuroscience information resources.

Design Each component of the CDM derives from one of five superclasses (data, site, method, model, and reference) or from relations defined between them. A hierarchic attribute-value scheme for metadata enables interoperability, with variable tree depth to serve either specific intra-domain queries or broad inter-domain queries. To mediate data exchange between disparate systems, the authors propose a set of XML-derived schemas for describing not only data sets but also data models. These include biophysical description markup language (BDML), which mediates interoperability between data resources by providing a meta-description for the CDM.
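As a minimal sketch of how a hierarchic attribute-value scheme might serve both broad and specific queries, the following Python fragment matches a query path of variable depth against a metadata tree. The class name `Attribute`, the function `matches`, and the example attribute names and values are illustrative assumptions; none of them is taken from the CDM itself.

```python
from __future__ import annotations
from dataclasses import dataclass, field

# The five superclasses named in the abstract.
SUPERCLASSES = ("data", "site", "method", "model", "reference")


@dataclass
class Attribute:
    """One node of a hierarchic attribute-value tree (hypothetical structure)."""
    name: str
    value: str
    children: list[Attribute] = field(default_factory=list)


def matches(node: Attribute, path: list[tuple[str, str]]) -> bool:
    """Return True if the (name, value) pairs in `path` match from `node` downward.

    A short path acts as a broad query; a longer path descends further
    into the tree and acts as a more specific query.
    """
    if not path:
        return True
    if (node.name, node.value) != path[0]:
        return False
    rest = path[1:]
    if not rest:
        return True
    return any(matches(child, rest) for child in node.children)


# Illustrative metadata for a site; the names and values are made up.
site = Attribute("region", "cortex", [
    Attribute("layer", "5", [Attribute("cell", "pyramidal")]),
])

broad = matches(site, [("region", "cortex")])
specific = matches(site, [("region", "cortex"), ("layer", "5"), ("cell", "pyramidal")])
```

In this sketch the same tree answers a one-level query and a three-level query, which is one way to read "variable tree depth"; the scheme described in the paper may differ in detail.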

Results The set of superclasses potentially spans data needs of contemporary neuroscience. Data elements abstracted from neurophysiology time series and histogram data represent data sets that differ in dimension and concordance. Site elements transcend neurons to describe subcellular compartments, circuits, regions, or slices; non-neuroanatomic sites range from sequences to patients. Methods and models are highly domain-dependent.

Conclusions True federation of data resources requires explicit public description, in a metalanguage, of the contents, query methods, data formats, and data models of each data resource. Any data model that can be derived from the defined superclasses is potentially conformant, and interoperability can be enabled by recognition of BDML-described compatibilities. Such metadescriptions can buffer technologic changes.
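The idea of recognizing compatibilities from public metadescriptions can be sketched as follows. The XML vocabulary used here (`resource`, `class`, and the `derives` attribute) is hypothetical and is not BDML syntax; it only illustrates checking that each declared class derives from one of the five superclasses and finding the superclasses that two resources share.

```python
import xml.etree.ElementTree as ET

SUPERCLASSES = {"data", "site", "method", "model", "reference"}

# Hypothetical metadescriptions; element and attribute names are assumptions.
RESOURCE_A = """
<resource name="A">
  <class name="TimeSeries" derives="data"/>
  <class name="Neuron" derives="site"/>
</resource>
"""

RESOURCE_B = """
<resource name="B">
  <class name="Histogram" derives="data"/>
  <class name="Protocol" derives="method"/>
</resource>
"""


def declared_classes(xml_text: str) -> dict[str, str]:
    """Map each declared class name to the superclass it claims to derive from."""
    root = ET.fromstring(xml_text)
    return {el.get("name"): el.get("derives") for el in root.iter("class")}


def is_conformant(xml_text: str) -> bool:
    """True if every declared class derives from one of the five superclasses."""
    return all(sup in SUPERCLASSES for sup in declared_classes(xml_text).values())


def shared_superclasses(xml_a: str, xml_b: str) -> set[str]:
    """Superclasses used by both resources: candidate points of interoperability."""
    return set(declared_classes(xml_a).values()) & set(declared_classes(xml_b).values())


common = shared_superclasses(RESOURCE_A, RESOURCE_B)
```

Here the two resources share only the data superclass, so a federated query under these assumptions could span their time series and histogram holdings without either resource changing its internal storage.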

Footnotes

  • This work was supported by the Human Brain Project through grant MH57153 from the National Institute of Mental Health, grant NS36043 from the National Institute of Neurological Disorders and Stroke, and grant BIR/DBI-9506171 from the National Science Foundation.

  • Preliminary versions of this work were presented at the 1998 and 1999 annual meetings of the Society for Neuroscience and the 1999 and 2000 annual meetings of the Biophysical Society, and were published only in abstract form.31 35

The Journal of the American Medical Informatics Association is published for the American Medical Informatics Association by BMJ Publishing Group Ltd.