Applying the provenance data model to a bioinformatics case

Paul Groth, Steve Munroe, Simon Miles, Luc Moreau

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

2 Scopus citations

Abstract

Scientists and, more generally end users of computer systems, need to be able to trust the data they use. Understanding the origin or provenance of data can provide this trust. Attempts have been made to develop systems for recording provenance, however, most are not generic and cannot be applied in a general manner across different systems and different technologies. Moreover, many existing systems confuse the concept of provenance with its representation. In this article, we discuss an open, technology neutral model for provenance. The model can be applied across many different systems and makes the important distinction between provenance and the way it can be generated from a concrete representation of process. The model is described and applied to a grid-based example bioinformatics application.

Original languageEnglish
Title of host publicationHigh Performance Computing and Grids in Action
PublisherIOS Press BV
Pages250-264
Number of pages15
ISBN (Print)9781586038397
StatePublished - 2008
Externally publishedYes

Publication series

NameAdvances in Parallel Computing
Volume16
ISSN (Print)0927-5452

Keywords

  • bioinformatics
  • data model
  • Provenance

Fingerprint

Dive into the research topics of 'Applying the provenance data model to a bioinformatics case'. Together they form a unique fingerprint.

Cite this