Provenance: The bridge between experiments and data

Simon Miles, Paul Groth, Ewa Deelman, Karan Vahi, Gaurang Mehta, Luc Moreau

Research output: Contribution to journalArticlepeer-review

30 Scopus citations

Abstract

Prototype-based Provenance-Aware Service Oriented Architecture (PASOA) allows scientists to determine the processes and the abstract workflow involved in the processing of a given data item. The generation of a workflow is followed by the descriptions in the workflow-generation process, providing the provenance for the executable workflow. Pegasus workflow compiler system, which maps high-level abstract workflow descriptions onto available distributed resources, consists of five primary steps such as reduction, site selection, data staging, registration, and clustering. The PAOSA project must include support querying that is provided by independent source of process documentation using the same data model. Each refiner documents the relationships between nodes in the workflow by recording identicalTo, siteSelectionOf, stagingInroducedFor, and clusteringOf.

Original languageEnglish
Article number4488063
Pages (from-to)38-46
Number of pages9
JournalComputing in Science and Engineering
Volume10
Issue number3
DOIs
StatePublished - May 2008
Externally publishedYes

Keywords

  • Grid computing
  • Provenance
  • Service-oriented architecture
  • Traceability
  • Workflow compilation

Fingerprint

Dive into the research topics of 'Provenance: The bridge between experiments and data'. Together they form a unique fingerprint.

Cite this