Recording process documentation for provenance

Paul Groth, Luc Moreau

Research output: Contribution to journal › Article › peer-review

33 Scopus citations

Abstract

Scientific and business communities are adopting large-scale distributed systems as a means to solve a wide range of resource-intensive tasks. These communities also have requirements in terms of provenance. We define the provenance of a result produced by a distributed system as the process that led to that result. This paper describes a protocol for recording documentation of a distributed system's execution. The distributed protocol guarantees that documentation with characteristics suitable for accurately determining the provenance of results is recorded. These characteristics are confirmed through a number of proofs based on an abstract state machine formalization.

Original language: English
Pages (from-to): 1246-1259
Number of pages: 14
Journal: IEEE Transactions on Parallel and Distributed Systems
Volume: 20
Issue number: 9
State: Published - 2009
Externally published: Yes

Keywords

  • Data protocols
  • Distributed systems
  • Grids
  • Lineage
  • Provenance
