Channels ▼
RSS

Web Development

Tracking Data Provenance


The first community model capable of tracing the origins of computer-generated information is now available. University of Southampton School of Electronics and Computer Science professor and researcher Luc Moreau, says that the new model will lead to better degrees of trust online.

A new paper titled "The open provenance model core specification", written by Moreau and a community of international researchers, describes the Open Provenance Model (OPM), designed to represent the provenance of information.

“Provenance is a term used in diverse areas such as art, archaeology and palaeontology, which describes the history of an object since its creation,” said Moreau. “Its main focus is to establish that the object has not been forged or altered, and we have found that we can now do the same with computer-generated data. By understanding where data comes from, users can decide to trust data.”

In 2006, Professor Moreau launched the Provenance Challenge series, an international, multidisciplinary effort aimed at exchanging provenance between information systems.  It led to the design of the OPM, its actual use in the Provenance Challenge, and its revision according to an open-source-like community process.

The team has developed a model that traces the origins of information and allows these provenance details to be shared between systems. The new model has already had some take-up by academia and industry. The next step is for a provenance data model of this kind to receive a seal of approval from the standardization body.

“Provenance is well understood in the context of art or digital libraries, where it refers respectively to the documented history of an art object, or the documentation of processes in a digital object's life cycle,” said Moreau. “Interest in provenance in the e-science community is also growing, since it is perceived as a crucial component of workflow systems that can help scientists ensure reproducibility of their scientific analyses and processes.”


Related Reading


More Insights






Currently we allow the following HTML tags in comments:

Single tags

These tags can be used alone and don't need an ending tag.

<br> Defines a single line break

<hr> Defines a horizontal line

Matching tags

These require an ending tag - e.g. <i>italic text</i>

<a> Defines an anchor

<b> Defines bold text

<big> Defines big text

<blockquote> Defines a long quotation

<caption> Defines a table caption

<cite> Defines a citation

<code> Defines computer code text

<em> Defines emphasized text

<fieldset> Defines a border around elements in a form

<h1> This is heading 1

<h2> This is heading 2

<h3> This is heading 3

<h4> This is heading 4

<h5> This is heading 5

<h6> This is heading 6

<i> Defines italic text

<p> Defines a paragraph

<pre> Defines preformatted text

<q> Defines a short quotation

<samp> Defines sample computer code text

<small> Defines small text

<span> Defines a section in a document

<s> Defines strikethrough text

<strike> Defines strikethrough text

<strong> Defines strong text

<sub> Defines subscripted text

<sup> Defines superscripted text

<u> Defines underlined text

Dr. Dobb's encourages readers to engage in spirited, healthy debate, including taking us to task. However, Dr. Dobb's moderates all comments posted to our site, and reserves the right to modify or remove any content that it determines to be derogatory, offensive, inflammatory, vulgar, irrelevant/off-topic, racist or obvious marketing or spam. Dr. Dobb's further reserves the right to disable the profile of any commenter participating in said activities.

 
Disqus Tips To upload an avatar photo, first complete your Disqus profile. | View the list of supported HTML tags you can use to style comments. | Please read our commenting policy.
 
Dr. Dobb's TV