Quick Search:

uzh logo
Browse by:

Zurich Open Repository and Archive

Permanent URL to this publication: http://dx.doi.org/10.5167/uzh-24450

Glavic, B; Dittrich, K R (2007). Data provenance: A Cctegorization of existing approaches. In: 12. Fachtagung des GI-Fachbereichs "Datenbanken und Informationssysteme" , Aachen, Germany, 7 March 2007 - 9 March 2007, 227-241.



In many application areas like e-science and data-warehousing detailed information about the origin of data is required. This kind of information is often referred to as data provenance or data lineage. The provenance of a data item includes information about the processes and source data items that lead to its creation and current representation. The diversity of data representation models and application domains has lead to a number of more or less formal definitions of provenance. Most of them are limited to a special application domain, data representation model or data processing facility. Not surprisingly, the associated implementations are also restricted to some application domain and depend on a special data model. In this paper we give a survey of data provenance models and prototypes, present a general categorization scheme for provenance models and use this categorization scheme to study the properties of the existing approaches. This categorization enables us to distinguish between different kinds of provenance information and could lead to a better understanding of provenance in general. Besides the categorization of provenance types, it is important to include the storage, transformation and query requirements for the different kinds of provenance information and application domains in our considerations. The analysis of existing approaches will assist us in revealing open research problems in the area of data provenance.



93 downloads since deposited on 16 Dec 2009
49 downloads since 12 months

Detailed statistics

Additional indexing

Item Type:Conference or Workshop Item (Paper), refereed, further contribution
Communities & Collections:03 Faculty of Economics > Department of Informatics
Dewey Decimal Classification:000 Computer science, knowledge & systems
Uncontrolled Keywords:provenance, survey
Event End Date:9 March 2007
Deposited On:16 Dec 2009 08:59
Last Modified:09 Jul 2012 04:01
Publisher:Gesellschaft für Informatik (GI)
Series Name:GI-Edition - Lecture notes in informatics (LNI). Proceedings
Additional Information:12. GI-Fachtagung für Datenbanksysteme in Business, Technologie und Web 5. bis 9. März 2007 – Aachen
Free access at:Official URL. An embargo period may apply.
Official URL:http://www.btw2007.de/paper/p227.pdf
Related URLs:http://opac.nebis.ch/F/?local_base=NEBIS&con_lng=GER&func=find-b&find_code=SYS&request=005515364

Users (please log in): suggest update or correction for this item

Repository Staff Only: item control page