Abstract: A decade of research on provenance, its standardisation at the World Wide Web Consortium (PROV), and the applications, toolkits and services that have adopted it have led to the recognition that provenance is a critical facet of good data governance for businesses, governments and organisations in general. Provenance, defined as a record that describes the people, institutions, entities, and activities involved in producing, influencing, or delivering a piece of data or a thing, is now regarded as an essential function of data-intensive applications, which rely on it to provide a trusted account of what they performed. In this tutorial, I will explain the motivation for provenance, introduce the PROV standard, present some applications of provenance, and review some tools for provenance. I aim to conclude with a practical session on provenance modelling.
Bio: Dong Huynh is a research fellow at King’s with extensive experience in provenance technology and provenance analytics. He is the lead developer of the PROV Python package and of ProvStore, an online repository for provenance documents at openprovenance.org.