Adding Boris' details

pull/1/head
Oliver Kennedy 2016-01-04 13:07:23 -05:00
parent 4f80c8a6a8
commit fbd9350092
1 changed files with 5 additions and 1 deletions

View File

@ -7,9 +7,13 @@ schedule:
- when: Feb. 1
what: TBD
- when: Feb. 8
what: Title TBD
what: Reenacting Transactional Histories to Compute Their Provenance
who: Boris Glavic (IIT)
where: Location TBD
abstract: |
Provenance for database queries, information about how the outputs of a query where derived from its inputs, has recently gained traction in the database community resulting in the development of several models and their implementation in prototype systems. However, currently there is no system or model that supports transactional updates limiting the applicability of provenance to databases which are never updated. In this talk, I introduce reenactment, a novel declarative replay technique for transactional histories, and demonstrate how reenactment can be used to retroactively compute the provenance of past updates, transactions, and histories. The foundation of this research are MV-semirings, our extension of the well-established semiring provenance model for queries to updates and transactions running under multi-versioning concurrency control protocols. In this model, any transactional history (or part thereof) can be simulated through a query, i.e., any state of a relation R produced by a history can be reconstructed by a query. We call this process reenactment. More formally, the reenactment query for a transactional history H is equivalent (in the sense of query equivalence) to the history under MV-semiring semantics. These formal underpinnings are the basis of an efficient approach for computing provenance of past transactions using a standard relational DBMS. I will show how reenactment queries can be constructed from an audit log, a log of past SQL operations, and how queries with MV-semiring semantics can be encoded as standard relational queries. A naive implementation would either require replay of the complete history from the beginning or proactive materialization of provenance while transactions are run. However, as long as a transaction time history is available, reenactment can be started from any past history state. Since most modern DBMS support audit logs and time travel (querying transaction time histories) out of the box and these features incur only moderate overhead on transaction execution, this approach enables efficient provenance computation for transactions on-top of standard database systems. I present encouraging experimental results based on our implementation of these techniques in our GProM (Generic Provenance Middleware) provenance database middleware.
bio: |
Boris Glavic is an Assistant Professor of Computer Science at the Illinois Institute of Technology where he leads the IIT database group (<a href="http://www.cs.iit.edu/%7Edbgroup/">http://www.cs.iit.edu/~dbgroup/</a>). Before coming to IIT, Boris spent to two years as a PostDoc in the <a href="http://www.cs.toronto.edu/">Department of Computer Science</a> at the <a href="http://www.utoronto.ca/">University of Toronto</a> working at the <a href="http://dblab.cs.toronto.edu/home/">Database Research Group</a> under <a href="http://www.cs.toronto.edu/%7Emiller">Renée J. Miller</a>. He received a Diploma (Master) in Computer Science from the <a href="http://www.informatik.rwth-aachen.de/">RWTH Aachen</a> in Germany, and a PhD in Computer Science from the University of Zurich in Switzerland being advised by <a href="http://www.ifi.uzh.ch/dbtg/Staff/Boehlen">Michael Böhlen</a> and <a href="http://people.inf.ethz.ch/alonso/">Gustavo Alonso</a>. Boris is a professed database guy enjoying systems research based on solid theoretical foundations. His main research interests are provenance and information integration. He has build several provenance-aware systems (see <a href="http://cs.iit.edu/%7Edbgroup/research/index.html">http://cs.iit.edu/~dbgroup/research/index.html</a>) including Perm (relational databases), Ariadne (stream processing), GProM (database provenance middleware), Vagabond, and LDV (database virtualization and repeatability). 
- when: Feb. 15
who: Zack Ives (UPenn)
what: Title TBD