Login | Register

Supporting lineage tracing in mediator-based information integration systems


Supporting lineage tracing in mediator-based information integration systems

Taghizadeh-Azari, Ali (2005) Supporting lineage tracing in mediator-based information integration systems. Masters thesis, Concordia University.

[thumbnail of MR04452.pdf]
Text (application/pdf)
MR04452.pdf - Accepted Version


Information integration provides users with a uniform interface to multiple (possibly heterogeneous) data sources. Two main approaches to information integration are data warehousing and mediator-based. The problem of providing explanation for a query answer is referred to as lineage tracing. This problem has been studied extensively in the context of data warehouse systems, however, for mediator-based systems, this is identified as a research problem [HC'03]. In such a system, the mediator does not store data. This means for query processing as well as for tracing, the mediator has to "communicate" with the data sources. While this communication could be expensive, the real issue is that after a query is being processed, lineage tracing could be more difficult or even impossible, if the structure of some contributing sources changes, or if the content of such sources change or a source become unavailable. This means, to support lineage tracing, we need to collect "enough" data and metadata information during query processing. In this work, we study this problem, and introduce data structures and algorithms to support lineage tracing in two modes: batch and interactive. We have successfully developed a prototype, called ELIT , for Exploration and LIneage Tracing . We also study query optimization in the content of ELIT and implemented some basic optimization techniques. While more sophisticated techniques are required in this context, we believe the ideas proposed in this work lend themselves to useful analysis and tracing tools in mediator-based systems.

Divisions:Concordia University > Gina Cody School of Engineering and Computer Science > Computer Science and Software Engineering
Item Type:Thesis (Masters)
Authors:Taghizadeh-Azari, Ali
Pagination:ix, 91 leaves : ill. ; 29 cm.
Institution:Concordia University
Degree Name:M. Comp. Sc.
Program:Computer Science and Software Engineering
Thesis Supervisor(s):Shiri, Nematollaah
Identification Number:QA 76.9 U83T34 2005
ID Code:8260
Deposited By: Concordia University Library
Deposited On:18 Aug 2011 18:20
Last Modified:13 Jul 2020 20:03
Related URLs:
All items in Spectrum are protected by copyright, with all rights reserved. The use of items is governed by Spectrum's terms of access.

Repository Staff Only: item control page

Downloads per month over past year

Research related to the current document (at the CORE website)
- Research related to the current document (at the CORE website)
Back to top Back to top