Data-Driven Understanding and Refinement of Schema Mappings

Authors: 
Yan, Ling Ling; Miller, Renee; Haas, Laura; Fagin, Ronald
Author: 
Yan, L
Miller, R
Haas, L
Fagin, R
Year: 
2001
Venue: 
Proc SIGMOD, 2001, p.485-496
URL: 
http://www.almaden.ibm.com/cs/people/fagin/sigmod01.pdf
DOI: 
http://doi.acm.org/10.1145/375663.375729
Citations: 
218
Citations range: 
100 - 499
AttachmentSize
Yan2001DataDrivenUnderstandingand.pdf256.72 KB

At the heart of many data-intensive applications is the problem of quickly and accurately transforming data into a new form. Database researchers have long advocated the use of declarative queries for this process. Yet tools for creating, managing and understanding the complex queries necessary for data transformation are still too primitive to permit widespread adoption of this approach. We present a new framework that uses data examples as the basis for understanding and refining declarative schema mappings. We identify a small set of intuitive operators for manipulating examples. These operators permit a user to follow and refine an example by walking through a data source. We show that our operators are powerful enough both to identify a large class of schema mappings and to distinguish effectively between alternative schema mappings. These operators permit a user to quickly and intuitively build and refine complex data transformation queries that map one data source into another.