Schema Versioning in Data Warehouses: Enabling Cross-Version Querying via Schema Augmentation

Authors: 
Golfarelli, M; Lechtenborger, J; Rizzi, S; Vossen, G
Year: 
2006
Venue: 
Data and Knowledge Engineering (DKE), 2006
URL: 
http://dbms.uni-muenster.de/publications/downloads/dke-versioning.pdf
DOI: 
doi:10.1016/j.datak.2005.09.004
Citations: 
46
Attachment: 
Golfarelli2006SchemaVersioninginData.pdf (831.91 KB)

As several mature implementations of data warehousing systems are fully operational, a crucial role in preserving their up-to-dateness is played by the ability to manage the changes that the data warehouse (DW) schema undergoes over time in response to evolving business requirements. In this paper we propose an approach to schema versioning in DWs, where the designer may decide to undertake some actions on old data aimed at increasing the flexibility in formulating cross-version queries, i.e., queries spanning multiple schema versions. First, we introduce a representation of DW schemata as graphs of simple functional dependencies, and discuss its properties. Then, after defining an algebra of schema graph modification operations aimed at creating new schema versions, we discuss how augmented schemata can be introduced to increase flexibility in cross-version querying. Next, we show how a history of versions for DW schemata is managed and discuss the relationship between the temporal horizon spanned by a query and the schema on which it can consistently be formulated.
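As a rough illustration of the ideas in the abstract, the following minimal Python sketch models a schema as a graph whose arcs are simple functional dependencies, derives new versions through a few modification operations, and computes the schema on which a query spanning a time interval could be formulated. It is not the paper's formalism: the class SchemaGraph, the operations add_node, add_fd and del_node, the function queryable_schema, the attribute names, and the years are all hypothetical. The rule that a cross-version query may only use elements common to every version it spans is an assumption made for this sketch. Schema augmentation is not modeled; in this picture it would correspond to enriching older versions so that the common part becomes larger.

```python
from dataclasses import dataclass
from functools import reduce


@dataclass(frozen=True)
class SchemaGraph:
    """Attributes as nodes; an arc (a, b) stands for a simple FD a -> b."""
    nodes: frozenset = frozenset()
    arcs: frozenset = frozenset()

    # Each operation returns a new graph, so older versions stay untouched.
    def add_node(self, a):
        return SchemaGraph(self.nodes | {a}, self.arcs)

    def add_fd(self, a, b):
        if not {a, b} <= self.nodes:
            raise ValueError("both attributes must already be in the schema")
        return SchemaGraph(self.nodes, self.arcs | {(a, b)})

    def del_node(self, a):
        # Deleting an attribute also drops every dependency that mentions it.
        kept = frozenset(arc for arc in self.arcs if a not in arc)
        return SchemaGraph(self.nodes - {a}, kept)

    def intersect(self, other):
        return SchemaGraph(self.nodes & other.nodes, self.arcs & other.arcs)


def queryable_schema(history, start, end):
    """Common part of all versions valid somewhere in [start, end].

    history is a list of (valid_from, SchemaGraph) pairs sorted by
    valid_from; each version is assumed valid until the next one begins.
    """
    spanned = []
    for i, (valid_from, schema) in enumerate(history):
        valid_to = history[i + 1][0] if i + 1 < len(history) else float("inf")
        if valid_from <= end and start < valid_to:
            spanned.append(schema)
    if not spanned:
        raise ValueError("no schema version is valid in the requested interval")
    return reduce(SchemaGraph.intersect, spanned)


# Hypothetical history: v2 adds 'brand', v3 drops 'category'.
v1 = (SchemaGraph()
      .add_node("sale").add_node("product").add_node("category")
      .add_fd("sale", "product").add_fd("product", "category"))
v2 = v1.add_node("brand").add_fd("product", "brand")
v3 = v2.del_node("category")
history = [(2001, v1), (2003, v2), (2005, v3)]

narrow = queryable_schema(history, 2003, 2004)  # spans v2 only
wide = queryable_schema(history, 2002, 2006)    # spans v1, v2 and v3
```

In this sketch, the query over 2003 to 2004 sees the full schema of v2, while the query over 2002 to 2006 is restricted to the attributes sale and product and the single dependency between them. The wider the temporal horizon, the smaller the schema on which the query can be consistently formulated, which is the trade-off that motivates augmenting old versions.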