CCO - 631 - Data Warehouse and OLAP
Total Credits: 8
Hours for Theoretical Classes: 60
Hours for Exercises or Seminars: 60
Objective
To present concepts, techniques and tools commonly used in analytical data processing applications. The course covers dimensional data modeling, discusses the differences between online analytical processing (OLAP) and online transaction processing (OLTP), and examines the main characteristics of OLAP solutions and their architectures.
Catalog Description
- Introduction to the concepts of Online Analytical Processing (OLAP): definition of Data Warehouse, Data Warehousing environment and OLAP
- Differences between Online Analytical Processing (OLAP) and Online Transaction Processing (OLTP)
- Data warehouse characteristics and organization
- Architecture of the data warehousing environment
- Dimensional data modeling: static and dynamic aspects
- Representation of the dimensional data model at the logical level using relational structures (ROLAP)
- Practical examples of dimensional data modeling
- Materialized views: concept, identification of which views to materialize, update and use in OLAP queries
- Data warehousing tools
- Advanced data warehouse concepts: spatial data warehouse (SDW), spatio-temporal data warehouse, techniques for improving the processing of OLAP and SOLAP (spatial OLAP) queries, such as data fragmentation, data partitioning, indexing and materialization of views
Main Bibliography
- BERSON, A., SMITH, S. Data Warehousing, Data Mining, and OLAP. McGraw-Hill, 1997. ISBN 0-07-006272-2.
- INMON, W.H. Building the Data Warehouse, 4th edition. Wiley Publishing Inc., 2005. ISBN 0-7645-9944-5.
- KIMBALL, R., CASERTA, J. The Data Warehouse ETL Toolkit: Practical Techniques for Extracting, Cleaning, Conforming, and Delivering Data. Wiley Publishing Inc., 2004. ISBN 0-7645-6757-8.
- IMHOFF, C., GALEMMO, N., GEIGER, J.G. Mastering Data Warehouse Design: Relational and Dimensional Techniques. Wiley Publishing Inc., 2003. ISBN 0-471-32421-3.
- MALINOWSKI, E.; ZIMÁNYI, E. Advanced Data Warehouse Design: From Conventional to Spatial and Temporal Applications. Springer, 1st edition, 2008. ISBN 978-3-540-74404-7 (Print), 978-3-540-74405-4 (Online).
Complementary Bibliography
- MUNDY, J., BECKER, B. The Data Warehouse Lifecycle Toolkit: Practical Techniques for Building Data Warehouse and Business Intelligence Systems, 2nd edition. Wiley Publishing Inc., 2008. ISBN 0-470-14977-9.
- O’NEIL, E.; O’NEIL, P.; WU, K. Bitmap Index Design Choices and Their Performance Implications. In: 11TH INTERNATIONAL DATABASE ENGINEERING AND APPLICATIONS SYMPOSIUM. Proceedings… Banff, Canada: IEEE Computer Society, 2007. p. 72-84.
- O'NEIL, P.; GRAEFE, G. Multi-table joins through bitmapped join indices. ACM SIGMOD Record, v.24, n.3, p.8-11, 1995.
- PAPADIAS, D.; KALNIS, P.; ZHANG, J.; TAO, Y. Efficient OLAP Operations in Spatial Data Warehouses. Proceedings of the 7th International Symposium On Spatial And Temporal Databases. Redondo Beach, CA, USA: Springer-Verlag, 2001. p.443-459.
- SARAWAGI, S. Indexing OLAP Data. Bulletin of the Technical Committee on Data Engineering, v.20, n.1, p.36-43, 1997.
- SIQUEIRA, T., CIFERRI, C., TIMES, V., OLIVEIRA, A., CIFERRI, R. The impact of spatial data redundancy on SOLAP query performance. Journal of the Brazilian Computer Society.
- SIQUEIRA, T., CIFERRI, R., TIMES, V., CIFERRI, C. A Spatial Bitmap-based Index for Geographical Data Warehouses. In: The 24th Annual ACM Symposium on Applied Computing, 2009, Honolulu, Hawaii, USA. Proceedings of the 24th Annual ACM Symposium on Applied Computing, 2009. v. 3. p. 1336-1342.
- SIQUEIRA, T., CIFERRI, R., TIMES, V., CIFERRI, C. Investigating the Effects of Spatial Data Redundancy in Query Performance over Geographical Data Warehouses. In: X Brazilian Symposium on Geoinformatics, 2008, Rio de Janeiro. Proceedings of the X Brazilian Symposium on GeoInformatics, 2008. p. 1-12.
- WREMBEL, R., KONCILIA, C. Data Warehouses and OLAP: Concepts, Architectures and Solutions, 1st edition. IRM Press, 2007. ISBN 1-59904-365-3.
- Articles from journals and scientific conferences on analytical data processing.