CCO - 631 - Data Warehouse and OLAP

Total Credits: 8
Hours for Theoretical Classes: 60
Hours for Exercises or Seminars: 60

Objective

To present the concepts, techniques, and tools commonly used in analytical data processing applications. The course covers dimensional data modeling and discusses the differences between online analytical processing (OLAP) and online transaction processing (OLTP). The main characteristics of OLAP solutions and their architectures are also studied.

Catalog Description

  • Introduction to the concepts of Online Analytical Processing (OLAP): definition of Data Warehouse, Data Warehousing environment and OLAP
  • Differences between Online Analytical Processing (OLAP) and Online Transaction Processing (OLTP)
  • Data warehouse characteristics and organization
  • Architecture of the data warehousing environment
  • Dimensional data modeling: static and dynamic aspects
  • Representation of the dimensional data model at the logical level using relational structures (ROLAP)
  • Practical examples of dimensional data modeling
  • Materialized views: concept, identification of which views to materialize, update and use in OLAP queries
  • Data warehousing tools
  • Advanced data warehouse concepts: spatial data warehouse (SDW), spatio-temporal data warehouse, techniques for improving the processing of OLAP and SOLAP (spatial OLAP) queries, such as data fragmentation, data partitioning, indexing and materialization of views

Main Bibliography

  1. BERSON, A., SMITH, S. Data Warehousing, Data Mining, and OLAP. McGraw-Hill, 1997. ISBN 0-07-006272-2.
  2. INMON, W.H. Building the Data Warehouse, 4th edition. Wiley Publishing Inc., 2005. ISBN 0-7645-9944-5.
  3. KIMBALL, R., CASERTA, J. The Data Warehouse ETL Toolkit: Practical Techniques for Extracting, Cleaning, Conforming, and Delivering Data. Wiley Publishing Inc., 2004. ISBN 0-764-56757-8.
  4. IMHOFF, C., GALEMMO, N., GEIGER, J.G. Mastering Data Warehouse Design: Relational and Dimensional Techniques. Wiley Publishing Inc., 2003. ISBN 0-471-32421-3.
  5. MALINOWSKI, E.; ZIMÁNYI, E. Advanced Data Warehouse Design: From Conventional to Spatial and Temporal Applications. Springer, 1st edition, 2008. ISBN 978-3-540-74404-7 (Print), 978-3-540-74405-4 (Online).

Complementary Bibliography

  1. MUNDY, J., BECKER, B. The Data Warehouse Lifecycle Toolkit: Practical Techniques for Building Data Warehouse and Business Intelligence Systems, 2nd edition. Wiley Publishing Inc., 2008. ISBN 0-470-14977-9.
  2. O’NEIL, E.; O’NEIL, P.; WU, K. Bitmap Index Design Choices and Their Performance Implications. In: 11TH INTERNATIONAL DATABASE ENGINEERING AND APPLICATIONS SYMPOSIUM. Proceedings… Banff, Canada: IEEE Computer Society, 2007. p. 72-84. 
  3. O'NEIL, P.; GRAEFE, G. Multi-table joins through bitmapped join indices. ACM SIGMOD Record, v.24, n.3, p.8-11, 1995.
  4. PAPADIAS, D.; KALNIS, P.; ZHANG, J.; TAO, Y. Efficient OLAP Operations in Spatial Data Warehouses. Proceedings of the 7th International Symposium On Spatial And Temporal Databases. Redondo Beach, CA, USA: Springer-Verlag, 2001. p.443-459.
  5. SARAWAGI, S. Indexing OLAP Data. Bulletin of the Technical Committee on Data Engineering, v.20, n.1, p.36-43, 1997.
  6. SIQUEIRA, T., CIFERRI, C., TIMES, V., OLIVEIRA, A., CIFERRI, R. The impact of spatial data redundancy on SOLAP query performance. Journal of the Brazilian Computer Society. 
  7. SIQUEIRA, T., CIFERRI, R., TIMES, V., CIFERRI, C. A Spatial Bitmap-based Index for Geographical Data Warehouses. In: The 24th Annual ACM Symposium on Applied Computing, 2009, Honolulu, Hawaii, USA. Proceedings of the 24th Annual ACM Symposium on Applied Computing, 2009. v. 3. p. 1336-1342.
  8. SIQUEIRA, T., CIFERRI, R., TIMES, V., CIFERRI, C. Investigating the Effects of Spatial Data Redundancy in Query Performance over Geographical Data Warehouses. In: X Brazilian Symposium on Geoinformatics, 2008, Rio de Janeiro. Proceedings of the X Brazilian Symposium on GeoInformatics, 2008. p. 1-12.
  9. WREMBEL, R., KONCILIA, C. Data Warehouses and OLAP: Concepts, Architectures and Solutions, 1st edition. IRM Press, 2007. ISBN 1-59904365-3.
  10. Journal articles and scientific conference papers on analytical data processing.