Lecture “Data Warehousing and Data Mining Techniques”

Information
Classification: 
Master Informatik, Master Wirtschaftsinformatik
Credits: 
4 or 5 (depending on course of study and exam regulations)
Regular Dates: 
Every Thursday, from 09:45 till 12:15, starting the 16th of April
The lecture takes place at the Informatikzentrum, Mühlenpfordtstraße 23, room 161
Contents

Exam review (Einsicht): 3rd of September, from 14:00 till 15:00 in room IZ 251.

In this course, we examine the aspects of building, maintaining, and operating data warehouses and give an insight into the main knowledge discovery techniques. The course deals with basic issues like the storage of data, execution of analytical queries and data mining procedures.

This course will be taught entirely in English.

The general structure of the course is as follows:

  • Typical DW use case scenarios
  • Basic architecture of DW
  • Data modelling on conceptual, logical and physical levels
  • Multidimensional E/R modelling
  • Cubes, dimensions, measures
  • Query processing, OLAP queries (OLAP vs OLTP), roll-up, drill-down, slice, dice, pivot
  • MOLAP, ROLAP, HOLAP
  • SQL99 OLAP operators, MDX
  • Snowflake, star and starflake schemas for relational storage
  • Multidimensional physical storage (linearization)
  • DW indexing as a means of search optimization: R-trees, UB-trees, bitmap indexes
  • Other optimization procedures: data partitioning, star join optimization, materialized views
  • ETL
  • Association rule mining, sequence patterns, time series
  • Classification: decision trees, naive Bayes classification, SVM
  • Cluster analysis: K-means, hierarchical clustering, agglomerative clustering, outlier analysis

The written exam will be held on the 26th of August 2015 at 12:00 noon in room ZI 24.2 (Grotriangebaeude 2).

Note: Bring a calculator to the exam.

Materials

Download

Date | Topic | Slides | Exercises | Videos
16.04 | Introduction | Slides - Print Slides | None | Video
23.04 | Architecture | Slides - Print Slides | None | Video
30.04 | Modeling | Slides - Print Slides | None | Video
07.05 | Indexes | Slides - Print Slides | Exercise - Solution | Video
14.05 | NO LECTURE - Christi Himmelfahrt | - | - | -
21.05 | Optimization | Slides - Print Slides | None | Video
28.05 | NO LECTURE - Excursion Week | - | - | -
04.06 | OLAP Operations & Queries | Slides - Print Slides | None | Video
11.06 | Build the DW, ETL | Slides - Print Slides | Exercise | Video (the Google Refine video is uploaded in the attachment section below)
18.06 | Real-Time DW | Slides - Print Slides | None | Video
25.06 | DM Overview & Association Rule Mining | Slides - Print Slides | None | Video
02.07 | Sequence patterns & Time series | Slides - Print Slides | Exercise - Solution | Video
09.07 | Classification | Slides - Print Slides | None | Video
16.07 | Clustering | Slides - Print Slides | None | Video
23.07 | Meta-Algorithms for Classification | Slides - Print Slides | None | Video

Attachment | Date | Size
DW-SS-2015-VL2.mp4 | 24/04/15 2:02 pm | 118.43 MB
Google Refine 2.0 - Data Transformation.mp4 | 11/06/15 3:27 pm | 64.5 MB