Datenbankenlernen.de

Database Systems



0.1 Course Overview and Motivation

0.1.1 The Truth about Databases

Why are database systems a vertical topic? What are those vertical topics? Is this lecture only about database systems, or about software in general?

Additional Material
Further Reading
0.1.2 Architecture of a DBMS

What are the different software layers of a database system? How do they relate to the different vertical topics? What are the major tasks of store, indexer, and query optimizer? What are system aspects? What is the conflict of computation versus data access about? And how does this conflict relate to the different layers of a database system?

0.1.3 Structure of this Course

How is the material covering the different database layers mapped to the weeks of the semester? What exactly does the recursive/iterative structure mean?


0.2 History of Relational Databases

0.2.1 A Footnote about the Young History of Database Systems

Why would it make sense to use a database system anyway? How did it all start? What was a major change introduced by the relational model? What were major developments in database history?

0.2.2 Relational Database --- A Practical Foundation of Productivity

What was the problem with data management in the 1960s? What is associative addressing in the context of a database system? What is a data model? In the relational model, does the order of the columns in a schema matter? In the relational model, does the order of the rows in a relation matter? What is the primary goal of relational processing? Is relational algebra intended to be used as a language for end-users?

Material
Literature

1.1 Storage Hierarchies

1.1.1 A Simple Storage Hierarchy

What are the properties of ideal computer memory? What is the core idea of the storage hierarchy? What is the relationship of storage capacity and access time to the distance to the computing core? What is the relationship of costs/bandwidth to the distance to the computing core? What are typical access times? How does this translate to relative distances? What are typical sizes of the different storage levels? How do they translate to relative distances? What are the tasks of each level of the storage hierarchy? How would I use a storage hierarchy to cache only reads? How do I use a storage hierarchy to cache both reads and writes? What is inclusion? What is a data replacement strategy?

Additional Material
Literature
Further Reading
1.1.2 The All Levels are Equal Pattern

What is the central observation here? What does this mean for practical algorithms? How can I exploit this to solve a problem on a specific layer of the storage hierarchy? How to misunderstand this pattern?

1.1.3 Multicore Storage Hierarchies, NUMA

How are the two terms core and CPU related? What does a typical multicore storage hierarchy look like? What is different compared to the simple storage hierarchy? Why isn't L1 shared as well among multiple cores? What is a Non-Uniform Memory Access (NUMA) architecture? What is different compared to the multicore architecture? What does this imply for accesses to DRAM? Bonus Question: How again does this relate to The All Levels Are Equal?

Additional Material
Further Reading

1.2 Storage Media

1.2.1 Tape

What are the major properties of tape? Why would tape still be important these days? What is a tape jukebox? How does it work? What does this imply for access times?

Additional Material
Literature
1.2.2 Hard Disks, Sectors, Zone Bit Recording, Sectors vs Blocks, CHS, LBA, Sparing

Why would hard disks still be important these days? What is a platter? What are a disk head and a disk arm? How do tracks and cylinders relate? What is the difference between a circular sector and an HD sector? What is zone bit recording? What does this imply for sequential access? Where are self-correcting blocks used? What is the difference between HD sectors and operating system blocks? What is physical cylinder/head/sector-addressing? What is logical cylinder/head/sector-addressing? How does it relate to the former? What is logical block addressing? What is sparing? What does this imply for sequential accesses?

Additional Material
Literature
Further Reading
Video
1.2.3 Hard Disks, Sequential Versus Random Access

What exactly is a random access? What are its major components? What is a sequential access? How to estimate the costs of a sequential access? What is track skewing? How did hard disks evolve over the last decades? What do we learn from that? Bonus Question: What does this imply for index structures?

1.2.4 Hard Disks, Controller Caching

What exactly is the hard disk cache? How is it related to the storage hierarchy? What do we gain by using this cache? What do we lose? What does this imply when storing data on disk? What is the elevator optimization?

1.2.5 The Batch Pattern

What is the central observation of the batch pattern? What are the advantages? What are the disadvantages? What does this mean for practical algorithms? What are possible applications?

1.2.6 Hard Disk Failures and RAID 0, 1, 4, 5, 6

Why should I worry about hard disk failures? What is the core idea of RAID? What is the impact on performance of RAID 0? Is my data safer by using RAID 0? What is RAID 1? How much space does it waste? What is the difference between RAID 4 and RAID 5? How many disks do I need for a RAID 4 or 5 system? Which sequential read performance can I expect in a system with n disks? And which write performance? How many disk failures may a RAID 4 or 5 system survive? What is the difference of RAID 6 compared to RAID 5?
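The parity idea behind RAID 4/5 can be made concrete in a few lines: the parity block is the bit-wise XOR of the data blocks, so any single lost block is recovered by XOR-ing the surviving blocks with the parity block. A minimal Python sketch (the function name is mine, not from the lecture material):

```python
def xor_blocks(blocks):
    # bit-wise XOR of equally sized blocks (the RAID 4/5 parity operation)
    result = bytearray(len(blocks[0]))
    for block in blocks:
        for i, byte in enumerate(block):
            result[i] ^= byte
    return bytes(result)

data = [b"AAAA", b"BBBB", b"CCCC"]   # blocks on three data disks
parity = xor_blocks(data)            # stored on the parity disk

# disk 1 fails: XOR the surviving blocks with the parity block
recovered = xor_blocks([data[0], data[2], parity])
```

Since XOR-ing a block with itself cancels out, `recovered` equals the lost block; this also shows why a RAID 4/5 array survives exactly one disk failure.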

Additional Material
Literature
Further Reading
1.2.7 Nested RAID Levels 1+0, 10, 0+1, 01

What is RAID 10? Why would we call this nested RAID? And what is RAID 01 then? Is it possible, in principle, to combine any kinds of RAID-levels?

1.2.8 The Data Redundancy Pattern

Why is data redundancy good? And for what exactly? Why is data redundancy bad? And for what exactly? Can you list three examples where data redundancy is used for good?

Additional Material
Literature
Further Reading
1.2.9 Flash Memory and Solid State Drives (SSDs)

What are the major properties of flash devices? What is an SSD? What is the major differentiator over hard disks w.r.t. performance? In an SSD, what are blocks and superblocks? What does this mean for write operations? What is write amplification? How would this affect the storage layer of a database system? Why is TRIM needed? What is the job of the SSD controller? How does it relate to RAID? How does it relate to volatile memory? Which sequential bandwidth can we expect these days? And which random access time?

1.2.10 Example Hard Disks, SSDs and PCI-connected Flash Memory

Again, what are typical performance characteristics of hard disks and SSDs? Would you buy an SAS disk for your PC? What is a PCI flash drive? Why would that be a good idea? Why do I get considerably more random read operations per second than one divided by the random access time?


1.3 Fundamentals of Reading and Writing in a Storage Hierarchy

1.3.1 Pulling Up and Pushing Down Data, Database Buffer, Blocks, Spatial vs Temporal Locality

What does pulling up data mean? What does pushing down data mean? Did you see the database buffer anywhere? What is temporal locality? And what is spatial locality then? How are the two related?

Material
Literature
Additional Material
Literature
Further Reading
1.3.2 Methods of the Database Buffer, Costs, Implementation of GET

What are the most important methods of the database buffer? What does GET do? What may happen when requesting a page that is not in the buffer? What are the costs involved? Who would evict a page?
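The behavior of GET can be sketched with a tiny buffer in Python. This is an illustrative model only (assumptions: pages are plain values, the "disk" is a dict, LRU is the replacement strategy; all class and method names are mine):

```python
from collections import OrderedDict

class Buffer:
    """Tiny database buffer: GET pulls a page up, evicting LRU on overflow."""
    def __init__(self, capacity, disk):
        self.capacity = capacity
        self.disk = disk                    # page_id -> content (simulated store)
        self.frames = OrderedDict()         # buffered pages, LRU order
        self.misses = 0

    def get(self, page_id):
        if page_id in self.frames:          # hit: refresh the LRU position
            self.frames.move_to_end(page_id)
            return self.frames[page_id]
        self.misses += 1                    # miss: page must be fetched
        if len(self.frames) >= self.capacity:
            self.frames.popitem(last=False) # evict the least recently used page
        self.frames[page_id] = self.disk[page_id]
        return self.frames[page_id]

disk = {i: f"page-{i}" for i in range(10)}
buf = Buffer(2, disk)
buf.get(0); buf.get(1); buf.get(0); buf.get(2)   # the last GET evicts page 1
```

Each miss corresponds to an expensive pull-up from the lower level of the storage hierarchy; in a real system the evicted page may additionally have to be written back if it is dirty.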

1.3.3 Pushing Down Data in the Storage Hierarchy (aka Writing), update in-place, deferred update

What is direct write? What is indirect write? What are their pros and cons?

1.3.4 Twin Block, Fragmentation

What is the core idea of twin block? What are its pros and cons?

1.3.5 Shadow Storage

What is the core idea of shadow storage? What are its pros and cons? What about fragmentation in shadow storage? Where is this used outside databases?

1.3.6 The Copy On Write Pattern (COW)

When is this pattern applicable? What is the core idea? Where is it applied?

1.3.7 The Merge on Write Pattern

When is this pattern applicable? What is the core idea? Where is it applied? What is the relationship to COW?

1.3.8 Differential Files, Merging Differential Files

What is the main analogy from real life for differential files? To which other patterns does this relate? What are its pros and cons? How to merge differential files with the read-only DB without halting the database?

1.3.9 Logged Writes, Differential Files vs Logging

What is the difference of logging and differential files? What are its pros and cons? Is it possible to combine logging and differential files?

1.3.10 The No Bits Left Behind Pattern

Why shouldn't we leave any bits behind? Or in other words: why should we avoid wasting bits? What does this mean for data layouts? Why would I cluster data with spatial locality on the same virtual memory page, disk page, disk sector, cache line? Why would I keep data with little spatial locality on different virtual memory pages, disk pages, disk sectors, cache lines?


1.4 Virtual Memory

1.4.1 Virtual Memory Management, Page Table, Prefix Addressing

How are virtual memory addresses translated to physical addresses? How does it work? What is the role of the page table? What is prefix addressing?

Additional Material
Literature
1.4.2 Retrieving Memory Addresses, TLB

What happens when referencing a specific virtual memory address? What kind of translation lookaside buffer (TLB) and cache misses may occur?

Additional Material
Literature

2.1 Overview


What are the principal mapping steps from relations to devices? Which of those steps are related to data layouts? Why linearize tuples? Why linearize values? And which comes first?


2.2 Page Organizations

2.2.1 Slotted Pages: Basics

What is the core idea of slotted pages? What is a slot? What is the relationship to physical data independence? Can I move data inside a slotted page? Like how? What is a forward tuple-ID? What are they good for? What might be a problem?

Additional Material
Literature
2.2.2 Slotted Pages: Fixed-size versus Variable-size Components

How to organize fixed-size components on a slotted page? Where to store the slot array then? What is linear addressing? What can we do with variable-sized components? Can we move them around on a page?

Additional Material
Literature
2.2.3 Finding Free Space

How would we find some free space for new stuff? What is a segment?

Additional Material
Literature

2.3 Table Layouts

2.3.1 Data Layouts: Row Layout vs Column Layout

Why would we linearize data values in row layout? Why would we linearize data values in column layout? Which type of layout is better for which type of query? What is a row store? What is a column store?

2.3.2 Options for Column Layouts, Explicit vs Implicit key, Tuple Reconstruction Joins

What are correlated columns? Could they also be uncorrelated? And what would that imply? What is an implicit key? What does this mean for tuple reconstruction joins? What is an explicit key? What does this mean for tuple reconstruction joins? How do I get from explicit key to implicit key?

2.3.3 Fractured Mirrors, (Redundant) Column Grouping, Vertical Partitioning, Bell Numbers

What is the relationship of fractured mirrors to row and column layouts? Why would this make sense? This is good for which type of queries again? What are the drawbacks? What is column grouping and its relationship to vertical partitioning? Which type of queries would benefit from this? How would we introduce data redundancy to column grouping? How many vertical partitionings are there? What has this to do with The Data Redundancy Pattern?

Additional Material
Literature
2.3.4 PAX, How to choose the optimal layout?

What is PAX about? How does this relate to horizontal partitioning? How does the horizontal partitioning relate to blocks? How does PAX relate to column layouts? What would this be good for? And how do I get to the optimal data layout anyway?

Additional Material
Literature
2.3.5 The Fractal Design Pattern

What is fractal (self-similar) design about in the context of database systems? Why would this help me to devise effective solutions to problems? How does this relate to The All Levels are Equal Pattern? Can you name a couple of examples of fractal design in databases that we have already seen?


2.4 Compression

2.4.1 Benefits of Compression in a Database, Lightweight Compression, Compression Granularities

Compression is mainly about saving storage space, right? But wait: wasn't there something more important? What is the major trade-off you have to keep in mind here? Compressing data costs something in addition! You can only lose w.r.t. overall query response times, right? What are compression granularities? How do they affect accessibility and compression ratio of your data (in general)?

Additional Material
Literature
Further Reading
2.4.2 Dictionary Compression, Domain Encoding

What is a dictionary? What do we gain by using one? How does this relate to CREATE DOMAIN? How would dictionaries affect query processing? Is a dictionary something a user has to be aware of? How does dictionary compression relate to domain encoding? What are the pros and cons of dictionary compression?
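The core of dictionary compression fits in a few lines: map each distinct value to a small integer code and store only the codes. A minimal Python sketch (names are mine; real systems use fixed-width bit-packed codes):

```python
def dict_encode(column):
    # build a sorted dictionary: value -> small integer code
    dictionary = {v: code for code, v in enumerate(sorted(set(column)))}
    return dictionary, [dictionary[v] for v in column]

col = ["Paris", "Rome", "Paris", "Paris", "Rome"]
d, codes = dict_encode(col)

# decoding inverts the dictionary; queries can often work on codes directly
decode = {code: v for v, code in d.items()}
restored = [decode[c] for c in codes]
```

Because the dictionary is sorted, order-preserving comparisons (e.g. range predicates) can be evaluated on the integer codes without decompressing.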

2.4.3 Run Length Encoding (RLE)

How does the run-length encoding method work? What is its relationship to sorting? To lexicographical sorting? What might be possible pros and cons?
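As a concrete illustration, a minimal run-length encoder/decoder in Python (function names are mine). Note how a sorted column collapses into very few runs, which is exactly the connection to sorting asked about above:

```python
def rle_encode(values):
    runs = []
    for v in values:
        if runs and runs[-1][0] == v:
            runs[-1][1] += 1          # extend the current run
        else:
            runs.append([v, 1])       # start a new run
    return runs

def rle_decode(runs):
    return [v for v, n in runs for _ in range(n)]

col = ["a", "a", "a", "b", "b", "a"]
runs = rle_encode(col)
```

Unsorted data with few repeats can even grow under RLE (one run per value), which is one of the cons to keep in mind.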

2.4.4 7Bit Encoding

What is the major idea of this method? How would this relate to domain encoding? How much storage space do I lose? How much do I gain?
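The idea of 7-bit encoding can be sketched as a variable-length integer encoding: each byte stores 7 payload bits, and the high bit signals whether more bytes follow. A minimal Python sketch (names are mine; this is the same scheme known elsewhere as varint encoding):

```python
def varint_encode(n):
    """Encode a non-negative int: 7 payload bits per byte, high bit = 'more'."""
    out = bytearray()
    while True:
        byte = n & 0x7F
        n >>= 7
        if n:
            out.append(byte | 0x80)   # continuation bit set: more bytes follow
        else:
            out.append(byte)          # last byte: continuation bit cleared
            return bytes(out)

def varint_decode(data):
    n, shift = 0, 0
    for byte in data:
        n |= (byte & 0x7F) << shift
        shift += 7
        if not byte & 0x80:
            return n
```

We lose one bit per byte (the continuation flag), but small values that dominate many domains shrink from 4 or 8 bytes to 1 or 2.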

Additional Material
Further Reading

3.1 Motivation for Index Structures, Selectivities, Scan vs. Index Access on Disk and Main Memory


What are the major analogies of indexing in real life? Why index? What is selectivity? What is high selectivity? What is low selectivity?

Additional Material
Literature
Further Reading

3.2 B-Trees

3.2.1 Three Reasons for Using B-Tree Indexes, Intuition, Properties, find(), ISAM, find_range()

The three reasons for using B-tree indexes are ... ? What is the intuition for B-tree indexes? What is a node? What is a leaf? How many keys do they have? What happens if they are the root? What is k? What is k*? What are the major properties of a B-tree? What is the relationship of B-trees to interval partitioning? How do we search for a particular key in a B-tree, i.e. how do we execute a point query? How do we search for a particular interval in a B-tree, i.e. how do we execute a range query? What is the index sequential access method (ISAM)? How does it relate to point and range queries?

Additional Material
Literature
3.2.2 B-Tree insert, split, delete, merge

How do inserts in real B-trees work? Why would I split a leaf or a node? And how does this split work in principle? How do I implement the split operation in an object-oriented programming language? What does this mean for leaf and node splits? How do I delete data from a B-tree and its leaves and nodes? What may happen then? How are delete() and insert() related? How are merge() and split() related?

Additional Material
Literature
3.2.3 Clustered, Unclustered, Dense, Sparse, Coarse-Granular Index

What is a clustered index? How many clustered indexes are possible for a table? What is an unclustered index? How many unclustered indexes are possible for a table? What is a dense index? What is a sparse index? And how are they related? How is a coarse-granular index related to a sparse index? How is a sparse index related to having no index at all?

Additional Material
Literature
3.2.4 Covering and Composite Index, Duplicates, Overflow Pages, Composite Keys

How is a covering index related to an unclustered index? How is it related to a clustered index? What exactly is covered? What is a composite index? How does it relate to a covering index? How do we build a duplicate index in a B-tree? What is an overflow page? How are composite keys related to duplicates?

Additional Material
Literature
3.2.5 Bulk-loading B-Trees or other Tree-structured Indexes

How do I bulkload a B-tree? And why would I want to do this? What should be kept in mind for the free space in nodes and leaves when bulkloading?

Additional Material
Literature

3.3 Performance Measurements in Computer Science


How to determine which algorithm, index, or whatever is better? O-Notation is good enough, right? What are the three ways to measure the performance of a computer program? What is asymptotic complexity? What is a cost model? What is a simulation? What might be the problem of all of the former methods? What is an experiment? What could be its problem? How are the three ways to measure the performance of a computer program correlated to effort/cost, reality, and generalizability?

Additional Material
Literature
Further Reading

3.4 Static Hashing, Array vs Hash, Collisions, Overflow Chains, Rehash


Why do some people say that the three most important techniques in computer science are hashing, hashing, and hashing? What is the core idea of hashing? What is the runtime complexity? How does it relate to arrays? What is a collision? How can I handle it? Why would I rehash? And what does that mean?
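The core mechanics (hash to a bucket, chain collisions, rehash when chains get long) can be sketched in Python. This is an illustrative model only; class name and the rehash threshold are mine:

```python
class HashTable:
    """Static hashing with overflow chains; rehash doubles the bucket array."""
    def __init__(self, num_buckets=4):
        self.buckets = [[] for _ in range(num_buckets)]
        self.size = 0

    def put(self, key, value):
        chain = self.buckets[hash(key) % len(self.buckets)]
        for entry in chain:
            if entry[0] == key:             # key exists: update in place
                entry[1] = value
                return
        chain.append([key, value])          # collision handling: overflow chain
        self.size += 1
        if self.size > 2 * len(self.buckets):
            self._rehash()                  # chains grow too long: rehash

    def get(self, key):
        for k, v in self.buckets[hash(key) % len(self.buckets)]:
            if k == key:
                return v
        return None

    def _rehash(self):
        old = self.buckets
        self.buckets = [[] for _ in range(2 * len(old))]
        for chain in old:
            for k, v in chain:              # every entry gets a new bucket
                self.buckets[hash(k) % len(self.buckets)].append([k, v])

t = HashTable()
for i in range(20):
    t.put(i, i * i)
```

Lookups stay O(1) expected as long as rehashing keeps the chains short; the rehash itself is the O(n) price paid occasionally for that.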

Additional Material
Literature
Further Reading

3.5 Bitmaps

3.5.1 Value Bitmaps

What is the core idea of a bitmap? What is a bitlist? What is the size of an uncompressed bitmap? What are typical bitmap operations and why are they very efficient? How is the cardinality of a domain related to the size of the bitmap? What are applications of bitmaps?
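As a concrete illustration, a value bitmap index in Python: one bitlist per distinct value of the domain, and predicates become bit-wise operations (names are mine; real systems operate on machine words, not Python lists):

```python
def build_bitmaps(column):
    # one bitlist per distinct value of the domain
    bitmaps = {v: [0] * len(column) for v in set(column)}
    for i, v in enumerate(column):
        bitmaps[v][i] = 1
    return bitmaps

col = ["red", "blue", "red", "green", "blue"]
bm = build_bitmaps(col)

# the predicate "color = red OR color = blue" is a bit-wise OR of two bitlists
red_or_blue = [a | b for a, b in zip(bm["red"], bm["blue"])]
```

The example also shows why domain cardinality matters: every additional distinct value costs one more bitlist of full column length (before compression).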

Additional Material
Literature
Further Reading
3.5.2 Decomposed Bitmaps

When does a decomposed bitmap make sense? What is the core idea of a decomposed bitmap? What does this imply for bitmap operations? What do we gain in terms of storage space?

3.5.3 Word-Aligned Hybrid Bitmaps (WAH)

What is the relationship of word-aligned hybrid bitmaps (WAH) to 7-bit encoding and RLE? How does WAH work? How can we use this to compress a bitlist? How do we decompress?

Additional Material
Further Reading
Java Library
3.5.4 Range-Encoded Bitmaps

What is a range-encoded bitmap? What changes over the standard uncompressed bitmap? Which type of queries can be efficiently supported by range-encoded bitmaps? Can we also execute a point query? How exactly? What are the space requirements when compared to the standard uncompressed bitmap?

3.5.5 Approximate Bitmaps, Bloom Filters

What is the core idea of a Bloom filter? When can it be applied? What is a false positive? What are the lookup semantics of a Bloom filter?
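The lookup semantics ("definitely not contained" vs "maybe contained") can be sketched in a small Python Bloom filter. Names and parameters are mine; SHA-256 is used only to derive k deterministic hash functions for the sketch:

```python
import hashlib

class BloomFilter:
    """Approximate membership test: a negative answer is always correct,
    a positive answer may be a false positive, never a false negative."""
    def __init__(self, m=64, k=3):
        self.m, self.k, self.bits = m, k, 0

    def _positions(self, item):
        # k deterministic hash functions derived from SHA-256
        return [int.from_bytes(
                    hashlib.sha256(f"{i}:{item}".encode()).digest()[:4],
                    "big") % self.m
                for i in range(self.k)]

    def add(self, item):
        for p in self._positions(item):
            self.bits |= 1 << p          # set k bits for the item

    def might_contain(self, item):
        # contained only if ALL k bits are set; any zero bit proves absence
        return all(self.bits >> p & 1 for p in self._positions(item))

bf = BloomFilter()
for word in ["tape", "disk", "ssd"]:
    bf.add(word)
```

Inserted items always answer "maybe"; the false-positive rate is tuned via the bitmap size m and the number of hash functions k.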

Material
Literature
Additional Material
Literature

4.1 Join Algorithms

4.1.1 Applications of Join Algorithms, Nested-Loop Join, Index Nested-Loop Join

Why are joins important? What are typical applications in data management? What is the possible impact of joins on query performance? What are the four principal classes of join algorithms? What is nested-loop join (NL)? What is its runtime complexity? For which type of join predicates does it work? What is index nested-loop join (INL)? What is required to run this algorithm? For which type of join predicate does it work? What is the runtime complexity of this algorithm?
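The contrast between the two algorithms can be sketched in a few lines of Python (function names are mine): nested-loop join works for arbitrary predicates at quadratic cost, while index nested-loop join exploits an index, here a transient hash index on S, but only for equality predicates:

```python
def nested_loop_join(R, S, theta):
    # O(|R| * |S|): works for arbitrary join predicates theta
    return [(r, s) for r in R for s in S if theta(r, s)]

def index_nested_loop_join(R, S, key_r, key_s):
    # build a hash index on S, then probe it for every tuple of R;
    # near-linear cost, but restricted to equality predicates
    index = {}
    for s in S:
        index.setdefault(key_s(s), []).append(s)
    return [(r, s) for r in R for s in index.get(key_r(r), [])]

R = [(1, "Alice"), (2, "Bob")]
S = [(1, "Math"), (1, "DB"), (3, "AI")]
out = index_nested_loop_join(R, S, lambda r: r[0], lambda s: s[0])
```

In a real system the index would typically be a persistent B-tree or hash index on the inner relation rather than one built on the fly.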

Additional Material
Literature
4.1.2 Simple Hash Join

How does simple hash join (SHJ) work? What is the relationship to index nested-loop join?

Additional Material
Literature
4.1.3 Sort-Merge Join, Co-Grouping

How does sort-merge join (SMJ) work? What does the algorithm do in principle? What has to be considered when deciding which pointer to move forward? How would I treat a situation where duplicates exist in both join columns? What is the relationship of joins and CoGroups? What is CoGrouping?
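The pointer-movement logic, including the duplicate case asked about above, can be sketched in Python (names are mine). When both sides carry the same key, the algorithm collects the complete co-group on each side and emits their cross product:

```python
def sort_merge_join(R, S, kr, ks):
    R, S = sorted(R, key=kr), sorted(S, key=ks)
    out, i, j = [], 0, 0
    while i < len(R) and j < len(S):
        if kr(R[i]) < ks(S[j]):
            i += 1                      # advance the pointer on the smaller key
        elif kr(R[i]) > ks(S[j]):
            j += 1
        else:
            # equal keys: collect the co-group on both sides so that
            # duplicates in both join columns are handled correctly
            key, i2, j2 = kr(R[i]), i, j
            while i2 < len(R) and kr(R[i2]) == key:
                i2 += 1
            while j2 < len(S) and ks(S[j2]) == key:
                j2 += 1
            out.extend((r, s) for r in R[i:i2] for s in S[j:j2])
            i, j = i2, j2
    return out

R = [(1, "a"), (2, "b"), (2, "c")]
S = [(2, "x"), (2, "y"), (3, "z")]
out = sort_merge_join(R, S, lambda r: r[0], lambda s: s[0])
```

The two inner while-loops are exactly the co-grouping step: each matching key yields one co-group pair before the join of that pair is produced.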

Additional Material
Literature
4.1.4 Generalized Co-Grouped Join (on Disk, NUMA, and Distributed Systems)

Why are CoGroups a general property of joins rather than a property of a specific join algorithm? What is the core idea of generalized co-grouped join? Which three special cases can be implemented with this algorithm? What does the group()-function do in the algorithm? Which phases can be identified in the algorithm? Which algorithm executes the actual join then? Could that be a cross product? How can Generalized Co-Grouped Join be optimized to avoid calling that algorithm in certain cases anyhow?

4.1.5 Double-Pipelined Hash Join, Relationship to Index Nested-Loop Join

When does it make sense to consider using double-pipelined hash join? How does the algorithm work? In what sense is this algorithm symmetric? What is double-pipelined index join? What is the relationship to Index Nested-Loop Join? Does Double-Pipelined Hash Join have a Build and a Probe Phase? What is the difference of DPHJ and INLJ w.r.t. the join results produced over time?


4.2 Implementing Grouping and Aggregation


Would it be fair to say that grouping and aggregation is just like co-grouping, however using a single subset for each co-group? Which phases can be identified in hash-based grouping? What exactly is kept in the hashmap? Which phases can be identified in sort-based grouping? What is "group closing"? How to handle the last group?

Additional Material
Literature

4.3 External Sorting

4.3.1 External Merge Sort

When does it make sense to run external merge sort? What is run generation? How does it work? What is the result of it? What happens during the merge phase, i.e. pass > 0? What happens during a single merge? What is the fan-in F? How could it be defined? Why would it be a bad idea to implement merge sort with a fan-in of F=2? Why would it also be a bad idea to implement merge sort with a fan-in F=m-1 (where m is the number of pages available in main memory)? How are the different runs organized in main memory? What exactly is stored for each run? Which runs are typically merged first? You did not forget that this specific variant is a very simple version of the algorithm, right?
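The two phases, run generation (pass 0) and repeated fan-in-way merging, can be sketched in memory with Python lists standing in for disk pages (names and page-counting are mine; this is exactly the "very simple version" of the algorithm, with quicksort-style run generation):

```python
import heapq

def external_merge_sort(data, memory_pages=3, page_size=2):
    m = memory_pages * page_size            # tuples fitting into main memory
    # pass 0 (run generation): sort memory-sized chunks into initial runs
    runs = [sorted(data[i:i + m]) for i in range(0, len(data), m)]
    fan_in = memory_pages - 1               # one page is reserved for output
    # merge passes: merge up to fan_in runs at a time until one run is left
    while len(runs) > 1:
        runs = [list(heapq.merge(*runs[i:i + fan_in]))
                for i in range(0, len(runs), fan_in)]
    return runs[0]

result = external_merge_sort([5, 3, 8, 1, 9, 2, 7, 4, 6, 0])
```

The fan-in F = m - 1 here is the upper extreme asked about above; a real system picks F between 2 and m - 1 to balance the number of passes against per-run buffer sizes.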

Additional Material
Literature
Video
4.3.2 Replacement Selection

Which overall impact does Replacement Selection typically have when used with External Merge Sort? What is the typical size of a run generated by Replacement Selection compared to Quicksort? What is the purpose of the heap and the list? How are they implemented physically? Which condition must hold at all times for the list and the heap? How does the algorithm start? What is the central idea then? When is an element inserted into the heap? When is it inserted into the list? When does a run end? What can be said about the cache-efficiency of this algorithm?
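The heap-and-list interplay can be sketched in Python (names are mine; "pending" is a plain Python list standing in for the list structure, and the input is a list standing in for the input stream). An element still fitting the current run goes into the heap; a smaller element must wait in the list for the next run:

```python
import heapq

def replacement_selection(stream, memory=3):
    heap = stream[:memory]                  # fill memory with first elements
    heapq.heapify(heap)
    runs, run, pending, pos = [], [], [], memory
    while heap:
        smallest = heapq.heappop(heap)
        run.append(smallest)                # move the minimum to current run
        if pos < len(stream):
            nxt = stream[pos]
            pos += 1
            if nxt >= smallest:
                heapq.heappush(heap, nxt)   # still fits the current run
            else:
                pending.append(nxt)         # has to wait for the next run
        if not heap:                        # current run is complete
            runs.append(run)
            run, heap, pending = [], pending, []
            heapq.heapify(heap)
    return runs

runs = replacement_selection([5, 1, 6, 2, 7, 3, 8, 4], memory=3)
```

In the example the first run holds 6 elements with a memory of only 3, illustrating the expected roughly-double run length compared to quicksort-based run generation.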

4.3.3 Late, Online, and Early Grouping with Aggregation

What is late aggregation? What is the amount of I/O you can expect then? What is online grouping? Can this only be used with grouping? What do you gain in terms of I/O? What is early aggregation? What do you gain in terms of I/O? What are drawbacks of both Online and Early Grouping? What has a huge impact on the actual savings for Early Grouping?


5.1 Overview and Challenges

5.1.1 Query Optimizer Overview

What is the task of the query parser? What is the high-level task of the query optimizer? What is the relationship of the WHAT and the HOW aspects in that process? Why would the query optimizer need statistics about the data? Where does it obtain those statistics? What is query plan interpretation? What is the relationship of query optimization and programming language compilation?

Additional Material
Literature
Video
5.1.2 Challenges in Query Optimization: Rule-Based Optimization

What is the canonical form of a query? When would that be a DAG (directed acyclic graph)? What does rule-based optimization do in principle? What does a single rule do? Why would I break up conjuncts of selections? Why does it make sense to push down selections? When exactly is that possible? Why should I strive to introduce joins whenever possible? And when is that possible anyway? When is it possible to push down projections? What has to be considered when doing this? What does the data layout of the store have to do with pushing down projections? So what are the most important rules?

5.1.3 Challenges in Query Optimization: Join Order, Costs, and Index Access

What is join order? What may be the effect of picking the wrong join order? Does this problem only occur for joins? How do I pick the right access plan? (OK, not at all, the query optimizer does this...) How does the query optimizer decide which access plan to take? What are the possibly different costs of picking a scan, clustered index, unclustered index or covering index? How could I estimate those costs in a disk-based system? Why should I be very careful with unclustered indexes? What may happen if the selectivity estimates of a query are wrong?

Additional Material
Literature
5.1.4 An Overview of Query Optimization in Relational Systems

What is a physical operator? What is the algebraic representation of a query? What is a cost estimation in the context of the query optimizer? What type of search space did System-R use? What must be considered when applying these transformations? What is an interesting order? Do bushy join sequences require materialization of intermediate relations? When would they not?

Material
Literature
Additional Material
Literature

5.2 Cost-based Optimization

5.2.1 Cost-Based Optimization, Plan Enumeration, Search Space, Catalan Numbers, Identical Plans

Why is join order important? Which other binary operators may be affected by this problem? What is the overall idea in cost-based optimization? What is a left-deep tree? What is the number of possible left-deep plans for n input relations? What is a bushy tree? What is the number of possible bushy plans for n input relations? What do the Catalan numbers have to do with this? What is the assumption for the different plans when counting bushy plans?
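The plan-counting formulas can be checked with a few lines of Python (function names are mine): left-deep trees are counted by permutations of the relations, bushy trees by permutations times the Catalan number of binary tree shapes:

```python
from math import comb, factorial

def catalan(n):
    # closed form: C(n) = binomial(2n, n) / (n + 1)
    return comb(2 * n, n) // (n + 1)

def left_deep_plans(n):
    # left-deep trees: one plan per permutation of the n relations
    return factorial(n)

def bushy_plans(n):
    # n! leaf orders times C(n-1) binary tree shapes
    return factorial(n) * catalan(n - 1)
```

Already for a handful of relations the bushy search space explodes (e.g. 120 plans for 4 relations vs 24 left-deep plans), which is why enumerating and pruning, rather than exhaustively costing, becomes necessary.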

Additional Material
Literature
Further Reading
5.2.2 Dynamic Programming: Core Idea, Requirements, Join Graph

What is the join graph? What is the core idea of dynamic programming? What are the two requirements such that dynamic programming makes sense? What is an optimal subplan?

Additional Material
Literature
Further Reading
5.2.3 Dynamic Programming Example without Interesting Orders, Pseudo-Code

How does dynamic programming work in principle when applied to the join enumeration problem? What does the pruning step do? How many different entries does the table have at each iteration step? What does the mergePlan()-method do? How would it merge two subplans?

Additional Material
Literature
Further Reading
5.2.4 Dynamic Programming Optimizations: Interesting Orders, Graph Structure

What is an interesting order? Why do we have to change the simple dynamic programming algorithm to consider interesting orders? When exactly would we change the algorithm? Why would we exploit the join graph in dynamic programming? What effect does this have?

Additional Material
Literature
Further Reading

5.3 Query Execution Models

5.3.1 Query Execution Models, Function Calls vs Pipelining, Pipeline Breakers

How to execute a plan? How to translate a plan to an executable plan? Why not use function libraries? What is the problem with intermediate results? What is the core idea of pipelining? What type of pipelining are we talking about here? What is a pipeline breaker? And why would that matter in query processing? What are the three stages of simple hash join/index nested-loop join? What happens in each stage? What are the three stages of quicksort? What happens in each stage? What are the three stages of external merge sort? What happens in each stage? For all of these algorithms: which of those stages actually block the pipeline? What are the blocking and non-blocking building blocks of query pipelines?

Additional Material
Literature
5.3.2 Implementing Pipelines, Operators, Iterators, ResultSet-style Iteration, Iteration Granularities

How do we implement a pipeline, i.e. how do we get from the high-level idea of pipelining to a concrete implementation of pipelining? What is the core idea of the operator interface? What's next()? What happens in a hasNext()-call in an iterator? What is a chunk? Which special cases of chunks are important? What would be a simple, textbook-style translation of a plan using operators? What is ResultSet-style iteration? Is this restricted to rows or can we use it for columns as well? Would it make sense to iterate over pages?
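Python generators give a compact stand-in for the next()-style operator interface: each operator pulls one tuple at a time from its child, so nothing materializes intermediate results. A minimal sketch (operator names and the example plan are mine):

```python
def scan(table):
    for row in table:
        yield row                        # produce one tuple per next()-call

def select(pred, child):
    for row in child:
        if pred(row):                    # non-blocking: tuples stream through
            yield row

def project(cols, child):
    for row in child:
        yield tuple(row[c] for c in cols)

table = [(1, "Alice", 30), (2, "Bob", 20), (3, "Carol", 25)]
# plan: project name of all rows with age >= 25
plan = project((1,), select(lambda r: r[2] >= 25, scan(table)))
result = list(plan)                      # the root pulls tuples through
```

A blocking operator (e.g. sort) would instead have to consume its entire child before yielding its first tuple, which is exactly what makes it a pipeline breaker.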

Additional Material
Literature
5.3.3 Operator Example Implementations

How do we implement specific operators without unnecessarily breaking the pipeline? Is it hard to implement a projection or selection operator? How do we handle loops in operator implementations? What do we mean by "state" here?

Additional Material
Literature
5.3.4 Query Compilation

What is the performance problem with operators? How is the pipeline organized when compiling code? How do algebraic expressions translate into code? How to translate entire plans into LLVM/C++ code?

Material
Literature
Additional Material
Literature
Code Example
Code Example Slides
5.3.5 Anti-Projection, Tuple Reconstruction, Early and Late Materialization

What is early materialization? What is the relationship to column and vertically partitioned (column grouped) layouts? How would we do early materialization partially? What is an anti-projection? How could we implement early materialization in a column store? How could we implement late materialization in a column store? What is the impact on join processing? What is a join index? What is the impact on query planning?


6.1 Core Concepts

6.1.1 Crash Recovery, Error Scenarios, Recovery in any Software, Impact on ACID

What are possible error scenarios? What could be the impact of a power failure on the database buffer? What does the term "recovery" mean? Is recovery just important for database systems? What is the impact of error scenarios on ACID? What is a local error? How may it occur? What is redo? What is undo? What does losing main memory mean? What if we lose (some) external storage? How do we cope with this?

Additional Material
Literature
6.1.2 Log-Based Recovery, Stable Storage, Write-Ahead Logging (WAL)

What is "stable storage"? What is its relationship to the database store? What is the relationship to logging (as of video 14.163)? What is write-ahead logging (WAL)? What is and what is not allowed following WAL? What does WAL imply when committing transactions? What does WAL imply for the eviction of dirty pages from the database buffer?

Additional Material
Literature
6.1.3 What to log, Physical, Logical, and Physiological Logging, Trade-Offs, Main Memory versus Disk-based Systems

What is physical logging? What are its pros and cons? What is an after image and a before image? And how big may those images become? What is logical logging? How does logical logging differ from physiological logging? What do log entries for the three variants typically look like? What are the performance trade-offs for the different logging variants? In a disk-based system? In a main-memory system? Why do the trade-offs differ so much in the two types of systems? How would you log data in a main-memory system? What is the relationship to dictionary compression?

Additional Material
Literature

6.2 ARIES


What are the three phases of ARIES? What is the purpose of the transaction table (TT)? What is the lastLSN? What is the purpose of the dirty page table (DPT)? What is the recoveryLSN? Which assumptions do we make in ARIES-recovery? What information is contained in a log record? In any log record? In update log records? What is the redo information and undo information? What is the relationship of LSNs and database pages in the store? How is TT maintained during logging? How is DPT maintained during logging? What is the general idea of a compensation log record (CLR)? What is the redoTheUndo information contained in it? Where does undoNextLSN point to? How do we shorten (prune) a log file? How do we shorten the analysis phase? How do we shorten the redo phase? How do we shorten the undo phase? How do we determine firstLSN? Where do we cut off the log? How exactly is a fuzzy checkpoint written to the log file? What are the core tasks of the three phases of ARIES? What does "repeating history" mean in this context?

Material
Literature
Additional Material
Literature
Further Reading

7.1 Introduction to NoSQL


What is a key-value database? What is an aggregate-oriented database? What is a column family? What is the value in a document database? What is the relationship of these aggregates and clustering, i.e. like in clustered indexes? When is an aggregate-oriented database a disadvantage? Are graph databases aggregate-oriented? Are relational databases good at managing relationships? What is BASE? What is the relationship of aggregates and transactions in aggregate-oriented databases? What is an offline lock? What is logical consistency? What is replication consistency? Is handling consistency a business choice? What is a network partition? In case of a network partition, what does this imply for consistency and availability (following Fowler's explanation of CAP)? What is the trade-off between consistency and response time in a distributed system? Who should use NoSQL and what does that have to do with 64K?



7.2 Introduction to Big Data


What is big data? What are applications of big data? What are volume, variety, velocity, and veracity? What do they mean? Is big data always big?



7.3 Introduction and Recap of MapReduce


What are the three different MapReduces? What are the semantics of the map() and reduce()-functions? What are the different types of Hadoop? What is HDFS? How is data stored on HDFS? How is a file partitioned and replicated? To which RAID-level does this correspond? Why would partitioning help for load balancing? What are the three phases of MapReduce? What is a mapper? What is a split? Where is the output of the map()-calls stored? Is the data of all copies mapped in the map phase? What does the shuffle phase do? How could that be described in SQL? On what data is reduce() called? Where is the result of the reduce phase stored?

Additional Material
Literature
Additional Recap Material (from undergrad lecture, in German)
Literature
Video
Slides

