Algorithmic Foundations of Data Science

 

Lecture in the summer term 2021

 
 
 
 

Content

In the age of "big data" and "advanced analytics", data processing faces new challenges. Queries become more complex and often involve data mining and machine learning tasks, and the scale of the datasets requires new algorithmic approaches.

This course will cover the "theoretical foundations" of modern data processing and analytics. This includes topics from database theory, such as data models, the analysis of query languages, and basic algorithmic and complexity-theoretic questions related to query processing. It also includes topics from algorithmic learning theory, such as basic machine learning algorithms, support vector machines, the PAC model, and the VC dimension. Furthermore, it includes new models of computation on massive datasets, such as the streaming model and the MapReduce paradigm, and algorithms for these models.

We will focus on "computational aspects" of the theory. Statistics, though undoubtedly one of the foundations of data science, will not play a central role in this course.

Prerequisites

This lecture can be taken only as a master's course.

No prerequisites are required.

 

Organization

All information about this lecture can be found in RWTHmoodle.

The course will be held in English.

Lecturer

Martin Grohe

 
