String Processing and Compression

 

Lecture in Winter Term 2015/2016

Contact

Phone

work
+49 241 80 21700

Email

E-Mail
 
 

Content

String data is ubiquitous in many areas of computer science. For example, web applications use encodings as strings to exchange data, and most data in bioinformatics, like DNA sequences, are strings. The lecture presents algorithms and data structures tailored to process string-encoded data. This includes the problem of finding patterns in strings and aligning sequences as well as sorting strings and searching for strings.

Compressing data is important if we want to save resources like memory and bandwidth. The lecture starts to introduce the information-theoretic background of compression techniques. Based on this introduction, the lecture presents lossless compression techniques for strings.

Topics

String-Processing Algorithms

  • Radix Trees
  • Radix Sorts
  • Pattern Matching
  • Approximate Pattern Matching
  • Probabilistic Pattern Matching
  • Suffix Trees
  • Suffix Arrays
  • String Assembly
  • Hard String Problems

Data-Compression Techniques

  • Coding Theory
  • Information Theory
  • Dictionary-Based Compression
  • Context-Based Compression
  • Grammar-Based Compression
 

Organization

Lecture will be held in english.

Lecturer

Michael Elberfeld

 

External Links