Information Management (IM)

In this lecture, we discuss the foundations of information management. In the first part of the course, we focus on relational databases. In the second part, we provide an introduction to automatic language- and knowledge processing with a particular focus on text analytics.

Organization

Last offered Summer Semester 2024
Lecturer Prof. Dr. Carsten Binnig (DB part)
Prof. Dr. Iryna Gurevych / Dr. Thomas Arnold (NL part)
Assistants Johannes Wehrstein, Roman Heinrich
Exam Written exam, 90min
See TuCAN link above for additional information (e.g., rooms & appointments)
  • Foundations of information management
    • Characteristics of information systems
    • Information storage and retrieval
    • Declarative and navigational access
  • Database systems and architectures
    • Data redundancy and data integration
    • Data independence
    • Database architecture models
  • Conceptual data modeling
    • Foundations of knowledge representation
    • Entity relationship model
    • UML structure diagrams
  • Logical data modeling
    • Relational model
    • Relational algebra
    • Relational calculus
  • Relational database design
    • Database anomalies
    • Normalization
  • Query languages
    • SQL
    • Query-by-example
  • Transactions and multi-user environment
    • Serialization
    • Locks, 2PL
    • ACID criteria, isolation levels
  • Index structures and performance
  • Recent database concepts
    • In-memory databases
    • NoSQL
  • Foundations of language and knowledge processing
  • Text as a knowledge resource
    • Text formats and databases
    • XML
  • Pattern search
  • Linguistic preprocessing
  • Text corpora
  • Lexical resources and ontologies
  • Information retrieval
    • Vector space models
  • Information extraction
    • Named entity recognition
  • Fundamentals of machine learning

Recommended literature include:

  • Gunter Saake, Kai-Uwe Sattler, Andreas Heuer: Datenbanken: Konzepte und Sprachen. 4. Auflage, Heidelberg: mitp, 2010. ISBN: 978-38266-9057-0 Titel anhand dieser ISBN in Citavi-Projekt übernehmen.
  • Ramez Elmasri, Shamkant B. Navathe: Fundamentals of Database Systems. 6th edition. Boston et al.: Addison-Wesley, 2011. ISBN: 978-0-136-08620-8 Titel anhand dieser ISBN in Citavi-Projekt übernehmen.
  • Hector Garcia-Molina, Jeffrey D. Ullman, Jennifer Widom: Database Systems: The Complete Book. 2nd edition. Upper Saddle River, NJ: Pearson Prentice Hall, 2009. ISBN: 978-0-131-87325-4 Titel anhand dieser ISBN in Citavi-Projekt übernehmen.
  • Gottfried Vossen: Datenmodelle, Datenbanksprachen und Datenbankmanagementsysteme. 5. Auflage. München: Oldenbourg, 2008. ISBN 978-3-486-27574-2 Titel anhand dieser ISBN in Citavi-Projekt übernehmen.
  • Daniel Jurafsky, James H. Martin: Speech and Language Processing. An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition. Second Edition. Upper Saddle River: Prentice Hall, 2009. ISBN: 978-0-13-187321-6 Titel anhand dieser ISBN in Citavi-Projekt übernehmen.
  • Steven Bird, Ewan Klein, Edward Loper: Natural Language Processing with Python, O’Reilly, 2009. ISBN: 978-0-596-51649-9 Titel anhand dieser ISBN in Citavi-Projekt übernehmen.
  • Kai-Uwe Carstensen, Christian Ebert, Cornelia Endriss, Susanne Jekat, Ralf Klabunde: Computerlinguistik und Sprachtechnologie. Eine Einführung. 3. Auflage. Heidelberg: Spektrum, 2009. ISBN: 978-3-8274-20123-7.
  • Ruslan Mitkov (Ed.): The Oxford Handbook of Computational Linguistics. Oxford: Oxford University Press. 2005. ISBN: 978-0-19-927634-9