Lexical-Semantic Methods for Language Understanding

Lexical-Semantic Methods for Language Understanding

Organization

  • Lecture: Thursday 09:50 – 11:30, Room S202/C120
  • Practice class: Thursday from 11:40 – 13:20, Room S101 A02

The learning material is available from the Moodle eLeaning platform.

The required passcode will be distributed during the lecture.

The first lecture will be on April 12 and the first meeting of the practice class is scheduled for April 19, 2018.

Teaching Staff

  • Prof. Dr. Iryna Gurevych
  • Dr. Thomas Arnold (no fixed office hour, please just ask for an appointment per mail)
  • Jan-Christoph Klie (no fixed office hour, please just ask for an appointment per mail)

Please contact Thomas Arnold or Jan-Christoph Klie for any organizational issues.

Course Content

Algorithms for analyzing the meaning of words in documents are of crucial importance for a wide range of applications including information retrieval, automatic summarization, or automatic keyphrase extraction. The lecture puts a special focus on the semantic resources used to provide knowledge for these algorithms. Besides classical semantic wordnets like Princeton WordNet, the lecture also introduces Web 2.0 resources like Wikipedia and Wiktionary, Knowledge bases and emerging vectorial resources based on neural language models, such as word embeddings.

  • Introduction to Natural Language Processing and Lexical-Semantics
  • Lexical-Semantic Resources: WordNet, Wikipedia, Wiktionary
  • Standardizing resources
  • Lexical-semantic methods: Text Similarity,Word Sense Disambiguation
  • Vector Space Models of Word Meaning
  • Word and Sense Embeddings
  • Applications of Embeddings
  • Introduction to knowledge bases (Wikidata, Freebased, DBPedia)
  • Automatic Knowledge Base Construction
  • Knowledge Base Embeddings

Core Literature

  • Daniel Jurafsky, James H. Martin. Speech and Language Processing (2nd Edition). Prentice Hall. ISBN-13: 978-0131873216
  • Ruslan Mitkov (Ed.). The Oxford Handbook of Computational Linguistics. Oxford University Press. 2005. ISBN-13: 978-0199276349

Other Relevant Literature

  • Kai-Uwe Carstensen, Christian Ebert, Cornelia Endriss, Susanne Jekat, Ralf Klabunde, Computerlinguistik und Sprachtechnologie. Eine Einführung, Heidelberg: Spektrum-Verlag, März 2004. (2. Auflage) ISBN 3827414075
  • D.A. Cruse. Lexical Semantics. Cambridge University Press, 1986. ISBN 0-521-27643-8