Text Analytics

Text Analytics: NLP for Document Processing

This seminar covers recent research on NLP methods for document processing.


All information will be distributed via the Moodle eLearning platform.

The first sessions will consist of introductory lectures covering the basics of the machine learning methods used for NLP tasks. The program for the remainder of the seminar will be determined by the number of participants.

Teaching Staff

  • Mert Tiftikci
  • Prof. Dr. Iryna Gurevych

Course content

Text Analytics: NLP for Code Intelligence

Natural language processing (NLP) has made considerable progress in the past few years, especially with transformer-based machine learning models. These advances have driven improvements across various real-world applications, such as question answering, summarization, code generation (e.g., AlphaCode), chatbots (e.g., Amazon Alexa), drug discovery, and image generation from natural language (NL) descriptions, and vice versa.

In this seminar, we will explore the latest research in NLP for source code, also known as Code Intelligence (CI), with a specific focus on transformers and structure injection.

Code Intelligence is an emerging field that applies NLP and machine learning to automatically analyze software source code and provide intelligent assistance to programmers.

We will review existing methods and benchmarks, and investigate downstream applications such as code generation, clone detection, code translation, and code documentation generation.


Will be announced during the seminar.


When you should send me an office-hour request: 2 weeks before your presentation (if you present in the first week, you can send it 1 week before).

What you should tell me in your e-mail: (1) your preferred half-hour time slot, if you have a preference; (2) your name and your paper.

When you should send me your presentation draft: as early as possible, and no later than 3 days before our meeting.