Multimodal Grounded Learning

The Multimodal Grounded Learning lab was founded in 2023 by Prof. Anna Rohrbach and is part of the Department of Computer Science at TU Darmstadt. Together with the Multimodal Reliable AI lab, we form the Multimodal AI Lab. The lab aims to develop multimodal AI models that can communicate with humans and, importantly, are grounded in reality.

We are interested in a variety of problems, including image and video description, visual grounding, text-to-image synthesis, and multimodal fact-checking. To learn about our previous work, please see Prof. Anna Rohrbach’s Google Scholar page.