The Multimodal Grounded Learning lab was founded in 2023 by and is part of the Prof. Anna Rohrbach at TU Darmstadt. Together with the Department of Computer Science lab, we form the Multimodal Reliable AI Lab. The lab aims to develop multimodal AI models that can communicate with humans and, importantly, are grounded in reality. Multimodal AI
We are interested in a variety of problems, such as image and video description, visual grounding, text-to-image synthesis, multimodal fact-checking, and beyond. To learn about our previous work, please see Prof. Anna Rohrbach’s page. Google Scholar