We are seeking a highly motivated PhD student/researcher to join our team at TU Darmstadt, Germany, working on efficient multimodal (vision and language) models. The goal of this research is to develop multimodal models that achieve the best possible trade-off between computational budget and performance.
The position starts as soon as possible.
Responsibilities
- Conduct research on efficient multimodal AI models, including developing novel architectures, training methods, and optimization techniques.
- Collaborate with an industry partner to ensure that the research has a real-world impact.
- Publish high-quality research papers in top venues such as CVPR, NeurIPS, and ICCV.
- Present research findings at conferences and workshops.
Qualifications
- MSc in Computer Science or a related field.
- Research experience in efficient deep models and/or multimodal AI.
- Some prior publications, ideally in venues such as CVPR, NeurIPS, ICCV, ICLR, ACL, and AAAI.
- Excellent communication and interpersonal skills.