Aditya Arora M.Sc.

Multimodal Reliable AI

Working area(s)

Multimodal Generation, Diffusion Models

Contact

Work S4/23 217
Landwehrstr. 50A
64293 Darmstadt

Mar 2025 – ongoing: ELLIS PhD student at the Multimodal AI group (Supervisor: Prof. Dr. Marcus Rohrbach; ELLIS co-supervisor: Dr. Pau Rodriguez, Apple Barcelona)
Jul 2024 – Feb 2025: Research Intern, Snap Inc. (New York City, NY)
May 2023 – Aug 2023: Teaching Assistant, Vector Institute (Toronto, Canada)
Sep 2022 – Sep 2024: MSc in Computer Science, York University, Canada (Supervisors: Prof. Konstantinos G. Derpanis and Michael S. Brown)
Jan 2022 – Aug 2022: Data Scientist, Bayanat for Mapping & Surveying (Abu Dhabi, UAE)
Aug 2018 – Jan 2022: Research Engineer, Inception Institute of Artificial Intelligence (Abu Dhabi, UAE)

My research centers on multimodal generative modeling, especially diffusion models, for coherent, consistent, and controllable text-to-image, image-to-image, and text-to-video/audio systems. I’m interested in representations that align visual, auditory, and linguistic signals and then enforce global coherence: identity and style preservation across shots, stable layouts and scene geometry, long-horizon temporal smoothness, and tight cross-modal synchronization. My goal is to build generative systems that understand and compose across modalities while remaining consistent, so that content creation is reliable and controllable.