I am a third-year PhD student at the University of Stuttgart, Germany, advised by Prof. Andreas Bulling. My research focuses broadly on multi-modal deep models at the intersection of computer vision and natural language processing. Specifically, I develop novel AI models geared towards conversational tasks such as visual and video dialog. Furthermore, I research efficient mechanisms for marrying graph neural networks with large multi-modal transformers. Finally, I develop pre-training strategies for multi-modal foundation models aimed at improving downstream visual and video conversational tasks. I hold BSc and MSc degrees from the elite study program Simulation Technology at the University of Stuttgart. The latter was awarded with distinction, and my master’s thesis received the outstanding work award from the Industrial Consortium SimTech (IC SimTech).


  • [20.02.2024] 🔥 One paper got accepted to COLING’24, Turin, Italy 🇮🇹
  • [01.02.2024] 🆕 Started an internship at the Multimodal AI lab, TU Darmstadt with Marcus Rohrbach and Anna Rohrbach
  • [01.11.2023] 📜 Reviewing for ACL Rolling Review’23-24
  • [24.10.2023] 🔥 One paper got accepted to WACV’24, Hawaii, USA 🇺🇸
  • [29.09.2023] 📜 Reviewing for CHI’24
  • [15.04.2023] 📜 Reviewing for ACM MM’23
  • [02.02.2023] 📜 Reviewing for the Journal of Artificial Intelligence’23
  • [16.08.2022] 🔥 One paper got accepted to COLING’22 (oral), Gyeongju, Korea 🇰🇷
  • [27.03.2022] 🔥 One paper got accepted to ACL-W’22, Dublin, Ireland 🇮🇪
  • [01.04.2021] 🥇 My master’s thesis received the outstanding work award from IC SimTech
  • [28.02.2021] 🎓 I defended my MSc thesis with distinction