V2Dial: Unification of Video and Visual Dialog via Multimodal Experts

Published in the Computer Vision and Pattern Recognition Conference (CVPR), 2025