Video Language Co-Attention with Multimodal Fast-Learning Feature Fusion for VideoQAPublished in Workshop on Representation Learning for NLP @ ACL, 2022Share on Twitter Facebook LinkedIn Previous Next