Contextual AD narration with interleaved multimodal sequence Jan 1, 2025· Hanlin Wang , Zhan Tong , Kecheng Zheng , Yujun Shen Limin Wang · 0 min read Cite URL Type Conference paper Publication Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Last updated on Jan 1, 2025 Authors Limin Wang Nanjing University ← CG-Bench: Clue-grounded Question Answering Benchmark for Long Video Understanding Jan 1, 2025 LeviTor: 3D trajectory oriented image-to-video synthesis Jan 1, 2025 →