Contextual AD narration with interleaved multimodal sequence

Jan 1, 2025

Hanlin Wang, Zhan Tong, Kecheng Zheng, Yujun Shen, Limin Wang

Type

Conference paper

Publication

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition

Last updated on Jan 1, 2025

Limin Wang

Authors

Nanjing University
