Task preference optimization: improving multimodal large language models with vision task alignment

January 1, 2025

Ziang Yan, Zhilin Li, Yinan He, Chenting Wang, Kunchang Li, Xinhao Li, Xiangyu Zeng, Zilei Wang, Yali Wang, Yu Qiao



Publication

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition

Last updated on January 1, 2025
