多模态 视频
输入“/”快速插入内容
多模态 视频
用户8190
用户8190
2025年3月13日修改
https://zhuanlan.zhihu.com/p/701193435?utm_psn=1780927156282458112
https://zhuanlan.zhihu.com/p/699982733?utm_psn=1778360989782077440
https://zhuanlan.zhihu.com/p/699650216?utm_psn=1778359132900741120
Chameleon探讨多模态视觉语言模型的一些有趣结论
https://mp.weixin.qq.com/s/zouNu-g-33_7JoX3Uscxtw
标题:A Survey on Multimodal Large Language Models
作者:Shukang Yin, Chaoyou Fu, Sirui Zhao, Ke Li, Xing Sun, Tong Xu, Enhong Chen
单位:Department of Data Science, University of Science and Technology of China; Tencent YouTu Lab
标签:#多模态大型语言模型 #人工智能 #机器学习 #自然语言处理
链接:
https://arxiv.org/abs/2306.13549v2
https://mp.weixin.qq.com/s/Aja6HKwWszBuagYIYMm4bw
https://arxiv.org/abs/2405.09818
https://github.com/junkunyuan/Awesome-Large-Multimodal-Models
https://arxiv.org/abs/2307.10802
https://github.com/BradyFU/Awesome-Multimodal-Large-Language-Models
https://github.com/pliang279/awesome-multimodal-ml
https://arxiv.org/abs/2404.18930
https://github.com/junkunyuan/Awesome-Large-Multimodal-Models