楼主: Mama-2022
203 0

[其他] 多模态专题文献等/AI 人工智能 [推广有奖]

  • 0关注
  • 14粉丝

已卖:1260份资源

院士

92%

还不是VIP/贵宾

-

威望
0
论坛币
754 个
通用积分
309.1617
学术水平
25 点
热心指数
114 点
信用等级
16 点
经验
67363 点
帖子
2884
精华
0
在线时间
1707 小时
注册时间
2022-5-14
最后登录
2025-11-20

楼主
Mama-2022 发表于 2024-7-20 07:21:55 |AI写论文

+2 论坛币
k人 参与回答

经管之家送您一份

应届毕业生专属福利!

求职就业群
赵安豆老师微信:zhaoandou666

经管之家联合CDA

送您一个全额奖学金名额~ !

感谢您参与论坛问题回答

经管之家送您两个论坛币!

+2 论坛币
Taming Transformers for High-Resolution Image Synthesis.pdf Sequential Modeling Enables Scalable Learning for Large Vision Models.pdf
NExT-GPT.Any-to-Any Multimodal LLM.pdf Visual Instruction Tuning.pdf
PROGRESS MEASURES FOR GROKKING VIA MECHANISTIC INTERPRETABILITY.pdf
MiniGPT-v2 Large Language Model As a Unified Interface for Vision-Language Multi-task Learning.pdf
Swin Transformer Hierarchical Vision Transformer using Shifted Windows.pdf
IMAGEBIND One Embedding Space To Bind Them All.pdf
CoDi-2 In-Context,Interleaved,and Interactive Any-to-Any Generation.pdf
Meta-Transformer.A Unified Framework for Multimodal Learning.pdf
Neural Discrete Representation Learning.pdf
Learning Transferable Visual Models From Natural Language Supervision.pdf
MINIGPT-4 ENHANCING VISION-LANGUAGE UNDERSTANDING WITH ADVANCED LARGE LANGUAGE MODELS.pdf
AN IMAGE IS WORTH 16 16 WORDS TRANSFORMERS FOR IMAGE RECOGNITION AT SCALE.pdf
InstructBLIP Towards General-purpose Vision-Language Models with Instruction Tuning.pdf
BLIP-2 Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models.pdf
Improved Baselines with Visual Instruction Tuning.pdf
阿里巴巴「AI剧组」--大模型驱动的影视短视频智能生产实践.pdf

多模态.part1.rar (98 MB, 需要: RMB 10 元) 多模态.part2.rar (25.73 MB, 需要: RMB 10 元)


二维码

扫码加我 拉你入群

请注明:姓名-公司-职位

以便审核进群资格,未注明则拒绝

关键词:人工智能 多模态 Transformers Presentation Hierarchical

您需要登录后才可以回帖 登录 | 我要注册

本版微信群
jg-xs1
拉您进交流群
GMT+8, 2026-1-7 19:06