CORC

浏览/检索结果: 共144条,第1-10条 帮助

已选(0)清除 条数/页:   排序方式:
Scene text recognition via dual character counting-aware visual and semantic modeling network 期刊论文
SCIENCE CHINA-INFORMATION SCIENCES, 2024, 卷号: 67, 期号: 3, 页码: 2
作者:  Xiao, Ke;  Zhu, Anna;  Iwana, Brian Kenji;  Liu, Cheng-Lin
收藏  |  浏览/下载:0/0  |  提交时间:2024/03/13
VLP2MSA: Expanding vision-language pre-training to multimodal sentiment analysis 期刊论文
KNOWLEDGE-BASED SYSTEMS, 2024, 卷号: 283, 页码: 9
作者:  Yi, Guofeng;  Fan, Cunhang;  Zhu, Kang;  Lv, Zhao;  Liang, Shan
收藏  |  浏览/下载:0/0  |  提交时间:2024/02/22
AnyFace++: A Unified Framework for Free-style Text-to-Face Synthesis and Manipulation 期刊论文
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024, 页码: 1-15
作者:  Sun, Jianxin;  Deng, Qiyao;  Li, Qi;  Sun, Muyi;  Liu, Yunfan
收藏  |  浏览/下载:0/0  |  提交时间:2024/02/23
Large sequence models for sequential decision-making: a survey 期刊论文
FRONTIERS OF COMPUTER SCIENCE, 2023, 卷号: 17, 期号: 6, 页码: 18
作者:  Wen, Muning;  Lin, Runji;  Wang, Hanjing;  Yang, Yaodong;  Wen, Ying
收藏  |  浏览/下载:5/0  |  提交时间:2023/11/17
Multi-modal spatial relational attention networks for visual question answering 期刊论文
IMAGE AND VISION COMPUTING, 2023, 卷号: 140, 页码: 13
作者:  Yao, Haibo;  Wang, Lipeng;  Cai, Chengtao;  Sun, Yuxin;  Zhang, Zhi
收藏  |  浏览/下载:1/0  |  提交时间:2024/02/22
So Many Heads, So Many Wits: Multimodal Graph Reasoning for Text-Based Visual Question Answering 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 页码: 12
作者:  Zheng, Wenbo;  Yan, Lan;  Wang, Fei-Yue
收藏  |  浏览/下载:1/0  |  提交时间:2023/12/21
GAIA-Universe: Everything is Super-Netify 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 卷号: 45, 期号: 10, 页码: 11856-11868
作者:  Peng, Junran;  Chang, Qing;  Yin, Haoran;  Bu, Xingyuan;  Sun, Jiajun
收藏  |  浏览/下载:2/0  |  提交时间:2023/11/16
Knowledge-Embedded Mutual Guidance for Visual Reasoning 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2023, 页码: 13
作者:  Zheng, Wenbo;  Yan, Lan;  Chen, Long;  Li, Qiang;  Wang, Fei-Yue
收藏  |  浏览/下载:3/0  |  提交时间:2023/11/16
Understanding and Mitigating Overfitting in Prompt Tuning for Vision-Language Models 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 卷号: 33, 期号: 9, 页码: 4616-4629
作者:  Ma, Chengcheng;  Liu, Yang;  Deng, Jiankang;  Xie, Lingxi;  Dong, Weiming
收藏  |  浏览/下载:2/0  |  提交时间:2023/11/16
Siamese Network-based Framework for Open-set Domain Generalization 会议论文
北京, 2023-5
作者:  Geng Liu
收藏  |  浏览/下载:4/0  |  提交时间:2023/07/04


©版权所有 ©2017 CSpace - Powered by CSpace