面向视频的细粒度多模态实体链接

doi:10.13328/j.cnki.jos.007078

微信小程序

微信服务号

微信订阅号

首页 > 过刊浏览>2024年第35卷第3期 >1140-1153. DOI:10.13328/j.cnki.jos.007078

PDF HTML阅读 XML下载导出引用引用提醒

面向视频的细粒度多模态实体链接
DOI:
                        10.13328/j.cnki.jos.007078
                    
CSTR:
                        32375.14.jos.007078
                    
作者:
                        
                        
                    
作者单位:
作者简介:
通讯作者:
中图分类号:
基金项目:国家重点研发计划(2020AAA0109302);国家自然科学基金(62072323,62102095);上海市科技创新行动计划(22511105902,22511104700);上海市科技重大专项(2021SHZDZX0103);上海市科学技术委员会资助项目(22511105902)

Fine-grained Multimodal Entity Linking for Videos

Author:

Affiliation:

Fund Project:

摘要

图/表

访问统计

参考文献

相似文献

引证文献

资源附件

文章评论

摘要:

随着互联网和大数据的飞速发展,数据规模越来越大,种类也越来越多.视频作为其中重要的一种信息方式,随着近期短视频的发展,占比越来越大.如何对这些大规模视频进行理解分析,成为学界关注的热点.实体链接作为一种背景知识补全方式,可以提供丰富的外部知识.视频上的实体链接可以有效地帮助理解视频内容,从而实现对视频内容的分类、检索、推荐等.但是现有的视频链接数据集和方法的粒度过粗,因此提出面向视频的细粒度实体链接,并立足于直播场景,构建了细粒度视频实体链接数据集.此外,依据细粒度视频链接任务的难点,提出利用大模型抽取视频中的实体及其属性,并利用对比学习得到视频和对应实体的更好表示.实验结果表明,该方法能够有效地处理视频上的细粒度实体链接任务.

Abstract:

With the rapid development of the Internet and big data, the scale and variety of data are increasing. Video, as an important form of information, is becoming increasingly prevalent, particularly with the recent growth of short videos. Understanding and analyzing large-scale videos has become a hot topic of research. Entity linking, as a way of enriching background knowledge, can provide a wealth of external information. Entity linking in videos can effectively assist in understanding the content of video, enabling classification, retrieval, and recommendation of video content. However, the granularity of existing video linking datasets and methods is too coarse. Therefore, this study proposes a video-based fine-grained entity linking approach, focusing on live streaming scenarios, and constructs a fine-grained video entity linking dataset. Additionally, based on the challenges of fine-grained video linking tasks, this study proposes the use of large models to extract entities and their attributes from videos, as well as utilizing contrastive learning to obtain better representations of videos and their corresponding entities. The results demonstrate that the proposed method can effectively handle fine-grained entity linking tasks in videos.

参考文献

相似文献

引证文献

引用本文

赵海全,王续武,李金亮,李直旭,肖仰华.面向视频的细粒度多模态实体链接.软件学报,2024,35(3):1140-1153

复制

文章指标

点击次数:
下载次数:
HTML阅读次数:
引用次数:

历史

收稿日期:2023-07-18
最后修改日期:2023-09-05
录用日期:
在线发布日期: 2023-11-08
出版日期: 2024-03-06

微信小程序

微信服务号

微信订阅号

引用本文

相关视频

分享

文章指标

历史

文章二维码