多模态信息抽取研究综述

doi:10.13328/j.cnki.jos.007245

微信服务号

微信订阅号

2025年6月30日 0:58 星期一

首页 > 过刊浏览>2025年第36卷第4期 >1665-1691. DOI:10.13328/j.cnki.jos.007245

PDF HTML阅读 XML下载导出引用引用提醒

多模态信息抽取研究综述
DOI:
                        10.13328/j.cnki.jos.007245
                    
CSTR:
                        32375.14.jos.007245
                    
作者:
                        王永胜王永胜
苏州大学 计算机科学与技术学院, 江苏 苏州 215006
在期刊界中查找
在百度中查找
在本站中查找
李培峰李培峰
苏州大学 计算机科学与技术学院, 江苏 苏州 215006
在期刊界中查找
在百度中查找
在本站中查找
王中卿王中卿
苏州大学 计算机科学与技术学院, 江苏 苏州 215006
在期刊界中查找
在百度中查找
在本站中查找
朱巧明朱巧明
苏州大学 计算机科学与技术学院, 江苏 苏州 215006
在期刊界中查找
在百度中查找
在本站中查找

                    
作者单位:
作者简介:
通讯作者:
中图分类号:
基金项目:国家自然科学基金(62276177, 61836007); 江苏高校优势学科建设工程项目

Survey on Multimodal Information Extraction Research

Author:

WANG Yong-Sheng
WANG Yong-Sheng
School of Computer Science and Technology, Soochow University, Suzhou 215006, China
在期刊界中查找
在百度中查找
在本站中查找
LI Pei-Feng
LI Pei-Feng
School of Computer Science and Technology, Soochow University, Suzhou 215006, China
在期刊界中查找
在百度中查找
在本站中查找
WANG Zhong-Qing
WANG Zhong-Qing
School of Computer Science and Technology, Soochow University, Suzhou 215006, China
在期刊界中查找
在百度中查找
在本站中查找
ZHU Qiao-Ming
ZHU Qiao-Ming
School of Computer Science and Technology, Soochow University, Suzhou 215006, China
在期刊界中查找
在百度中查找
在本站中查找

Affiliation:

Fund Project:

摘要

图/表

访问统计

参考文献

相似文献

引证文献

资源附件

文章评论

摘要:

多模态信息抽取任务是指从非结构化或半结构化的多模态数据(包含文本和图像等)中提取结构化知识. 其研究内容主要包含多模态命名实体识别、多模态实体关系抽取和多模态事件抽取. 首先对多模态信息抽取任务进行分析, 然后对多模态命名实体识别、多模态实体关系抽取和多模态事件抽取这3个子任务的共同部分, 即多模态表示和融合模块进行归纳和总结. 随后梳理上述3个子任务的常用数据集和主流研究方法. 最后总结多模态信息抽取的研究趋势并分析该研究存在的问题和挑战, 为后续相关研究提供参考.

关键词:多模态信息抽取;多模态命名实体识别;多模态实体关系抽取

Abstract:

Multimodal information extraction is a task to extract structured knowledge from unstructured or semi-structured multimodal data (such as text and images). It includes multimodal named entity recognition, multimodal relation extraction, and multimodal event extraction. This study analyzes multimodal information extraction tasks and summarizes the common part of the above three subtasks, i.e., a multimodal representation and fusion module. Moreover, it sorts out the commonly used datasets and mainstream research methods of the above three subtasks. Finally, it outlines research trends in multimodal information extraction and analyzes the existing problems and challenges in this field to provide a reference for future research.

Key words:multimodal information extraction (MIE);multimodal named entity recognition (MNER);multimodal entity relation extraction (MERE)

引用本文

王永胜,李培峰,王中卿,朱巧明.多模态信息抽取研究综述.软件学报,2025,36(4):1665-1691

复制

文章指标

点击次数:
下载次数:
HTML阅读次数:
引用次数:

历史

收稿日期:2023-09-13
最后修改日期:2024-02-25
录用日期:
在线发布日期: 2024-12-09
出版日期:

微信服务号

微信订阅号

引用本文

相关视频

分享

文章指标

历史

文章二维码

微信服务号

微信订阅号

引用本文

相关视频

分享

微信扫一扫：分享

文章指标

历史

文章二维码