面向多模态模型训练的高效样本检索技术

doi:10.13328/j.cnki.jos.007073

微信服务号

微信订阅号

首页 > 过刊浏览>2024年第35卷第3期 >1125-1139. DOI:10.13328/j.cnki.jos.007073

PDF HTML阅读 XML下载导出引用引用提醒

面向多模态模型训练的高效样本检索技术
DOI:
                        10.13328/j.cnki.jos.007073
                    
作者:
                        
                        
                    
作者单位:
作者简介:
通讯作者:
中图分类号:
基金项目:国家重点研发计划(2022YFB3304100)

Efficient Sample Retrieval Techniques for Multimodal Model Training

Author:

Affiliation:

Fund Project:

摘要

图/表

访问统计

参考文献

相似文献

引证文献

资源附件

文章评论

摘要:

深度学习中,多模态模型的训练通常需要大量高质量不同类型的标注数据,如图像、文本、音频等.然而,获取大规模的多模态标注数据是一项具有挑战性和昂贵的任务.为了解决这一问题,主动学习作为一种有效的学习范式被广泛应用,能够通过有针对性地选择最有信息价值的样本进行标注,从而降低标注成本并提高模型性能.现有的主动学习方法往往面临着低效的数据扫描和数据位置调整问题,当索引需要进行大范围的更新时,会带来巨大的维护代价.为解决这些问题,提出了一种面向多模态模型训练的高效样本检索技术So-CBI.该方法通过感知模型训练类间边界点,精确评估样本对模型的价值;设计了半有序的高效样本索引,通过结合数据排序信息和部分有序性,降低了索引维护代价和时间开销.在多组多模态数据集上通过与传统主动学习训练方法实验对比,验证了So-CBI方法在主动学习下的训练样本检索问题上的有效性.

Abstract:

Training multimodal models in deep learning often requires a large amount of high-quality annotated data from diverse modalities such as images, text, and audio. However, acquiring such data in large quantities can be challenging and costly. Active learning has emerged as a powerful paradigm to address this issue by selectively annotating the most informative samples, thereby reducing annotation costs and improving model performance. However, existing active learning methods encounter limitations in terms of inefficient data scanning and costly maintenance when dealing with large-scale updates. To overcome these challenges, this study proposes a novel approach called So-CBI (semi-ordered class boundary index) that efficiently retrieves samples for multimodal model training. So-CBI incorporates inter-class boundary perception and a semi-ordered indexing structure to minimize maintenance costs and enhance retrieval efficiency. Experimental evaluations on various datasets demonstrate the effectiveness of So-CBI in the context of active learning.

参考文献

相似文献

引证文献

引用本文

唐秀,伍赛,侯捷,陈刚.面向多模态模型训练的高效样本检索技术.软件学报,2024,35(3):1125-1139

复制

文章指标

点击次数:
下载次数:
HTML阅读次数:
引用次数:

历史

收稿日期:2023-07-17
最后修改日期:2023-09-05
录用日期:
在线发布日期: 2023-11-08
出版日期: 2024-03-06

微信服务号

微信订阅号

引用本文

分享

文章指标

历史

文章二维码