Multimodal-guided Local Feature Selection for Few-shot Learning

About the authors:

Lü Tiangen (吕天根, 1997-), male, master's student, CCF student member; his research interest is few-shot learning. Hong Richang (洪日昌, 1981-), male, Ph.D., professor, doctoral supervisor, CCF professional member; his research interests include multimedia technology, artificial intelligence, and big data. He Jun (何军, 1992-), male, Ph.D.; his research interests include pattern recognition, few-shot learning, and weakly supervised learning. Hu Shejiao (胡社教, 1964-), male, Ph.D., professor; his research interests include intelligent detection and signal processing, intelligent distribution transformer terminal systems, and embedded control systems.

Corresponding author:

Hong Richang (洪日昌), hongrc.hfut@gmail.com

Funding:

National Natural Science Foundation of China (61932009)



    Abstract:

    Deep learning models have yielded impressive results in many tasks, but their success hinges on the availability of large numbers of labeled training samples, and they tend to perform poorly in scenarios where labeled samples are scarce. In recent years, few-shot learning (FSL), which studies how to learn quickly from a small number of samples, has been proposed to address this problem and has achieved good performance, mainly by training models with meta-learning. Nevertheless, two issues remain: 1) existing FSL methods usually recognize novel classes solely from the visual features of samples, without integrating information from other modalities; 2) under the meta-learning paradigm, a model learns generic, transferable knowledge from massive similar few-shot tasks, which inevitably drives its feature space toward over-generalization and leads to insufficient and inaccurate representations of sample features. To tackle these two issues, this study introduces pre-training and multimodal learning techniques into the FSL process and proposes a multimodal-guided local feature selection method for few-shot learning. Specifically, the model is first pre-trained on known classes with abundant samples to substantially improve its feature representation ability. In the meta-learning stage, the pre-trained model is further optimized by meta-learning to improve its transferability, that is, its adaptability to the few-shot environment; meanwhile, local feature selection is carried out on the basis of both the visual and the textual features of samples to strengthen the representation of sample features and avoid a sharp degradation of the model's representation ability. Finally, the resulting sample features are used for FSL. Experiments on three benchmark datasets, namely MiniImageNet, CIFAR-FS, and FC-100, demonstrate that the proposed FSL method achieves better results.
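    The abstract describes the method only at a high level. To make the local feature selection idea concrete, below is a minimal PyTorch sketch under a prototypical-network-style setup; the function names, tensor shapes, cosine-similarity scoring, and top-k selection rule are all illustrative assumptions, not the paper's actual implementation.

        # Hypothetical sketch of multimodal-guided local feature selection.
        # Not the authors' code; shapes and the selection rule are assumptions.
        import torch
        import torch.nn.functional as F

        def select_local_features(local_feats, text_emb, k=32):
            """Keep the k local descriptors most relevant to the class text embedding.

            local_feats: (N, C) local visual descriptors, e.g., a flattened CNN feature map
            text_emb:    (C,)   class-name embedding projected into the visual feature space
            """
            # Score every local descriptor by cosine similarity to the text embedding.
            scores = F.cosine_similarity(local_feats, text_emb.unsqueeze(0), dim=1)  # (N,)
            top_idx = scores.topk(k).indices
            return local_feats[top_idx]  # (k, C)

        def class_prototype(support_local_feats, text_emb, k=32):
            """Average the selected locals of all support images into a class prototype."""
            selected = [select_local_features(f, text_emb, k) for f in support_local_feats]
            return torch.cat(selected, dim=0).mean(dim=0)  # (C,)

    In such a setup, a query image would be classified by comparing its (similarly selected) features with each class prototype, and the pre-trained backbone producing local_feats would be further refined by meta-learning over episodes.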

Cite this article:

Lü TG, Hong RC, He J, Hu SJ. Multimodal-guided local feature selection for few-shot learning. Ruan Jian Xue Bao/Journal of Software, 2023, 34(5): 2068–2082 (in Chinese).

History
  • Received: 2022-04-18
  • Last revised: 2022-05-29
  • Published online: 2022-09-20
  • Published: 2023-05-06