基于注意力机制及类别层次结构的弱监督目标定位

doi:10.13328/j.cnki.jos.006675

微信服务号

微信订阅号

2025年4月4日 12:03 星期五

首页 > 过刊浏览>2023年第34卷第10期 >4916-4929. DOI:10.13328/j.cnki.jos.006675

PDF HTML阅读 XML下载导出引用引用提醒

基于注意力机制及类别层次结构的弱监督目标定位
DOI:
                        10.13328/j.cnki.jos.006675
                    
CSTR:
                        
                    
作者:
                        冯迅冯迅
南京理工大学 计算机科学与工程学院 模式计算与应用实验室, 江苏 南京 210094;高维信息智能感知与系统教育部重点实验室 (南京理工大学), 江苏 南京 210094;江苏省社会安全图像与视频理解重点实验室 (南京理工大学), 江苏 南京 210094
在期刊界中查找
在百度中查找
在本站中查找
杨健杨健
南京理工大学 计算机科学与工程学院 模式计算与应用实验室, 江苏 南京 210094;高维信息智能感知与系统教育部重点实验室 (南京理工大学), 江苏 南京 210094;江苏省社会安全图像与视频理解重点实验室 (南京理工大学), 江苏 南京 210094
在期刊界中查找
在百度中查找
在本站中查找
周涛周涛
南京理工大学 计算机科学与工程学院 模式计算与应用实验室, 江苏 南京 210094;高维信息智能感知与系统教育部重点实验室 (南京理工大学), 江苏 南京 210094;江苏省社会安全图像与视频理解重点实验室 (南京理工大学), 江苏 南京 210094
在期刊界中查找
在百度中查找
在本站中查找
宫辰宫辰
南京理工大学 计算机科学与工程学院 模式计算与应用实验室, 江苏 南京 210094;高维信息智能感知与系统教育部重点实验室 (南京理工大学), 江苏 南京 210094;江苏省社会安全图像与视频理解重点实验室 (南京理工大学), 江苏 南京 210094
在期刊界中查找
在百度中查找
在本站中查找

                    
作者单位:
作者简介:冯迅(1993-),男,硕士,主要研究领域为计算机视觉;杨健(1973-),男,博士,教授,博士生导师,CCF高级会员,主要研究领域为模式识别,计算机视觉,机器学习;周涛(1986-),男,博士,教授,CCF专业会员,主要研究领域为机器学习,计算机视觉,医学图像处理与分析;宫辰(1988-),男,博士,教授,博士生导师,CCF专业会员,主要研究领域为机器学习,数据挖掘,计算机视觉.
通讯作者:宫辰,E-mail:chen.gong@njust.edu.cn
中图分类号:TP391
基金项目:国家自然科学基金(61973162,62172228);江苏省自然科学基金(BZ2021013);中央高校基本科研业务费专项资金(30920032202,30921013114)

Weakly Supervised Object Localization Based on Attention Mechanism and Categorical Hierarchy

Author:

FENG Xun
FENG Xun
Pattern Computing and Application Laboratory, School of Computer Science and Engineering, Nanjing University of Science & Technology, Nanjing 210094, China;Key Laboratory of Intelligent Perception and Systems for High-dimensional Information of Ministry of Education (Nanjing University of Science & Technology), Nanjing 210094, China;Key Laboratory of Image and Video Understanding for Social Security of Jiangsu Province (Nanjing University of Science & Technology), Nanjing 210094, China
在期刊界中查找
在百度中查找
在本站中查找
YANG Jian
YANG Jian
Pattern Computing and Application Laboratory, School of Computer Science and Engineering, Nanjing University of Science & Technology, Nanjing 210094, China;Key Laboratory of Intelligent Perception and Systems for High-dimensional Information of Ministry of Education (Nanjing University of Science & Technology), Nanjing 210094, China;Key Laboratory of Image and Video Understanding for Social Security of Jiangsu Province (Nanjing University of Science & Technology), Nanjing 210094, China
在期刊界中查找
在百度中查找
在本站中查找
ZHOU Tao
ZHOU Tao
Pattern Computing and Application Laboratory, School of Computer Science and Engineering, Nanjing University of Science & Technology, Nanjing 210094, China;Key Laboratory of Intelligent Perception and Systems for High-dimensional Information of Ministry of Education (Nanjing University of Science & Technology), Nanjing 210094, China;Key Laboratory of Image and Video Understanding for Social Security of Jiangsu Province (Nanjing University of Science & Technology), Nanjing 210094, China
在期刊界中查找
在百度中查找
在本站中查找
GONG Chen
GONG Chen
Pattern Computing and Application Laboratory, School of Computer Science and Engineering, Nanjing University of Science & Technology, Nanjing 210094, China;Key Laboratory of Intelligent Perception and Systems for High-dimensional Information of Ministry of Education (Nanjing University of Science & Technology), Nanjing 210094, China;Key Laboratory of Image and Video Understanding for Social Security of Jiangsu Province (Nanjing University of Science & Technology), Nanjing 210094, China
在期刊界中查找
在百度中查找
在本站中查找

Affiliation:

Fund Project:

摘要

图/表

访问统计

参考文献

相似文献

引证文献

资源附件

文章评论

摘要:

弱监督目标定位是指仅利用图像级的类别标注信息来训练目标定位器, 而不需要使用精确的目标位置标注信息来进行算法训练. 当前的一些方法往往只能定位出目标对象中最具鉴别性的部分而无法准确地标识出完整的目标对象, 或者易受背景无关信息干扰从而导致定位结果不精确. 为了解决上述问题, 提出一种基于注意力机制和类别层次结构的弱监督目标定位方法. 该方法通过对卷积神经网络的注意力图进行均值分割提取更完整的目标区域. 进一步, 通过类别层次结构网络实现对背景区域注意力的削弱, 从而提高对感兴趣目标的定位精度. 基于多个网络结构和公共数据集上的大量实验结果表明, 相比目前已有的弱监督定位方法, 所提方法在多个评价指标下均能够获得更好的定位效果.

关键词:弱监督目标定位;网络注意力;背景干扰;层次结构网络;卷积神经网络(CNN)

Abstract:

Weakly supervised object localization aims to train target locators only by image-level labels instead of accurate location annotations for algorithm training. Some existing methods can only identify the most discriminative region of the target object and are incapable of covering the complete object, or can easily be misled by irrelevant background information, thereby leading to inaccurate object locations. Therefore, this study proposes a weakly supervised object localization algorithm based on attention mechanism and categorical hierarchy. The proposed method extracts a more complete object area by performing mean segmentation on the attention map of the convolutional neural network. In addition, the category hierarchy network is utilized to weaken the attention caused by background areas, which achieves more accurate object location results. Extensive experimental results on multiple public datasets show that the proposed method can yield better localization effects than other weakly supervised object localization methods under various evaluation metrics.

Key words:weakly supervised object localization;network attention;background interference;hierarchical network;convolutional neural network (CNN)

引用本文

冯迅,杨健,周涛,宫辰.基于注意力机制及类别层次结构的弱监督目标定位.软件学报,2023,34(10):4916-4929

复制

文章指标

点击次数:
下载次数:
HTML阅读次数:
引用次数:

历史

收稿日期:2021-03-22
最后修改日期:2021-10-25
录用日期:
在线发布日期: 2023-04-04
出版日期: 2023-10-06

微信服务号

微信订阅号

引用本文

分享

文章指标

历史

文章二维码

微信服务号

微信订阅号

引用本文

分享

微信扫一扫：分享

文章指标

历史

文章二维码