Object Detection Model for Examination Classroom Based on Cascade Attention and Point Supervision Mechanism

doi:10.13328/j.cnki.jos.006289

微信服务号

微信订阅号

2025-4-6- 10

Home > Archive>Volume 33, Issue 7, 2022 >2633-2645. DOI:10.13328/j.cnki.jos.006289

PDF HTML XML Export Cite reminder

Object Detection Model for Examination Classroom Based on Cascade Attention and Point Supervision Mechanism
DOI:
                        10.13328/j.cnki.jos.006289
                    
Author:
                        TIAN Zhuo-YuTIAN Zhuo-Yu
School of Computer Science, Shaanxi Normal University, Xi’an 710119, China
Find this author on CNKI
Find this author on BaiDu
Search for this author on this site
MA MiaoMA Miao
School of Computer Science, Shaanxi Normal University, Xi’an 710119, China;Key Laboratory of Modern Teaching Technology of Ministry of Education (Shaanxi Normal University), Xi’an 710062, China;National Engineering Laboratory for Integrated Aero-Space-Ground-Ocean Big Data Application Technology, Xi’an 710129, China
Find this author on CNKI
Find this author on BaiDu
Search for this author on this site
YANG Kai-FangYANG Kai-Fang
School of Computer Science, Shaanxi Normal University, Xi’an 710119, China
Find this author on CNKI
Find this author on BaiDu
Search for this author on this site

                    
Affiliation:
Clc Number:TP391
Fund Project:

Article

Figures

Metrics

Reference [34]

Related [20]

Cited by

Materials

Comments

Abstract:

Smart examination classroom is an important part of smart campus, and accurately and quickly detecting students in the examination classroom is a basic task of building a smart classroom. However, due to the dense distribution and imaging difference of the examinees in an examination classroom, most of the existing object detection methods can not precisely detect all the examinees in real-time. Moreover, most of the object detection methods rely on predefined anchor boxes, which are lack of portability. Aiming at the above problems, this study proposes an efficient one-stage object detection model based on fully convolutional network, which is anchor-free, with a prediction on the input image in pixel-level. In this model, a feature enhancement module is firstly designed based on cascade attention, which can effectively enhance the discriminability of the feature map by gradually refining and modifying the features. Secondly, in order to enable the network to distinguish overlapping objects in the examination classroom, a point supervision mechanism is proposed. Finally, this study verifies the above model on the special dataset of standardized examination classroom. With the cascade attention module and point supervision mechanism, the proposed model achieves 92.9% in mAP at the speed of 22.1 f/s, and is superior to most the state-of-the-art detection models. Especially, for object detection in new classroom environments, the proposed model achieves the best results.

Key words:object detection;smart examination classroom;anchor-free method;attention mechanism;point supervision mechanism

Reference

[1] Yong L, Dongjian H. Video-based detection of abnormal behavior in the examination room. In:Proc. of the 2010 IEEE Int'l Forum on Information Technology and Applications. 2010. 295-298.

[2] Zhang YX, Ma XC, Yang JB, Xu XN. The examinee's abnormal behavior detection and recognition in video based on Kalman filter. Journal of Qiqihar University, 2017, 33(6):16-19(in Chinese with English abstract).

[3] Krizhevsky A, Sutskever I, Hinton G. ImageNet classification with deep convolutional neural networks. In:Proc. of the Int'l Conf. on Neural Information Processing Systems. 2012. 1097-1105.

[4] Deng J, Dong W, Socher R, Li L, Li K, Li FF. ImageNet:A large-scale hierarchical image database. In:Proc. of the IEEE Conf. on Computer Vision and Pattern Recognition. 2009. 248-255.

[5] Girshick R, Donahue J, Darrell T, Malik J. Rich feature hierarchies for accurate object detection and semantic segmentation. In:Proc. of the IEEE Conf. on Computer Vision and Pattern Recognition. 2014. 580-587.

[6] Girshick R. Fast R-CNN. In:Proc. of the IEEE Int'l Conf. on Computer Vision. 2015. 1440-1448.

[7] Ren S, He K, Girshick R, Sun J. Faster R-CNN:Towards real-time object detection with region proposal networks. IEEE Trans. on Pattern Analysis and Machine Intelligence, 2017, 39(6):1137-1149.

[8] Redmon J, Divvala S, Girshick R, Farhadi A. You only look once:Unified, real-time object detection. In:Proc. of the IEEE Conf. on Computer Vision and Pattern Recognition. 2016. 779-788.

[9] Redmon J, Farhadi A. Yolo9000:Better, faster, stronger. In:Proc. of the IEEE Conf. on Computer Vision and Pattern Recognition. 2017. 7263-7271.

[10] Redmon J, Farhadi A. Yolov3:An incremental improvement. arXiv:1804.02767, 2018.

[11] Liu W, Anguelov D, Erhan D, Szegedy C, Reed S, Fu CY, Berg AC. SSD:Single shot multiBox detector. In:Proc. of the European Conf. on Computer Vision. 2016. 21-37.

[12] Lin TY, Goyal P, Girshick R, He K, Dollár P. Focal loss for dense object detection. In:Proc. of the IEEE Int'l Conf. on Computer Vision. 2017. 2980-2988.

[13] Law H, Deng J. Cornernet:Detecting objects as paired keypoints. In:Proc. of the European Conf. on Computer Vision. 2018. 734-750.

[14] Zhou X, Wang D, Krhenbühl P. Objects as points. arXiv:1904.07850, 2019.

[15] Zhou X, Zhuo J, Krhenbühl P. Bottom-up object detection by grouping extreme and center points. In:Proc. of the IEEE Conf. on Computer Vision and Pattern Recognition. 2019. 850-859.

[16] Huang L, Yang Y, Deng Y, Yu Y. Densebox:Unifying landmark localization with end to end object detection. arXiv:1509.04874, 2015.

[17] Kong T, Sun F, Liu H, Jiang Y, Shi J. Foveabox:Beyond anchor-based object detector. arXiv:1904.03797, 2019.

[18] Tian Z, Shen C, Chen H, He T. FCOS:Fully convolutional one-stage object detection. In:Proc. of the IEEE Int'l Conf. on Computer Vision. 2019. 9627-9636.

[19] He K, Zhang X, Ren S, Sun J. Deep residual learning for image recognition. In:Proc. of the IEEE Conf. on Computer Vision and Pattern Recognition. 2016. 770-778.

[20] Larochelle H, Hinton GE. Learning to combine foveal glimpses with a third-order Boltzmann machine. In:Proc. of the Int'l Conf. on Neural Information Processing Systems. 2010. 1243-1251.

[21] Hu J, Shen L, Albanie S, Sun G, Wu E. Squeeze-and-excitation networks. In:Proc. of the IEEE Conf. on Computer Vision and Pattern Recognition. 2018. 7132-7141.

[22] Woo S, Park J, Lee JY, Kweon IS. CBAM:Convolutional block attention module. In:Proc. of the European Conf. on Computer Vision. 2018. 3-19.

[23] Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN, Kaiser L, Polosukhin I. Attention is all you need. In:Proc. of the Int'l Conf. on Neural Information Processing Systems. 2017. 5998-6008.

[24] Yu J, Jiang Y, Wang Z, Cao Z, Huang T. Unitbox:An advanced object detection network. In:Proc. of the ACM Int'l Conf. on Multimedia. 2016. 516-520.

[25] Tao LL. Research on the detection method of students in examination classroom based on SSD[MS. Thesis]. Xi'an:Shaanxi Normal University, 2020(in Chinese with English abstract).

[26] Tzutalin. LabelImg. Git code. 2015. https://github.com/tzutalin/labelImg

[27] Everingham M, Gool LV, Williams CKI, Winn J, Zisserman A. The pascal visual object classes (VOC) challenge. Int'l Journal of Computer Vision, 2010, 88(2):303-338.

[28] Cai Z, Vasconcelos N. Cascade R-CNN:Delving into high quality object detection. In:Proc. of the IEEE Conf. on Computer Vision and Pattern Recognition. 2018. 6154-6162.

[29] He K, Gkioxari G, Dollar P, Ross G. Mask R-CNN. In:Proc. of the IEEE Int'l Conf. on Computer Vision. 2017. 2961-2969.

[30] Lin TY, Dollar P, Girshick R, He K, Hariharan B, Belongie S. Feature pyramid networks for object detection. In:Proc. of the IEEE Conf. on Computer Vision and Pattern Recognition. 2017. 936-944.

[31] Zhang H, Goodfellow I, Metaxas D, Odena A. Self-attention generative adversarial networks. In:Proc. of the Int'l Conf. on Machine Learning. 2019. 7354-7363.

附中文参考文献:

[2] 张银霞,马小川,杨季彪,徐雪南.基于卡尔曼滤波的考生异常行为检测与识别.齐齐哈尔大学学报(自然科学版), 2017, 33(6):16-19.

[25] 陶丽丽.基于SSD的考场考生检测方法研究[硕士学位论文].西安:陕西师范大学, 2020.

Get Citation

田卓钰,马苗,杨楷芳.基于级联注意力与点监督机制的考场目标检测模型.软件学报,2022,33(7):2633-2645

Copy

Article Metrics

Abstract:887
PDF: 2449
HTML: 1433
Cited by: 0

History

Received:May 08,2020
Revised:December 03,2020
Adopted:
Online: July 16,2022
Published: July 06,2022

You are the first2033333Visitors
Copyright: Institute of Software, Chinese Academy of Sciences Beijing ICP No. 05046678-4
Address：4# South Fourth Street, Zhong Guan Cun, Beijing 100190,Postal Code：100190
Phone：010-62562563 Fax：010-62562533 Email：jos@iscas.ac.cn
Technical Support：Beijing Qinyun Technology Development Co., Ltd.

Beijing Public Network Security No. 11040202500063

微信服务号

微信订阅号

Get Citation

Share

微信扫一扫：分享

Article Metrics

History