Object Detection Model for Examination Classroom Based on Cascade Attention and Point Supervision Mechanism
Author:
Affiliation:

Clc Number:

TP391

  • Article
  • | |
  • Metrics
  • |
  • Reference [34]
  • |
  • Related [20]
  • | | |
  • Comments
    Abstract:

    Smart examination classroom is an important part of smart campus, and accurately and quickly detecting students in the examination classroom is a basic task of building a smart classroom. However, due to the dense distribution and imaging difference of the examinees in an examination classroom, most of the existing object detection methods can not precisely detect all the examinees in real-time. Moreover, most of the object detection methods rely on predefined anchor boxes, which are lack of portability. Aiming at the above problems, this study proposes an efficient one-stage object detection model based on fully convolutional network, which is anchor-free, with a prediction on the input image in pixel-level. In this model, a feature enhancement module is firstly designed based on cascade attention, which can effectively enhance the discriminability of the feature map by gradually refining and modifying the features. Secondly, in order to enable the network to distinguish overlapping objects in the examination classroom, a point supervision mechanism is proposed. Finally, this study verifies the above model on the special dataset of standardized examination classroom. With the cascade attention module and point supervision mechanism, the proposed model achieves 92.9% in mAP at the speed of 22.1 f/s, and is superior to most the state-of-the-art detection models. Especially, for object detection in new classroom environments, the proposed model achieves the best results.

    Reference
    [1] Yong L, Dongjian H. Video-based detection of abnormal behavior in the examination room. In:Proc. of the 2010 IEEE Int'l Forum on Information Technology and Applications. 2010. 295-298.
    [2] Zhang YX, Ma XC, Yang JB, Xu XN. The examinee's abnormal behavior detection and recognition in video based on Kalman filter. Journal of Qiqihar University, 2017, 33(6):16-19(in Chinese with English abstract).
    [3] Krizhevsky A, Sutskever I, Hinton G. ImageNet classification with deep convolutional neural networks. In:Proc. of the Int'l Conf. on Neural Information Processing Systems. 2012. 1097-1105.
    [4] Deng J, Dong W, Socher R, Li L, Li K, Li FF. ImageNet:A large-scale hierarchical image database. In:Proc. of the IEEE Conf. on Computer Vision and Pattern Recognition. 2009. 248-255.
    [5] Girshick R, Donahue J, Darrell T, Malik J. Rich feature hierarchies for accurate object detection and semantic segmentation. In:Proc. of the IEEE Conf. on Computer Vision and Pattern Recognition. 2014. 580-587.
    [6] Girshick R. Fast R-CNN. In:Proc. of the IEEE Int'l Conf. on Computer Vision. 2015. 1440-1448.
    [7] Ren S, He K, Girshick R, Sun J. Faster R-CNN:Towards real-time object detection with region proposal networks. IEEE Trans. on Pattern Analysis and Machine Intelligence, 2017, 39(6):1137-1149.
    [8] Redmon J, Divvala S, Girshick R, Farhadi A. You only look once:Unified, real-time object detection. In:Proc. of the IEEE Conf. on Computer Vision and Pattern Recognition. 2016. 779-788.
    [9] Redmon J, Farhadi A. Yolo9000:Better, faster, stronger. In:Proc. of the IEEE Conf. on Computer Vision and Pattern Recognition. 2017. 7263-7271.
    [10] Redmon J, Farhadi A. Yolov3:An incremental improvement. arXiv:1804.02767, 2018.
    [11] Liu W, Anguelov D, Erhan D, Szegedy C, Reed S, Fu CY, Berg AC. SSD:Single shot multiBox detector. In:Proc. of the European Conf. on Computer Vision. 2016. 21-37.
    [12] Lin TY, Goyal P, Girshick R, He K, Dollár P. Focal loss for dense object detection. In:Proc. of the IEEE Int'l Conf. on Computer Vision. 2017. 2980-2988.
    [13] Law H, Deng J. Cornernet:Detecting objects as paired keypoints. In:Proc. of the European Conf. on Computer Vision. 2018. 734-750.
    [14] Zhou X, Wang D, Krhenbühl P. Objects as points. arXiv:1904.07850, 2019.
    [15] Zhou X, Zhuo J, Krhenbühl P. Bottom-up object detection by grouping extreme and center points. In:Proc. of the IEEE Conf. on Computer Vision and Pattern Recognition. 2019. 850-859.
    [16] Huang L, Yang Y, Deng Y, Yu Y. Densebox:Unifying landmark localization with end to end object detection. arXiv:1509.04874, 2015.
    [17] Kong T, Sun F, Liu H, Jiang Y, Shi J. Foveabox:Beyond anchor-based object detector. arXiv:1904.03797, 2019.
    [18] Tian Z, Shen C, Chen H, He T. FCOS:Fully convolutional one-stage object detection. In:Proc. of the IEEE Int'l Conf. on Computer Vision. 2019. 9627-9636.
    [19] He K, Zhang X, Ren S, Sun J. Deep residual learning for image recognition. In:Proc. of the IEEE Conf. on Computer Vision and Pattern Recognition. 2016. 770-778.
    [20] Larochelle H, Hinton GE. Learning to combine foveal glimpses with a third-order Boltzmann machine. In:Proc. of the Int'l Conf. on Neural Information Processing Systems. 2010. 1243-1251.
    [21] Hu J, Shen L, Albanie S, Sun G, Wu E. Squeeze-and-excitation networks. In:Proc. of the IEEE Conf. on Computer Vision and Pattern Recognition. 2018. 7132-7141.
    [22] Woo S, Park J, Lee JY, Kweon IS. CBAM:Convolutional block attention module. In:Proc. of the European Conf. on Computer Vision. 2018. 3-19.
    [23] Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN, Kaiser L, Polosukhin I. Attention is all you need. In:Proc. of the Int'l Conf. on Neural Information Processing Systems. 2017. 5998-6008.
    [24] Yu J, Jiang Y, Wang Z, Cao Z, Huang T. Unitbox:An advanced object detection network. In:Proc. of the ACM Int'l Conf. on Multimedia. 2016. 516-520.
    [25] Tao LL. Research on the detection method of students in examination classroom based on SSD[MS. Thesis]. Xi'an:Shaanxi Normal University, 2020(in Chinese with English abstract).
    [26] Tzutalin. LabelImg. Git code. 2015. https://github.com/tzutalin/labelImg
    [27] Everingham M, Gool LV, Williams CKI, Winn J, Zisserman A. The pascal visual object classes (VOC) challenge. Int'l Journal of Computer Vision, 2010, 88(2):303-338.
    [28] Cai Z, Vasconcelos N. Cascade R-CNN:Delving into high quality object detection. In:Proc. of the IEEE Conf. on Computer Vision and Pattern Recognition. 2018. 6154-6162.
    [29] He K, Gkioxari G, Dollar P, Ross G. Mask R-CNN. In:Proc. of the IEEE Int'l Conf. on Computer Vision. 2017. 2961-2969.
    [30] Lin TY, Dollar P, Girshick R, He K, Hariharan B, Belongie S. Feature pyramid networks for object detection. In:Proc. of the IEEE Conf. on Computer Vision and Pattern Recognition. 2017. 936-944.
    [31] Zhang H, Goodfellow I, Metaxas D, Odena A. Self-attention generative adversarial networks. In:Proc. of the Int'l Conf. on Machine Learning. 2019. 7354-7363.
    附中文参考文献:
    [2] 张银霞,马小川,杨季彪,徐雪南.基于卡尔曼滤波的考生异常行为检测与识别.齐齐哈尔大学学报(自然科学版), 2017, 33(6):16-19.
    [25] 陶丽丽.基于SSD的考场考生检测方法研究[硕士学位论文].西安:陕西师范大学, 2020.
    Cited by
    Comments
    Comments
    分享到微博
    Submit
Get Citation

田卓钰,马苗,杨楷芳.基于级联注意力与点监督机制的考场目标检测模型.软件学报,2022,33(7):2633-2645

Copy
Share
Article Metrics
  • Abstract:887
  • PDF: 2449
  • HTML: 1433
  • Cited by: 0
History
  • Received:May 08,2020
  • Revised:December 03,2020
  • Online: July 16,2022
  • Published: July 06,2022
You are the first2033333Visitors
Copyright: Institute of Software, Chinese Academy of Sciences Beijing ICP No. 05046678-4
Address:4# South Fourth Street, Zhong Guan Cun, Beijing 100190,Postal Code:100190
Phone:010-62562563 Fax:010-62562533 Email:jos@iscas.ac.cn
Technical Support:Beijing Qinyun Technology Development Co., Ltd.

Beijing Public Network Security No. 11040202500063