    Abstract:

    For speech recognition systems operating in noisy environments, lip-reading can effectively reduce the influence of noise and improve recognition accuracy by adding visual information to the acoustic channel. In this paper, an effective and robust approach for locating and tracking the lips and mouth is presented, enabling information extraction under abnormal illumination and without special markers. The approach first locates the face region with a skin-color model, then finds the eyes within the face region using an iterative algorithm, adjusts the position and size of the face according to the eye positions, transforms the lower part of the face into a specific color coordinate system to clearly distinguish lip color from skin color, and finally describes the outlines of the upper and lower lips with a deformable template.
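    The first and fourth steps of the pipeline described in the abstract can be illustrated with a minimal Python sketch. The abstract does not specify the skin-color model or the color coordinates used, so everything here is an assumption: the normalized (r, g) chromaticity box, its threshold values, the pseudo-hue ratio R/(R+G) used as a stand-in lip/skin transform, and the function names `skin_mask`, `bounding_box`, and `lip_contrast` are all hypothetical and are not taken from the paper.

```python
import numpy as np


def skin_mask(rgb, r_range=(0.36, 0.47), g_range=(0.28, 0.36)):
    """Return a boolean mask of pixels whose normalized (r, g)
    chromaticity lies inside a rectangular box.

    The default ranges are illustrative placeholders, not values
    from the paper.
    """
    rgb = rgb.astype(np.float64)
    total = rgb.sum(axis=2) + 1e-9  # avoid division by zero on black pixels
    r = rgb[..., 0] / total
    g = rgb[..., 1] / total
    return (
        (r >= r_range[0]) & (r <= r_range[1])
        & (g >= g_range[0]) & (g <= g_range[1])
    )


def bounding_box(mask):
    """Return (top, bottom, left, right), inclusive, of the True
    pixels in the mask, or None if the mask is empty."""
    rows = np.flatnonzero(mask.any(axis=1))
    cols = np.flatnonzero(mask.any(axis=0))
    if rows.size == 0:
        return None
    return int(rows[0]), int(rows[-1]), int(cols[0]), int(cols[-1])


def lip_contrast(rgb):
    """Pseudo-hue R / (R + G): redder pixels (lips) score higher
    than typical skin pixels. Assumed stand-in for the paper's
    unspecified color transform."""
    rgb = rgb.astype(np.float64)
    return rgb[..., 0] / (rgb[..., 0] + rgb[..., 1] + 1e-9)
```

    In use, `bounding_box(skin_mask(image))` would give a candidate face rectangle, and `lip_contrast` would be applied to the lower part of that rectangle before fitting a lip template. Eye localization and the deformable template itself are not covered by this sketch.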

Citation

Yao Hongxun, Gao Wen, Li Jingmei, Lü Yajuan, Wang Rui. Real-time lip locating method for lip-movement recognition. Journal of Software, 2000, 11(8):1126-1132

History
  • Received: May 17, 1999
  • Revised: September 9, 1999