This paper presents an approach to robust speech recognition based on neighborhood space, which can achieve performance robustness under mismatch between training and testing conditions. This approach uses neighborhood space of each underlying model to produce Bayesian predictive density as observation probability density. Experimental results show that the proposed method improves the performance robustness.
[1]Hirsch H,Ehrlicher C.Noise estimation technique for robust speech recognition.In:Proc.of the IEEE Int'l Conf.on Acoustic,Speech and Signal Processing.1995.153-156.
[2]Hermansky H,Tibrewala S,Pavel M.Towards ASR on partially corrupted speech.In:Proc.of the Int'l Conf.on Spoken Language Processing.1996.458-461.http://ieeexplore.ieee.org/xpls/abs_all.jsp?tp=&arnumber=607154
[3]Zhang XY,Wang F,Zheng F,Xu MX,Wu WH.Integrating sub-band information into feature extraction for robust speech recognition.Journal of Chinese Information Processing,2002,16(1):19-24 (in Chinese with English abstract).
[4]Zhu QF,Alwan A.Non-Linear feature extraction for robust speech recognition in stationary and non-stationary noise.Computer Speech and Language,2003,17(4):381-402.
[5]Yuo KH,Wang HC.Robust features for noisy speech recognition based on temporal trajectory filtering of short-time autocorrelation sequences.Speech Communication,1999,28(1):13-24.
[6]Gauvain,JL,Lee CH.Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains.IEEE Trans.on Speech and Audio Processing,1994,2(2):291-298.
[7]Gales,MJF,Young SJ.Robust continuous speech recognition using parallel model combination.IEEE Trans.on Speech and Audio Processing,1996,4(5):352-359.
[8]Renevey P,Drygajlo A.Statistical estimation of unreliable features for robust speech recognition.In:Proc.of the IEEE Int'l Conf.on Acoustic,Speech and Signal Processing.2000.1731-1734.
[9]Leggetter CJ,Woodland PC.Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models.Computer Speech and Language,1995,9(2):171-185.
[10]Sankar A,Lee CH.A maximum likelihood approach to stochastic matching for robust speech recognition.IEEE Trans.on Speech and Audio Processing,1996,4(3):190-202.
[11]Huo Q,Lee CH.On-Line adaptive learning of the continuous density hidden Markov model based on approximate recursive Bayes estimate.IEEE Trans.on Speech and Audio Processing,1997,5(2):161-172.
[12]Huo Q,Jiang H,Lee CH.A Bayesian predictive classification approach to robust speech recognition.In:Proc.of the IEEE Int'l Conf.on Acoustic,Speech and Signal Processing.1997.1547-1550.
[13]Jiang H,Hirose K,Huo Q.Robust speech recognition based on viterbi Bayesian predictive classification.In:Proc.of the IEEE Int'l Conf.on Acoustic,Speech and Signal Processing.1997.1551-1554.
[14]Vapnik V.The Nature of Statistical Learning Theory.2nd ed.,New York:Springer-Verlag,2000.
You are the first2038773Visitors
Copyright: Institute of Software, Chinese Academy of Sciences Beijing ICP No. 05046678-4
Address:4# South Fourth Street, Zhong Guan Cun, Beijing 100190,Postal Code:100190
Phone:010-62562563 Fax:010-62562533 Email:jos@iscas.ac.cn
Technical Support:Beijing Qinyun Technology Development Co., Ltd.