Error Analysis of Intention Classification and Speech Recognition in Human-Computer Dialog
Author:
Affiliation:

Fund Project:

National Key Research & Development Plan of China (2016YFB1001404); National High-Tech R&D Program of China (863) (2015AA016305); National Natural Science Foundation of China (61425017, 61403386, 61305003, 61332017, 61375027, 61273288, 61233009, 61203258); Strategic Priority Research Program of the Chinese Academy of Sciences (XDB02080006); Program of Guangxi Cooperative Innovation Center of cloud computing and Big Data, Guangxi Colleges and Universities Key Laboratory of cloud computing and complex systems (YD16E11); Program of Guangxi Key Laboratory of Trusted Software (kx201601)

  • Article
  • | |
  • Metrics
  • |
  • Reference [13]
  • |
  • Related
  • | | |
  • Comments
    Abstract:

    In the natural human-computer dialogue system, environmental noises, accents and some other factors may cause the speech recognition errors which leads to computers' error responses to human. The dialogs are often interrupted by the system's bad responses. Three types of human computer interruptions are considered in this paper:improper feedback for topic, improper response for a vague user query, and improper feedback for an exact user query. According to the records of the user and computer dialogue analysis, the interruptions caused by three situations above are compared and used to analyze the importance of intention classification in human-computer conversation. The statistical data find that the dialogue interruption caused by the inappropriate topic feedback is the most obvious problem, amounting to 60.1%. Under the correct feedback of the topic, the interrupt ratio of the subject caused by accurate answer and fuzzy answer is 22.2% and 21.6% respectively. In the case of error speech recognition, semantic analysis can bring more feedback error to the error of speech recognition. The analysis of experimental data shows that the speech recognition errors, can effectively reduce the man-machine conversation interrupt and improve the naturalness of human-computer dialogue system according to the context information to improve the accuracy of the computer on the topic of user feedback,. This paper provides the importance of intention classification in human machine dialogue, which helps to improve the performance of human-computer dialogue system.

    Reference
    [1] Zue V, Seneff S, Polifroni J, Phillips M, Pao C, Goodine D, Goddeau D, Glass J. PEGASUS:A spoken dialogue interface for on-line air travel planning. Speech Communication, 1994,15(3-4):331-340.
    [2] http://www.zabaware.com/
    [3] http://www.pandorabots.com/pandora/talk?botid=f5d922d97e345aa1
    [4] Schwärzlery S, Maiery S, Schenk J, Wallhoff F, Rigoll G. Using graphical models for mixed-initiative dialog management systems with realtime policies. In:Proc. of the Annual Conf. of the Int'l Speech Communication Association-INTERSPEECH. 2009. 260-263.
    [5] Pietquin O, Dutoit T. A probabilistic framework for dialog simulation and optimal strategy learning. IEEE Trans. on Audio, Speech, and Language Processing, 2006,14(2):589-599.
    [6] Schatzmann J, Weilhammer K, Stuttle M, Young S. A survey of statistical user simulation techniques for reinforcement-learning of dialogue management strategies. Knowledge Engineering Review, 2006,21(2):97-126.
    [7] Williams JD, Poupart P, Young P. Partially observable Markov decision processes with continuous observations for dialogue management. In:Proc. of the 6th SigDial Workshop on Discourse and Dialogue Lisbon. 2005.
    [8] Young S. Using POMDPs for dialog management. In:Proc. of the IEEE Workshop on Spoken Language Technology. 2006.
    [9] Yang MH, Tao JH, Li H, Chao LL. Nature multimodal human-computer-interaction dialog system. Computer Science, 2014,41(10):12-18,35(in Chinese with English abstract).
    [10] Yang MH, Tao JH, Mu KH, Li Y, Che JF. A multimodal approach of generating 3D human-like talking agent. Journal on Multimodal User Interfaces, 2012,5(1-2):61-68.
    [11] http://yuyin.baidu.com/asr
    附中文参考文献:
    [9] 杨明浩,陶建华,李昊,巢林林.面向自然交互的多通道人机对话系统.计算机科学,2014,41(10):12-18,35.
    Related
    Cited by
    Comments
    Comments
    分享到微博
    Submit
Get Citation

杨明浩,高廷丽,陶建华,张大伟,孙梦伊,李昊,巢林林.对话意图及语音识别错误对交互体验的影响.软件学报,2016,27(S2):69-75

Copy
Share
Article Metrics
  • Abstract:1961
  • PDF: 4339
  • HTML: 0
  • Cited by: 0
History
  • Received:June 01,2015
  • Revised:January 05,2016
  • Online: January 10,2017
You are the firstVisitors
Copyright: Institute of Software, Chinese Academy of Sciences Beijing ICP No. 05046678-4
Address:4# South Fourth Street, Zhong Guan Cun, Beijing 100190,Postal Code:100190
Phone:010-62562563 Fax:010-62562533 Email:jos@iscas.ac.cn
Technical Support:Beijing Qinyun Technology Development Co., Ltd.

Beijing Public Network Security No. 11040202500063