Research and Implementation of the Techniques for Chinese Dictation Machines
DOI:
Author:
Affiliation:

Clc Number:

Fund Project:

  • Article
  • |
  • Figures
  • |
  • Metrics
  • |
  • Reference
  • |
  • Related
  • |
  • Cited by
  • |
  • Materials
  • |
  • Comments
    Abstract:

    In this paper, the search strategies in the acoustic layer of the CSR (continuous speech recognition) and the CDM (Chinese dictation machine) are addressed in two aspects, the acoustic recognition unit and the syntax-constrained word search tree. The SKB-FSS (statistical knowledge based frame synchronous search) algorithm and the syntax-constrained WST (word search tree) structure are proposed, they form the TLSN (two-level search network) in the acoustic layer. The statistical knowledge used by the algorithm includes differential state dwell distribution, the feature difference sum and so on, which result in an improvement of 36.6% in CSR. The principles of a modified back-off estimation algorithm and the search algorithms for the N-gram based language models are also introduced. Finally, by integrating the authors' techniques, a Chinese dictation machine engine (CDME) is implemented. A speaker-independent CDM text editor named ST97 and a voice command system named CMD97 are established for personal computers (PCs) based on the CDME.

    Reference
    Related
    Cited by
Get Citation

郑 方,牟晓隆,徐明星,武 健,宋战江.汉语语音听写机技术的研究与实现.软件学报,1999,10(4):436-444

Copy
Share
Article Metrics
  • Abstract:
  • PDF:
  • HTML:
  • Cited by:
History
  • Received:February 24,1998
  • Revised:May 12,1998
  • Adopted:
  • Online:
  • Published:
You are the firstVisitors
Copyright: Institute of Software, Chinese Academy of Sciences Beijing ICP No. 05046678-4
Address:4# South Fourth Street, Zhong Guan Cun, Beijing 100190,Postal Code:100190
Phone:010-62562563 Fax:010-62562533 Email:jos@iscas.ac.cn
Technical Support:Beijing Qinyun Technology Development Co., Ltd.

Beijing Public Network Security No. 11040202500063