基于虚拟不定长的语音库裁剪方法

微信服务号

微信订阅号

首页 > 过刊浏览>2006年第17卷第5期 >983-990

基于虚拟不定长的语音库裁剪方法
DOI:
                        
                    
作者:
                        
                        
                    
作者单位:
作者简介:
通讯作者:
中图分类号:
基金项目:Supported bythe National High-Tech Research and Development Plan of China under Grant No.2004AA114030(国家高技术研究发展计划(863))

Virtual Non-Uniform Synthesis Instances Pruning Approach for Corpus-Based Speech Synthesis System

Author:

Affiliation:

Fund Project:

摘要

图/表

访问统计

参考文献

相似文献

引证文献

资源附件

文章评论

摘要:

语音库裁剪或语音库去冗余,是大语料库语音合成技术的一个重要问题.提出了虚拟不定长替换的概念,以弥补不定长的损失.结合合成使用变体的频度,构建了语音库裁剪算法StaRp-VPA.该算法能够以任意比例裁剪语音库.实验表明:当裁剪率小于50%时,合成自然度几乎没有下降;当裁剪率大于50%时,合成自然度也不会严重降低.

Abstract:

Tailoring voice font, or pruning redundant synthesis instances, is an important issue of scalable Corpus-based Text To Speech (TTS) system. However, pruning redundant synthesis instances, usually results in the loss of non-uniform. In order to solve this problem, the concept of virtual non-uniform is proposed. According to this concept and the synthesis frequency of each instance, an algorithm named StaRp-VPA is constructed to make TTS scalable to hardware. In experiments, the naturalness scored by Mean Opinion Score (MOS) remains almost unchanged when less than 50% instances are pruned off, and the MOS does not severely degrade when the reduction rate is above 50%.

参考文献

相似文献

引证文献

引用本文

张巍,吴晓如,赵志伟,王仁华.基于虚拟不定长的语音库裁剪方法.软件学报,2006,17(5):983-990

复制

文章指标

点击次数:
下载次数:
HTML阅读次数:
引用次数:

历史

收稿日期:2005-05-20
最后修改日期:2005-10-10
录用日期:
在线发布日期:
出版日期:

微信服务号

微信订阅号

引用本文

分享

文章指标

历史

文章二维码