TY - GEN
T1 - Recording script design for corpus-based TTS system based on coverage of various phonetic elements
AU - Isogai, Mitsuaki
AU - Mizuno, Hideyuki
AU - Mano, Kazunori
PY - 2005
Y1 - 2005
N2 - This paper describes a new recording script generation method that can create speech databases for corpus-based TTS systems. This method is efficient due to its two features; (1) It has a 2-stage algorithm to generate the recording script with consideration of the balance of triphone, syllable and morpheme elements. (2) It can control types of phonetic elements included in the recording script via the weight coefficients of the phonetic elements. An evaluation shows that the 2-stage algorithm is effective in raising the coverage of phonetic elements and that this method yields a recording script containing various phonetic elements. A preference test shows that changing the selection criteria influences the quality of the synthesized speech. The same test also shows that it is better to take account of morpheme-based elements than syllable-based elements in generating a task-specific recording script.
AB - This paper describes a new recording script generation method that can create speech databases for corpus-based TTS systems. This method is efficient due to its two features; (1) It has a 2-stage algorithm to generate the recording script with consideration of the balance of triphone, syllable and morpheme elements. (2) It can control types of phonetic elements included in the recording script via the weight coefficients of the phonetic elements. An evaluation shows that the 2-stage algorithm is effective in raising the coverage of phonetic elements and that this method yields a recording script containing various phonetic elements. A preference test shows that changing the selection criteria influences the quality of the synthesized speech. The same test also shows that it is better to take account of morpheme-based elements than syllable-based elements in generating a task-specific recording script.
UR - http://www.scopus.com/inward/record.url?scp=33646781552&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=33646781552&partnerID=8YFLogxK
U2 - 10.1109/ICASSP.2005.1415110
DO - 10.1109/ICASSP.2005.1415110
M3 - Conference contribution
AN - SCOPUS:33646781552
SN - 0780388747
SN - 9780780388741
T3 - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
SP - 301
EP - 304
BT - 2005 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP '05 - Proceedings - Image and Multidimensional Signal Processing Multimedia Signal Processing
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2005 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP '05
Y2 - 18 March 2005 through 23 March 2005
ER -