Building an English speech synthesis system from a Japanese ALS patient's voice

Akemi Ishii, Jun Ito, Shimpei Kajima, Tsutomu Sugawara

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

This paper reports on the development of an English speech synthesis system for a Japanese amyotrophic lateral sclerosis patient as part of the project of developing a bilingual communication aid for this patient. The patient had a tracheotomy three years ago and anticipates the possibility of losing his phonatory function. His English speech database for Festival, a free speech synthesis system, was generated from his reading of a US diphone list. There were two problems with the recording. The first was the noise that the artificial ventilator made and the second was his difficulty in pronouncing English. Although the speaker's English database was successfully built by Festvox and the voice was recognized as his voice, the utterance was unintelligible. We therefore proposed reconstructing the patient's database by partially combining it with an English native speaker's database. Results showed that the proposed approach can be promising for those facing this problem.
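
The idea of partially combining two diphone databases can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say how units were selected or stored, so the dictionary-based index, the function name merge_diphone_indexes, the replace set, and the file paths are all hypothetical. The sketch keeps the patient's unit for each diphone by default and substitutes the native speaker's unit only for diphones flagged as unintelligible or missing from the patient's recordings.

```python
from typing import Dict, Set


def merge_diphone_indexes(
    patient: Dict[str, str],
    native: Dict[str, str],
    replace: Set[str],
) -> Dict[str, str]:
    """Combine two diphone indexes (diphone name -> recording path).

    Hypothetical helper: patient units are kept by default; a native
    speaker's unit is used when the diphone is listed in `replace`
    or is absent from the patient's index.
    """
    merged: Dict[str, str] = {}
    for diphone in sorted(set(patient) | set(native)):
        use_native = diphone in replace or diphone not in patient
        if use_native and diphone in native:
            merged[diphone] = native[diphone]
        else:
            # Fall back to the patient's unit when no native unit exists.
            merged[diphone] = patient[diphone]
    return merged


if __name__ == "__main__":
    # Illustrative data only; names and paths are made up.
    patient_index = {"k-a": "patient/k-a.wav", "r-ah": "patient/r-ah.wav"}
    native_index = {
        "k-a": "native/k-a.wav",
        "r-ah": "native/r-ah.wav",
        "th-ih": "native/th-ih.wav",
    }
    print(merge_diphone_indexes(patient_index, native_index, {"r-ah"}))
```

Keeping the patient's units as the default reflects the stated goal of preserving the recognizable voice; how many units can be replaced before the voice stops sounding like the patient is a question the sketch does not address.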

Original language: English
Title of host publication: INTERSPEECH 2006 and 9th International Conference on Spoken Language Processing, INTERSPEECH 2006 - ICSLP
Publisher: International Speech Communication Association
Pages: 1994-1997
Number of pages: 4
Volume: 4
ISBN (Print): 9781604234497
Publication status: Published - 2006 Jan 1
Externally published: Yes
Event: INTERSPEECH 2006 and 9th International Conference on Spoken Language Processing, INTERSPEECH 2006 - ICSLP - Pittsburgh, PA, United States
Duration: 2006 Sep 17 - 2006 Sep 21

Other

Other: INTERSPEECH 2006 and 9th International Conference on Spoken Language Processing, INTERSPEECH 2006 - ICSLP
Country: United States
City: Pittsburgh, PA
Period: 06/9/17 - 06/9/21

Fingerprint

Speech synthesis
Communication

ASJC Scopus subject areas

  • Computer Science(all)

Cite this

Ishii, A., Ito, J., Kajima, S., & Sugawara, T. (2006). Building an English speech synthesis system from a Japanese ALS patient's voice. In INTERSPEECH 2006 and 9th International Conference on Spoken Language Processing, INTERSPEECH 2006 - ICSLP (Vol. 4, pp. 1994-1997). International Speech Communication Association.

Building an English speech synthesis system from a Japanese ALS patient's voice. / Ishii, Akemi; Ito, Jun; Kajima, Shimpei; Sugawara, Tsutomu.

INTERSPEECH 2006 and 9th International Conference on Spoken Language Processing, INTERSPEECH 2006 - ICSLP. Vol. 4 International Speech Communication Association, 2006. p. 1994-1997.

Ishii, A, Ito, J, Kajima, S & Sugawara, T 2006, Building an English speech synthesis system from a Japanese ALS patient's voice. in INTERSPEECH 2006 and 9th International Conference on Spoken Language Processing, INTERSPEECH 2006 - ICSLP. vol. 4, International Speech Communication Association, pp. 1994-1997, INTERSPEECH 2006 and 9th International Conference on Spoken Language Processing, INTERSPEECH 2006 - ICSLP, Pittsburgh, PA, United States, 06/9/17.
Ishii A, Ito J, Kajima S, Sugawara T. Building an English speech synthesis system from a Japanese ALS patient's voice. In INTERSPEECH 2006 and 9th International Conference on Spoken Language Processing, INTERSPEECH 2006 - ICSLP. Vol. 4. International Speech Communication Association. 2006. p. 1994-1997
Ishii, Akemi ; Ito, Jun ; Kajima, Shimpei ; Sugawara, Tsutomu. / Building an English speech synthesis system from a Japanese ALS patient's voice. INTERSPEECH 2006 and 9th International Conference on Spoken Language Processing, INTERSPEECH 2006 - ICSLP. Vol. 4 International Speech Communication Association, 2006. pp. 1994-1997
@inproceedings{27549c411bec4ef686169bcd9cac1a72,
title = "Building an English speech synthesis system from a Japanese ALS patient's voice",
abstract = "This paper reports on the development of an English speech synthesis system for a Japanese amyotrophic lateral sclerosis patient as part of the project of developing a bilingual communication aid for this patient. The patient had a tracheotomy three years ago and anticipates the possibility of losing his phonatory function. His English speech database for Festival, a free speech synthesis system, was generated from his reading of a US diphone list. There were two problems with the recording. The first was the noise that the artificial ventilator made and the second was his difficulty in pronouncing English. Although the speaker's English database was successfully built by Festvox and the voice was recognized as his voice, the utterance was unintelligible. We therefore proposed reconstructing the patient's database by partially combining it with an English native speaker's database. Results showed that the proposed approach can be promising for those facing this problem.",
author = "Akemi Ishii and Jun Ito and Shimpei Kajima and Tsutomu Sugawara",
year = "2006",
month = "1",
day = "1",
language = "English",
isbn = "9781604234497",
volume = "4",
pages = "1994--1997",
booktitle = "INTERSPEECH 2006 and 9th International Conference on Spoken Language Processing, INTERSPEECH 2006 - ICSLP",
publisher = "International Speech Communication Association",
}

TY  - GEN
T1  - Building an English speech synthesis system from a Japanese ALS patient's voice
AU  - Ishii, Akemi
AU  - Ito, Jun
AU  - Kajima, Shimpei
AU  - Sugawara, Tsutomu
PY  - 2006/1/1
Y1  - 2006/1/1
AB  - This paper reports on the development of an English speech synthesis system for a Japanese amyotrophic lateral sclerosis patient as part of the project of developing a bilingual communication aid for this patient. The patient had a tracheotomy three years ago and anticipates the possibility of losing his phonatory function. His English speech database for Festival, a free speech synthesis system, was generated from his reading of a US diphone list. There were two problems with the recording. The first was the noise that the artificial ventilator made and the second was his difficulty in pronouncing English. Although the speaker's English database was successfully built by Festvox and the voice was recognized as his voice, the utterance was unintelligible. We therefore proposed reconstructing the patient's database by partially combining it with an English native speaker's database. Results showed that the proposed approach can be promising for those facing this problem.
UR  - http://www.scopus.com/inward/record.url?scp=44949252239&partnerID=8YFLogxK
UR  - http://www.scopus.com/inward/citedby.url?scp=44949252239&partnerID=8YFLogxK
M3  - Conference contribution
SN  - 9781604234497
VL  - 4
SP  - 1994
EP  - 1997
BT  - INTERSPEECH 2006 and 9th International Conference on Spoken Language Processing, INTERSPEECH 2006 - ICSLP
PB  - International Speech Communication Association
ER  -