Comparison between random and daily speech database in the speech visualization

Nur Syafikah Binti Samsudin, Kazunori Mano

研究成果: Conference contribution

抜粋

This paper presents a new technique using animated texts as the speech features' visualization medium for checking and detecting language learners' pronunciation. The proposed visualization tool will transform learners' speech features such as pitch, tempo or rhythm into animated texts form, and the mispronounce parts can be located by comparing them with the correct sample. In our previous experiments, Japanese language learners gave positive feedback on the animated texts designs as their speech visualization tool. By practicing this tool, learners found that the mispronounce parts in their speech can be detected and confirmed easily. However, in order to have more practical feedback and response from the learners, besides using the plain speech data set, which only express random speech contents, the daily conversation speech data set is proposed as the data sample. In this paper, the comparison between both database samples for determining the proposed visualization tool's approachability was observed. Evaluation experiments results showed that participants gave positive responses to the animated texts visualization tool and were able to understand speech features in the visualized texts form better by using the daily conversation as the speech sample.

元の言語English
ホスト出版物のタイトル2017 IEEE International Conference on Systems, Man, and Cybernetics, SMC 2017
出版者Institute of Electrical and Electronics Engineers Inc.
ページ3135-3140
ページ数6
2017-January
ISBN(電子版)9781538616451
DOI
出版物ステータスPublished - 2017 11 27
イベント2017 IEEE International Conference on Systems, Man, and Cybernetics, SMC 2017 - Banff, Canada
継続期間: 2017 10 52017 10 8

Other

Other2017 IEEE International Conference on Systems, Man, and Cybernetics, SMC 2017
Canada
Banff
期間17/10/517/10/8

ASJC Scopus subject areas

  • Artificial Intelligence
  • Computer Science Applications
  • Human-Computer Interaction
  • Control and Optimization

フィンガープリント Comparison between random and daily speech database in the speech visualization' の研究トピックを掘り下げます。これらはともに一意のフィンガープリントを構成します。

  • これを引用

    Samsudin, N. S. B., & Mano, K. (2017). Comparison between random and daily speech database in the speech visualization. : 2017 IEEE International Conference on Systems, Man, and Cybernetics, SMC 2017 (巻 2017-January, pp. 3135-3140). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/SMC.2017.8123109