Creation and Analysis of Emotional Speech Database for Multiple Emotions Recognition

Ryota Sato, Ryohei Sasaki, Norisato Suga, Toshihiro Furukawa

研究成果: Conference contribution

抄録

Speech emotion recognition (SER) is one of the latest challenge in human-computer interaction. In conventional SER classification methods, a single emotion label is outputted per one utterance as the estimation result. This is because conventional speech emotional databases which are used to train SER models have a single emotion label for one utterance. However, it is often the case that multiple emotions are expressed simultaneously with different intensities in human speech. In order to realize more natural SER than ever, existence of multiple emotions in one utterance should be taken into account. Therefore, we created an emotional speech database which contains multiple emotions and their intensities labels. The creation experiment was conducted by extracting speech utterance parts where emotions appear from existing video works. In addition, we evaluated the created database by performing statistical analysis on the database. As a result, 2,025 samples were obtained, of which 1,525 samples contained multiple emotions.

本文言語English
ホスト出版物のタイトルProceedings of 2020 23rd Conference of the Oriental COCOSDA International Committee for the Co-Ordination and Standardisation of Speech Databases and Assessment Techniques, O-COCOSDA 2020
出版社Institute of Electrical and Electronics Engineers Inc.
ページ33-37
ページ数5
ISBN(電子版)9781728198965
DOI
出版ステータスPublished - 2020 11月 5
外部発表はい
イベント23rd Conference of the Oriental COCOSDA International Committee for the Co-Ordination and Standardisation of Speech Databases and Assessment Techniques, O-COCOSDA 2020 - Virtual, Yangon, Myanmar
継続期間: 2020 11月 52020 11月 7

出版物シリーズ

名前Proceedings of 2020 23rd Conference of the Oriental COCOSDA International Committee for the Co-Ordination and Standardisation of Speech Databases and Assessment Techniques, O-COCOSDA 2020

Conference

Conference23rd Conference of the Oriental COCOSDA International Committee for the Co-Ordination and Standardisation of Speech Databases and Assessment Techniques, O-COCOSDA 2020
国/地域Myanmar
CityVirtual, Yangon
Period20/11/520/11/7

ASJC Scopus subject areas

  • コンピュータ サイエンスの応用
  • 情報システム
  • 情報システムおよび情報管理
  • 言語学および言語

フィンガープリント

「Creation and Analysis of Emotional Speech Database for Multiple Emotions Recognition」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル