Creation and Analysis of Emotional Speech Database for Multiple Emotions Recognition

Ryota Sato, Ryohei Sasaki, Norisato Suga, Toshihiro Furukawa

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

Speech emotion recognition (SER) is one of the latest challenges in human-computer interaction. Conventional SER classification methods output a single emotion label per utterance as the estimation result, because the emotional speech databases conventionally used to train SER models assign a single emotion label to each utterance. However, human speech often expresses multiple emotions simultaneously, with different intensities. To realize more natural SER, the presence of multiple emotions in one utterance should be taken into account. We therefore created an emotional speech database that contains labels for multiple emotions and their intensities. The database was created by extracting, from existing video works, the parts of speech utterances in which emotions appear. In addition, we evaluated the created database through statistical analysis. As a result, 2,025 samples were obtained, of which 1,525 contained multiple emotions.

Original language: English
Title of host publication: Proceedings of 2020 23rd Conference of the Oriental COCOSDA International Committee for the Co-Ordination and Standardisation of Speech Databases and Assessment Techniques, O-COCOSDA 2020
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 33-37
Number of pages: 5
ISBN (Electronic): 9781728198965
Publication status: Published - 2020 Nov 5
Externally published: Yes
Event: 23rd Conference of the Oriental COCOSDA International Committee for the Co-Ordination and Standardisation of Speech Databases and Assessment Techniques, O-COCOSDA 2020 - Virtual, Yangon, Myanmar
Duration: 2020 Nov 5 to 2020 Nov 7

Publication series

Name: Proceedings of 2020 23rd Conference of the Oriental COCOSDA International Committee for the Co-Ordination and Standardisation of Speech Databases and Assessment Techniques, O-COCOSDA 2020

Conference

Conference: 23rd Conference of the Oriental COCOSDA International Committee for the Co-Ordination and Standardisation of Speech Databases and Assessment Techniques, O-COCOSDA 2020
Country/Territory: Myanmar
City: Virtual, Yangon
Period: 20/11/5 to 20/11/7

Keywords

  • emotion estimation
  • emotion recognition
  • emotional intensity
  • emotional speech database
  • multiple emotions
  • speech corpus
  • speech emotion

ASJC Scopus subject areas

  • Computer Science Applications
  • Information Systems
  • Information Systems and Management
  • Linguistics and Language
