Naturalistic emotional speech collection paradigm with online game and its psychological and acoustical assessment

Yoshiko Arimoto, Hiromi Kawatsuz, Sumio Ohno, Hitoshi Iida

Research output: Contribution to journalArticlepeer-review

28 Citations (Scopus)


For the purpose of constructing a naturalistic emotional speech database, a novel paradigm of collecting naturalistic emotional speech during a spontaneous Japanese dialog was proposed. The proposed paradigm was assessed by investigating whether the collected speech contains and conveys rich emotions psychologically and acoustically. To encourage speakers to experience and express their natural and vivid emotions, a Massively Multiplayer Online Role-Playing Game (MMORPG) was adopted as a task for speakers. They were asked to play the MMORPG together while discussing strategies to achieve their tasks through a voice chat system. The recording was performed for one hour per speaker. The total recording time was approximately 14 hours. The results of emotional labeling for the collected speech supported the validity of the paradigm showing higher interlabeler agreement than the chance levels. In addition, it was revealed that the paradigm is superior in the quantity of emotional speech to other paradigm by showing a significantly higher rate of labeling instances for our speech material (73%, χ 2 27659:87, p < 0:001) than other speech materials. Finally, an acoustical analysis supported the validity of the paradigm, showing a significant difference between the nonemotional utterances and the emotional utterances (p < 0:05).

Original languageEnglish
Pages (from-to)359-369
Number of pages11
JournalAcoustical Science and Technology
Issue number6
Publication statusPublished - 2012


  • Acoustic analysis
  • Emotional speech
  • Online game
  • Spoken dialog
  • Voice chat

ASJC Scopus subject areas

  • Acoustics and Ultrasonics


Dive into the research topics of 'Naturalistic emotional speech collection paradigm with online game and its psychological and acoustical assessment'. Together they form a unique fingerprint.

Cite this