A satisfaction-based model for affect recognition from conversational features in spoken dialog systems

authors

LEBAI LUTFI, SYAHEERAH
FERNANDEZ MARTINEZ, FERNANDO
LUCAS CUESTA, JUAN MANUEL
LOPEZ LEBON, LORENA
MONTERO, JUAN MANUEL

published in

SPEECH COMMUNICATION Journal

publication date

September 2013

start page

825

end page

840

issue

7-8

volume

55

Digital Object Identifier (DOI)

https://doi.org/10.1016/j.specom.2013.04.005

International Standard Serial Number (ISSN)

0167-6393

Electronic International Standard Serial Number (EISSN)

1872-7182

abstract

Detecting user affect automatically during real-time conversation is the main challenge towards our greater aim of infusing social intelligence into a natural-language mixed-initiative High-Fidelity (Hi-Fi) audio control spoken dialog agent. In recent years, studies on affect detection from voice have moved on to using realistic, non-acted data, which is subtler. However, it is more challenging to perceive subtler emotions and this is demonstrated in tasks such as labeling and machine prediction. This paper attempts to address part of this challenge by considering the role of user satisfaction ratings and also conversational/dialog features in discriminating contentment and frustration, two types of emotions that are known to be prevalent within spoken human-computer interaction. However, given the laboratory constraints, users might be positively biased when rating the system, indirectly making the reliability of the satisfaction data questionable. Machine learning experiments were conducted on two datasets, users and annotators, which were then compared in order to assess the reliability of these datasets. Our results indicated that standard classifiers were significantly more successful in discriminating the abovementioned emotions and their intensities (reflected by user satisfaction ratings) from annotator data than from user data. These results corroborated that: first, satisfaction data could be used directly as an alternative target variable to model affect, and that they could be predicted exclusively by dialog features. Second, these were only true when trying to predict the abovementioned emotions using annotator's data, suggesting that user bias does exist in a laboratory-led evaluation.

A satisfaction-based model for affect recognition from conversational features in spoken dialog systems Articles

Overview

authors

published in

publication date

start page

end page

issue

volume

Digital Object Identifier (DOI)

International Standard Serial Number (ISSN)

Electronic International Standard Serial Number (EISSN)

abstract

Classification

keywords