Summary

Computer scientists Jonathan Gratch and Stacy Marsella trace the influence of the 1988 publication of the OCC model, noting its impact on computational models of the interplay between emotion and cognition and on practical applications such as emotion recognition and the generation of emotion-related behaviors. They explain how OCC’s detailing of specific rules for reasoning about emotions invigorated work in affective computing, and in artificial intelligence more generally, by giving computer scientists a clear pathway for modeling emotion processes. The model, they suggest, had both “upstream influences” on work concerning the cognitive antecedents of emotions and “downstream influences” on work modeling some of the consequences of emotions, such as coping and decision-making and their relation to changes in beliefs, desires, and intentions. Finally, at the level of the sociology of science, the authors suggest that the OCC model contributed significantly to bringing together the emotion research community in psychology and the computer science communities interested in modeling affective phenomena.

The majority of research in speech emotion recognition (SER) focuses on recognizing emotion categories. Recognizing dimensional emotion attributes is also important, however, and has several advantages over categorical emotion recognition. In this research, we investigate dimensional SER using both speech features and word embeddings. A concatenation network joins the acoustic and text networks built on these bimodal features. We demonstrate that the bimodal features, both extracted from speech, improve the performance of dimensional SER over unimodal SER using either acoustic features or word embeddings alone. Adding word embeddings to the SER system yields a significant improvement on the valence dimension, while the arousal and dominance dimensions also improve. We propose a multitask learning (MTL) approach for predicting all emotional attributes. This MTL approach simultaneously maximizes the concordance correlation between predicted emotion degrees and true emotion labels. The findings suggest that MTL with two parameters is better than the other evaluated methods at representing the interrelation of emotional attributes. In the unimodal results, speech features attain higher performance on arousal and dominance, while word embeddings are better for predicting valence. The overall evaluation uses the concordance correlation coefficient score of the three emotional attributes. We also discuss some differences between categorical and dimensional emotion results from psychological and engineering perspectives.
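
To make the multitask objective concrete, the following is a minimal sketch of a concordance correlation coefficient (CCC) and a two-parameter weighted loss over valence, arousal, and dominance. The function names `ccc` and `mtl_ccc_loss`, the weight names `alpha` and `beta`, and the specific weighting scheme (the third weight set to `1 - alpha - beta`) are assumptions made for illustration; the summary above states only that the MTL uses two parameters and maximizes concordance correlation.

```python
from statistics import fmean


def ccc(pred, true):
    """Concordance correlation coefficient of two equal-length sequences."""
    if len(pred) != len(true) or len(pred) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    mean_p, mean_t = fmean(pred), fmean(true)
    # Population (biased) variance and covariance.
    var_p = fmean([(p - mean_p) ** 2 for p in pred])
    var_t = fmean([(t - mean_t) ** 2 for t in true])
    cov = fmean([(p - mean_p) * (t - mean_t) for p, t in zip(pred, true)])
    denom = var_p + var_t + (mean_p - mean_t) ** 2
    # CCC is undefined when both inputs are the same constant;
    # returning 0.0 here is a convention chosen for this sketch.
    return 0.0 if denom == 0 else 2 * cov / denom


def mtl_ccc_loss(pred, true, alpha, beta):
    """Weighted sum of (1 - CCC) over the three emotional attributes.

    pred and true are dicts keyed by "valence", "arousal", "dominance".
    The weighting form is an assumption, not taken from the source.
    """
    weights = {
        "valence": alpha,
        "arousal": beta,
        "dominance": 1 - alpha - beta,
    }
    return sum(w * (1 - ccc(pred[k], true[k])) for k, w in weights.items())
```

Minimizing `1 - CCC` per attribute is equivalent to maximizing that attribute's CCC, so a single scalar loss of this shape lets one network be trained on all three attributes at once.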

Chapter 10 - There and Back Again: OCC and Affective Computing

Dimensional speech emotion recognition from speech features and word embeddings by using multitask learning
