A Pattern Mining Approach for Improving Speech Emotion Recognition

Date

2022

Authors

Umut Avci

Publisher

World Scientific

Green Open Access

No

Publicly Funded

No

Impulse

Average

Influence

Average

Popularity

Average

Abstract

Speech-driven user interfaces are becoming more common in our lives. To interact with such systems naturally and effectively, machines need to recognize the emotional states of users and respond to them accordingly. At the heart of the emotion recognition research done to this end lies the emotion representation that enables machines to learn and predict emotions. Speech emotion recognition studies use a wide range of low-to-high-level acoustic features for representation purposes, such as LLDs, their functionals, and BoAW. In this paper, we present a new method for extracting a novel set of high-level features for classifying emotions. For this purpose, we (1) reduce the dimension of discrete-time speech signals, (2) perform a quantization operation on the new signals and assign a distinct symbol to each quantization level, (3) use the symbol sequences representing the signals to extract discriminative patterns that are capable of distinguishing different emotions from each other, and (4) generate a separate set of features for each emotion from the extracted patterns. Experimental results show that pattern features outperform Energy, Voicing, MFCC, Spectral, and RASTA feature sets. We also demonstrate that combining the pattern-based features and the acoustic features further improves the classification performance. © 2022 Elsevier B.V. All rights reserved.
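
The four steps listed in the abstract can be illustrated with a minimal Python sketch. The abstract does not name the concrete techniques, so every choice below is an assumption made for illustration only: segment averaging stands in for the dimension reduction, uniform binning for the quantization, fixed-length n-grams with a support-gap threshold for the discriminative pattern mining, and occurrence counts for the per-emotion features. All function names and parameters (`n_segments`, `n_levels`, `min_gap`) are hypothetical and are not taken from the paper.

```python
from collections import Counter
from typing import Dict, List, Sequence, Set


def reduce_dimension(signal: Sequence[float], n_segments: int) -> List[float]:
    """Step 1 (assumed technique): replace each segment by its mean."""
    n = len(signal)
    if not 1 <= n_segments <= n:
        raise ValueError("n_segments must be between 1 and len(signal)")
    bounds = [i * n // n_segments for i in range(n_segments + 1)]
    return [sum(signal[a:b]) / (b - a) for a, b in zip(bounds, bounds[1:])]


def quantize(values: Sequence[float], n_levels: int, lo: float, hi: float) -> str:
    """Step 2 (assumed technique): uniform bins over [lo, hi], one letter per level."""
    if not 1 <= n_levels <= 26 or hi <= lo:
        raise ValueError("need 1..26 levels and hi > lo")
    width = (hi - lo) / n_levels
    symbols = []
    for v in values:
        level = min(max(int((v - lo) / width), 0), n_levels - 1)  # clamp outliers
        symbols.append(chr(ord("a") + level))
    return "".join(symbols)


def ngrams(seq: str, length: int) -> Set[str]:
    """Distinct contiguous substrings of the given length."""
    return {seq[i:i + length] for i in range(len(seq) - length + 1)}


def mine_patterns(sequences_by_emotion: Dict[str, List[str]],
                  length: int, min_gap: float) -> Dict[str, List[str]]:
    """Step 3 (assumed criterion): keep an n-gram for an emotion when its
    support there exceeds its best support in any other emotion by min_gap."""
    support: Dict[str, Dict[str, float]] = {}
    for emotion, seqs in sequences_by_emotion.items():
        counts = Counter(p for s in seqs for p in ngrams(s, length))
        support[emotion] = {p: c / len(seqs) for p, c in counts.items()}
    patterns: Dict[str, List[str]] = {}
    for emotion, sup in support.items():
        selected = []
        for p, s in sup.items():
            best_other = max((support[e].get(p, 0.0) for e in support if e != emotion),
                             default=0.0)
            if s - best_other >= min_gap:
                selected.append(p)
        patterns[emotion] = sorted(selected)
    return patterns


def pattern_features(seq: str, patterns: Dict[str, List[str]]) -> List[int]:
    """Step 4 (assumed encoding): occurrence count of each mined pattern,
    grouped by emotion in alphabetical order."""
    feats = []
    for emotion in sorted(patterns):
        for p in patterns[emotion]:
            hits = sum(1 for i in range(len(seq) - len(p) + 1) if seq[i:i + len(p)] == p)
            feats.append(hits)
    return feats
```

In use, each training utterance would pass through `reduce_dimension` and `quantize`, the resulting symbol strings would be grouped by emotion label and given to `mine_patterns`, and `pattern_features` would then turn any symbol string into a feature vector for a standard classifier. The paper's actual algorithms and parameter values may differ from this sketch.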

Keywords

Feature Extraction, Pattern Mining, Speech Emotion Recognition, Classification (of Information), Data Mining, Quantization (signal), Speech Recognition, User Interfaces, Acoustic Features, Emotion Representation, Emotional State, Functionals, Learn+, Low-to-high, Emotion Recognition

Fields of Science

0202 electrical engineering, electronic engineering, information engineering, 02 engineering and technology

OpenCitations Citation Count
1

Source

International Journal of Pattern Recognition and Artificial Intelligence

Volume

36

PlumX Metrics
Citations

Scopus : 1

Captures

Mendeley Readers : 1

OpenAlex FWCI
0.185
