• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
成果搜索

author:

Huang, X. (Huang, X..) [1] | Zheng, Q. (Zheng, Q..) [2] | Zhang, Y. (Zhang, Y..) [3] | Cheng, D. (Cheng, D..) [4] | Liu, Y. (Liu, Y..) [5] | Dong, C. (Dong, C..) [6] (Scholars:董晨)

Indexed by:

EI Scopus

Abstract:

Emotion is an essential aspect of human life, and effectively identifying corresponding emotions from different scenarios will help promote the development of human-computer interaction systems.Therefore, emotion classification has gradually become a challenging and popular research field. Compared with text emotion analysis, emotion analysis of audio data is still relatively immature.Traditional audio sentiment analysis research is based on feature information such as MFCC, MFSC, etc. while using time-memory models such as LSTM and RNN for emotion analysis.Due to the rapid development of transformers and attention mechanisms, many scholars have shifted their research from the RNN family to the transformer family or deep learning models with attention mechanisms.Therefore, this paper proposes a method to convert audio data into a spectrogram and use a vision transformer model based on transfer learning for emotion classification.This paper conducts experiments on the IEMOCAP dataset and the MELD dataset. The experimental results show that the emotion classification accuracy of the Vision transformer in the IEMOCAP and the MELD datasets reach 56.18% and 37.1%. © 2023 SPIE.

Keyword:

Attention Mechanism Emotion Analysis IEMOCAP MELD Vision Transformer

Community:

  • [ 1 ] [Huang X.]College of Computer and Data Science, College of Software, Fuzhou University, Fuzhou, 350108, China
  • [ 2 ] [Zheng Q.]College of Computer and Data Science, College of Software, Fuzhou University, Fuzhou, 350108, China
  • [ 3 ] [Zhang Y.]College of Computer and Data Science, College of Software, Fuzhou University, Fuzhou, 350108, China
  • [ 4 ] [Cheng D.]College of Computer and Data Science, College of Software, Fuzhou University, Fuzhou, 350108, China
  • [ 5 ] [Liu Y.]College of Computer and Data Science, College of Software, Fuzhou University, Fuzhou, 350108, China
  • [ 6 ] [Dong C.]College of Computer and Data Science, College of Software, Fuzhou University, Fuzhou, 350108, China

Reprint 's Address:

Email:

Show more details

Related Keywords:

Source :

ISSN: 0277-786X

Year: 2023

Volume: 12605

Language: English

Cited Count:

WoS CC Cited Count:

SCOPUS Cited Count:

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 0

Online/Total:290/10000157
Address:FZU Library(No.2 Xuyuan Road, Fuzhou, Fujian, PRC Post Code:350116) Contact Us:0591-22865326
Copyright:FZU Library Technical Support:Beijing Aegean Software Co., Ltd. 闽ICP备05005463号-1