Speech Emotion Recognition Using 3D Convolutions and Attention-Based Sliding Recurrent Networks With Auditory Front-Ends

Zhichao Peng; Xingfeng Li; Zhi Zhu; Masashi Unoki; Jianwu Dang; Masato Akagi

DOI: https://doi.org/10.1109/access.2020.2967791

IEEE Access

Volume: 8, Pages: 16560 - 16572

Published: Jan 1, 2020

Abstract

Emotion information from speech can effectively help robots understand a speaker's intentions in natural human-robot interaction. The human auditory system can easily track the temporal dynamics of emotion by perceiving the intensity and fundamental frequency of speech, and can focus on salient emotion regions. Therefore, combining speech emotion recognition with auditory and attention mechanisms may be an effective approach. Some previous...
