Multiclass audio segmentation based on recurrent neural networks for broadcast domain data

Volume: 2020, Issue: 1
Published: Mar 5, 2020
Abstract
This paper presents a new approach based on recurrent neural networks (RNN) to the multiclass audio segmentation task whose goal is to classify an audio signal as speech, music, noise or a combination of these. The proposed system is based on the use of bidirectional long short-term Memory (BLSTM) networks to model temporal dependencies in the signal. The RNN is complemented by a resegmentation module, gaining long term stability by means of the...
Paper Details
Title
Multiclass audio segmentation based on recurrent neural networks for broadcast domain data
Published Date
Mar 5, 2020
Volume
2020
Issue
1
Citation AnalysisPro
  • Scinapse’s Top 10 Citation Journals & Affiliations graph reveals the quality and authenticity of citations received by a paper.
  • Discover whether citations have been inflated due to self-citations, or if citations include institutional bias.