Multiclass audio segmentation based on recurrent neural networks for broadcast domain data

Pablo Gimeno; Ignacio Viñals; Alfonso Ortega; Antonio Miguel; Eduardo Lleida

doi:https://doi.org/10.1186/s13636-020-00172-6

doi.org/10.1186/s13636-020-00172-6

Multiclass audio segmentation based on recurrent neural networks for broadcast domain data

,

,

..., Eduardo Lleida

16

EURASIP Journal on Audio, Speech, and Music Processing2.40

Volume: 2020, Issue: 1

Published: Mar 5, 2020

Abstract

This paper presents a new approach based on recurrent neural networks (RNN) to the multiclass audio segmentation task whose goal is to classify an audio signal as speech, music, noise or a combination of these. The proposed system is based on the use of bidirectional long short-term Memory (BLSTM) networks to model temporal dependencies in the signal. The RNN is complemented by a resegmentation module, gaining long term stability by means of the...

Paper Fields

Paper Details

Title

Multiclass audio segmentation based on recurrent neural networks for broadcast domain data

DOI

doi.org/10.1186/s13636-020-00172-6

Published Date

Mar 5, 2020

Journal

EURASIP Journal on Audio, Speech, and Music Processing

Volume

2020

Issue

1

Citation AnalysisPro

You’ll need to upgrade your plan to Pro

Looking to understand the true influence of a researcher’s work across journals & affiliations?

Scinapse’s Top 10 Citation Journals & Affiliations graph reveals the quality and authenticity of citations received by a paper.
Discover whether citations have been inflated due to self-citations, or if citations include institutional bias.

Learn more

Notes

History