Detection of activity and position of speakers by using deep neural networks and acoustic data augmentation

Volume: 134, Pages: 53 - 65
Published: Nov 1, 2019
Abstract
The task of Speaker LOCalization (SLOC) has been the focus of numerous works in the research field, in which SLOC is performed on pure speech data and therefore requires an Oracle Voice Activity Detection (VAD) algorithm. Nevertheless, this ideal working condition is not satisfied in a real-world scenario, where the VADs employed do commit errors. This work addresses this issue with an extensive analysis focusing on the relationship between...