Match!
Peter Birkholz
Dresden University of Technology
Speech synthesisSpeech recognitionComputer scienceSpeech productionVocal tract
77Publications
12H-index
549Citations
What is this?
Publications 82
Newest
#1Christian Thiele (TUD: Dresden University of Technology)
#2Christine Mooshammer (Humboldt University of Berlin)H-Index: 13
Last. Peter Birkholz (TUD: Dresden University of Technology)H-Index: 12
view all 5 authors...
Abstract Loops are well known as the elliptical tongue fleshpoint paths in vowel-consonant-vowel (VCV) sequences with a velar consonant. However, the discussion about the relative influences of aerodynamics, active motor control and biomechanics on loops is controversial. In order to characterize the biomechanical influence, the design of the current study was specifically based on V1-V2-V1 sequences with no consonants involved, recorded by means of electromagnetic articulography (EMA). In this ...
Source
#1Thuanvan Ngo (Japan Advanced Institute of Science and Technology)
#2Masato Akagi (Japan Advanced Institute of Science and Technology)H-Index: 15
Last. Peter Birkholz (TUD: Dresden University of Technology)H-Index: 12
view all 3 authors...
Abstract In noisy conditions, speakers involuntarily change their manner of speaking to enhance the intelligibility of their voices. The increased intelligibility of this so-called Lombard speech is enabled by the change of multiple articulatory and acoustic features. While the major features of Lombard speech are well known from previous studies, little is known about their relative contributions to the intelligibility of speech in noise. This study used an analysis-by-synthesis strategy to exp...
Source
#1Lisa Bartha-Doering (Medical University of Vienna)H-Index: 3
#2Peter Birkholz (TUD: Dresden University of Technology)H-Index: 12
Last. Jürgen WendlerH-Index: 2
view all 22 authors...
This historical review covers the scientific background of the discipline as reflected in documents from antiquity, the Renaissance, the seventeenth to the nineteenth centuries and the twentieth century up to present times.
Source
Sep 15, 2019 in INTERSPEECH (Conference of the International Speech Communication Association)
#1Peter Birkholz (TUD: Dresden University of Technology)H-Index: 12
#2Susanne DrechselH-Index: 1
Last. Simon Stone (TUD: Dresden University of Technology)H-Index: 2
view all 3 authors...
1 CitationsSource
Sep 15, 2019 in INTERSPEECH (Conference of the International Speech Communication Association)
#1Yingming Gao (TUD: Dresden University of Technology)
#2Simon Stone (TUD: Dresden University of Technology)H-Index: 2
Last. Peter Birkholz (TUD: Dresden University of Technology)H-Index: 12
view all 3 authors...
Source
#1Teruki Toya (Japan Advanced Institute of Science and Technology)H-Index: 1
#2Peter Birkholz (TUD: Dresden University of Technology)H-Index: 12
Last. Masashi Unoki (Japan Advanced Institute of Science and Technology)H-Index: 12
view all 3 authors...
Beacause transmission characteristics of bone-conducted (BC) speech from the larynx to auditory systems have not yet been clarified, this paper investigates the transmission characteristics related to the BC speech perception focusing on temporal bone (TB) vibration signals and ear canal (EC) radiated speech signals. First, long-term average spectra (LTAS) of the normally produced speech signals recorded at the lips, TB and EC were analyzed. It was found that the frequency components above 2 kHz...
Source
#2Peter BirkholzH-Index: 12
view all 2 authors...
Source
#1Peter Birkholz (TUD: Dresden University of Technology)H-Index: 12
#2Falk Gabriel (TUD: Dresden University of Technology)
Last. Matthias Echternach (LMU: Ludwig Maximilian University of Munich)H-Index: 1
view all 4 authors...
The estimation of formant frequencies from acoustic speech signals is mostly based on Linear Predictive Coding (LPC) algorithms. Since LPC is based on the source-filter model of speech production, the formant frequencies obtained are often implicitly regarded as those for an infinite glottal impedance, i.e., a closed glottis. However, previous studies have indicated that LPC-based formant estimates of vowels generated with a realistically varying glottal area may substantially differ from the re...
Source
#1Peter Birkholz (TUD: Dresden University of Technology)H-Index: 12
#2Daniel Pape (McMaster University)H-Index: 8
Abstract Self-oscillating bar-mass models of the vocal folds are frequently used as the voice source in articulatory speech synthesis. For these models, a number of ways to handle the entrance loss and the flow separation in the glottis have been proposed. However, the effect of different modeling choices on vocal fold oscillation and glottal flow, and on the quality of synthesized speech has been rarely examined. In this study, a modified two-mass model of the vocal folds was used to simulate p...
Source
#2Thomas FangmeierH-Index: 13
Last. Andreas RiedelH-Index: 9
view all 8 authors...
Source
123456789