Department of Business Development and Technology

Ramjee Prasad

Multistyle Training and Fusion for Speaker Identification of Disguised Voice

Publication: Contribution to book/anthology/report/proceeding; conference contribution in proceedings; research; peer-reviewed

Speaker identification research faces challenges due to mismatched training and test conditions, which arise from several factors. Non-electronic voice disguise is one such factor and is commonly seen in crimes. This paper presents a study of the effect of three different types of voice disguise, taken from the CHAINS speech corpus, on speaker identification accuracy. Of the three voice disguises, two are variants of the imitative style, namely synchronous and repetitive synchronous imitation, and one is the fast speaking style. Different variants of multistyle training for increasing speaker identification accuracy are investigated. The manner in which speech examples from the different speaking styles are used for multistyle training plays an important role in speaker identification accuracy. Further, a decision-level fusion of two multistyle training schemes is proposed. Experimental results show that the fused multistyle training performs better overall and more stably across the different voice disguises than single-style training and the investigated multistyle training variants.
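The decision-level fusion mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not state the fusion rule, so everything below is an assumption made for illustration: each of two multistyle-trained systems is taken to output one score per enrolled speaker (for example a log-likelihood), the scores are z-normalized per system, and the fused decision is the speaker with the highest weighted sum. The names `zscore`, `fuse_decisions` and `weight_a` are hypothetical and do not come from the paper.

```python
from typing import Dict


def zscore(scores: Dict[str, float]) -> Dict[str, float]:
    """Normalize one system's per-speaker scores to zero mean and unit variance."""
    values = list(scores.values())
    mean = sum(values) / len(values)
    var = sum((v - mean) ** 2 for v in values) / len(values)
    std = var ** 0.5 or 1.0  # fall back to 1.0 when all scores are identical
    return {spk: (v - mean) / std for spk, v in scores.items()}


def fuse_decisions(
    scores_a: Dict[str, float],
    scores_b: Dict[str, float],
    weight_a: float = 0.5,
) -> str:
    """Return the speaker chosen by a weighted sum of two systems' scores.

    Assumption: both dictionaries cover the same set of enrolled speakers.
    The weighted-sum rule is an illustrative choice, not the paper's method.
    """
    norm_a, norm_b = zscore(scores_a), zscore(scores_b)
    fused = {
        spk: weight_a * norm_a[spk] + (1.0 - weight_a) * norm_b[spk]
        for spk in norm_a
    }
    return max(fused, key=fused.get)


# Hypothetical scores from two multistyle-trained systems for one test utterance.
system_a = {"s1": -10.0, "s2": -12.0, "s3": -20.0}
system_b = {"s1": -15.0, "s2": -11.0, "s3": -19.0}
print(fuse_decisions(system_a, system_b))
```

Normalizing before summing keeps one system from dominating merely because its scores span a wider numeric range; setting `weight_a` to 1.0 or 0.0 reduces the fused decision to that of a single system.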
Original language: English
Title: ICCo5-2013 Conference Proceedings
Number of pages: 6
Publisher: ICCo5
Publication date: Dec. 2013
Status: Published - Dec. 2013
Externally published: Yes
Event: The First International Conference on Communications, Connectivity, Convergence, Content and Cooperation (IC5) - Mumbai, India
Duration: 16 Dec. 2013 to 19 Dec. 2013

Conference

Conference: The First International Conference on Communications, Connectivity, Convergence, Content and Cooperation (IC5)
Country: India
City: Mumbai
Period: 16/12/2013 to 19/12/2013

ID: 171390633