Experimental speaker recognition investigations using head / torso simulator and telephone transmission

Cetnarowicz, D.; Drgas, S.; Dąbrowski, A.

Title variants

Eksperymentalne badania rozpoznawania mówcy z użyciem sztucznej głowy / torsu i transmisji telefonicznej

Abstracts

This paper presents a speaker recognition system and reports experiments with speech signals of telephone quality. First, the speech signals are transmitted over a regular GSM or analog telephone system. The recorded signals are then used as input to a speaker recognition system based on Gaussian mixture models (GMM). The results suggest that the parameters of mel-frequency cepstral coefficient (MFCC) extraction should be tailored to the signal quality.

The article presents experiments with a speaker recognition system operating on speech signals of telephone quality. First, the speech signal was transmitted through a real telephone channel that included both a GSM codec and the analog standard. The resulting signal was recorded and used to test speaker recognition based on a linear Gaussian mixture model. The results indicate that the parameters used to compute the MFCC coefficients should be matched to the signal quality.
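
The abstracts do not say which MFCC parameters were adjusted. As a purely illustrative sketch, one parameter that is commonly adapted for telephone speech is the frequency range covered by the mel filterbank. The band limits (300-3400 Hz), the filter count (24), the function names, and the mel formula used here are assumptions for illustration and are not taken from the paper.

```python
import math


def hz_to_mel(f_hz):
    """Convert frequency in Hz to mels (common 2595*log10(1 + f/700) variant)."""
    return 2595.0 * math.log10(1.0 + f_hz / 700.0)


def mel_to_hz(m):
    """Inverse of hz_to_mel."""
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)


def mel_filter_edges(f_low_hz, f_high_hz, n_filters):
    """Return n_filters + 2 edge frequencies in Hz, equally spaced on the mel scale.

    Triangular filter k would span edges[k] .. edges[k + 2] with its peak
    at edges[k + 1].
    """
    m_low = hz_to_mel(f_low_hz)
    m_high = hz_to_mel(f_high_hz)
    step = (m_high - m_low) / (n_filters + 1)
    return [mel_to_hz(m_low + i * step) for i in range(n_filters + 2)]


# Hypothetical settings: a wideband configuration versus one restricted
# to an assumed telephone band, so that no filter covers frequencies
# the channel does not pass.
wideband_edges = mel_filter_edges(0.0, 8000.0, 24)
telephone_edges = mel_filter_edges(300.0, 3400.0, 24)
```

With the telephone-band settings all filters fall inside the assumed passband, whereas the wideband configuration places part of its filters in regions that a narrowband channel would attenuate.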

Keywords

speaker recognition, GMM, MFCC, GSM

Publisher

Wydawnictwo SIGMA-NOT

Journal

Elektronika : konstrukcje, technologie, zastosowania

Year

2011

Volume

Vol. 52, no. 5

Pages

94-97

Physical description

Bibliography: 7 items, illustrations, charts.

Creators

author

Cetnarowicz D.

author

Drgas S.

author

Dąbrowski A.

Poznań University of Technology (Politechnika Poznańska), Chair of Control and Systems Engineering (Katedra Sterowania i Inżynierii Systemów)

References

[1] Keshet J., Bengio S. (eds.): Automatic Speech and Speaker Recognition: Large Margin and Kernel Methods. John Wiley & Sons, Ltd., 2009.
[2] Dąbrowski A., Drgas S., Cetnarowicz D., Chmielewska I.: Gaussian Mixture Model speaker recognition system experiments with CORPORA database. Presented at SIGNAL PROCESSING SPA '2007, Poland Section, Chapter Circuits and Systems IEEE, Poznań, Poland, 2007.
[3] Grocholewski S.: CORPORA - speech database for Polish diphones. Presented at EUROSPEECH '97, Rhodes, Greece, 1997.
[4] Reynolds D. A., Rose R. C.: Robust text-independent speaker identification using Gaussian mixture speaker models. IEEE Transactions on Speech and Audio Processing, vol. 3, pp. 72-83, 1995.
[5] Nabney I. T.: Netlab: Algorithms for pattern recognition. Springer, 2002.
[6] Reynolds D. A.: The effects of handset variability on speaker recognition performance: experiments on the Switchboard corpus. Presented at IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP-96, 1996.
[7] Drgas S., Cetnarowicz D., Dąbrowski A.: Speaker verification based on prosodic features. Presented at SIGNAL PROCESSING SPA '2008, Poland Section, Chapter Circuits and Systems IEEE, Poznań, Poland, 2008.

Document type

Bibliography

YADDA identifier

bwmeta1.element.baztech-article-BWAK-0024-0017