Amplitude and Frequency Modulation in Speaker Recognition Systems

Ciota, Z.

Artykuł - szczegóły

Tytuł artykułu

Amplitude and Frequency Modulation in Speaker Recognition Systems

Autorzy

Ciota Z.

Treść / Zawartość

Pełne teksty:

Pobierz

Identyfikatory

Warianty tytułu

Języki publikacji

Abstrakty

The paper presents a review of the nowadays methods of voice vector extraction, applied in such speech processing, like person identification and emotion recognition. A special attention was held on mixed time-frequency analysis based on temporary frequency approach. The methods of calculation of time - frequency voice characterization were also described. The most important building blocks of identification and recognition of speakers have been presented. The characterization of feature vectors suitable for identification and verification in microcomputer systems was described. Components and appropriate method of speech identification based on the long-term spectra vectors were discussed.

Słowa kluczowe

AM-FM modulation Gabor filters Hilbert transform speaker recognition system spectrogram analysis speech processing

modulacja AM-FM filtr Gabora transformata Hilberta biometryka głosu spektrogram przetwarzanie mowy

Wydawca

Lodz University of Technology. Department of Microelectronics and Computer Science

Czasopismo

International Journal of Microelectronics and Computer Science

Rocznik

2012

Tom

Vol. 3, nr 2

Strony

41--44

Opis fizyczny

Bibliogr. 15 poz.

Twórcy

autor

Ciota Z.

Department of Microelectronics and Computer Science, Technical University of Lodz, Wolczanska 221/223, 90-924 Lodz, Poland, ciota@dmcs.pl

Bibliografia

[1] L. G. Kersta, “Voiceprint identification”, Nature , vol. 196, pp. 1253–257, 1962
[2] Saeed Gazor, Reza Rashidi Far, Adaptive Maximum Windowed Likelihood Multicomponent AM-FM Signal Decomposition , IEEE Transactions on Audio, Speech, and Language Processing , vol. 14, no. 2, March 2006
[3] M. H. Hayes, J. S. Lim & A. V. Oppenheim, “Signal Reconstruction from Phase or Magnitude”, IEEE Trans. Acous. Speech & Signal Proc ., vol. ASSP-28, No. 6, pp. 672-680
[4] Yasser Hifny, Steve Renals, Speech Recognition Using Augmented Conditional Random Fields, IEEE Transactions on Audio, Speech, and Language Processing , vol. 17, no. 2, February 2009
[5] T. P. Zieliński, Cyfrowe przetwarzanie sygnałów. Od teorii do zastosowań, Wydawnictwa Komunikacji i Łączności , Warszawa 2009
[6] Thomas Pellegrini, Lori Lamel, Automatic Word Decompounding for ASR in a Morphologically Rich Language: Application to Amharic, IEEE Transactions on Audio, Speech, and Language Processing , vol. 17, no. 5, July 2009
[7] D. V. Dimitriadis, P. Maragos, A. Potamianos, “Robust AM-FM features for speech recognition,” IEEE Signal Process. Letters , vol. 12, no. 9, Sepember 2005
[8] Ran D. Zilca, Brian Kingsbury, J. Navrátil, Ganesh N. Ramaswamy, Pseudo Pitch Synchronous Analysis of Speech With Applications to Speaker Recognition, IEEE Transactions on Audio, Speech, and Language Processing , vol. 14, no. 2, March 2006
[9] Mari Ostendorf et al., Speech Segmentation and Spoken Document Processing, IEEE Signal Processing Magazine, May 2008
[10] Z. Ciota: „Metody przetwarzania sygnałów akustycznych w komputerowej analizie mowy”. Akademicka Oficyna Wydawnicza EXIT , Warszawa 2010
[11] Marco Grimaldi, Fred Cummins, Speaker Identification Using Instantaneous Frequencies , IEEE Transactions on Audio, Speech, and Language Processing , vol. 16, no. 6, August 2008
[12] S. Grocholewski, Two Stage Speaker Verification System, Speech and Language Technology , vol.6, 2005, pp.45-56
[13] Z. Ciota: "Audio-Haptic Feedback in Speech Processing". The 6th IEEE Int. Workshop on Haptic, Audio and Visual Environments and Games - HAVE 2007, 12-14 October 2007, Ottawa, Canada
[14] T. R. van de Water, Otolaryngology: Basic Science and Clinical Review, Stuttgart, Thieme Publishing Group, 2005
[15] J. C. Stemple, L. E. Glaze & B. Klaben Gerdemann, Clinical Voice Pathology Theory and Management, 3r d Edition, New Jersey, Thomson Delmar Learning, 2000.

Typ dokumentu

Bibliografia

Identyfikator YADDA

bwmeta1.element.baztech-article-LODD-0002-0025