PL EN


Preferencje help
Widoczny [Schowaj] Abstrakt
Liczba wyników
Tytuł artykułu

Amplitude and Frequency Modulation in Speaker Recognition Systems

Autorzy
Treść / Zawartość
Identyfikatory
Warianty tytułu
Języki publikacji
EN
Abstrakty
EN
The paper presents a review of the nowadays methods of voice vector extraction, applied in such speech processing, like person identification and emotion recognition. A special attention was held on mixed time-frequency analysis based on temporary frequency approach. The methods of calculation of time - frequency voice characterization were also described. The most important building blocks of identification and recognition of speakers have been presented. The characterization of feature vectors suitable for identification and verification in microcomputer systems was described. Components and appropriate method of speech identification based on the long-term spectra vectors were discussed.
Twórcy
autor
  • Department of Microelectronics and Computer Science, Technical University of Lodz, Wolczanska 221/223, 90-924 Lodz, Poland, ciota@dmcs.pl
Bibliografia
  • [1] L. G. Kersta, “Voiceprint identification”, Nature , vol. 196, pp. 1253–257, 1962
  • [2] Saeed Gazor, Reza Rashidi Far, Adaptive Maximum Windowed Likelihood Multicomponent AM-FM Signal Decomposition , IEEE Transactions on Audio, Speech, and Language Processing , vol. 14, no. 2, March 2006
  • [3] M. H. Hayes, J. S. Lim & A. V. Oppenheim, “Signal Reconstruction from Phase or Magnitude”, IEEE Trans. Acous. Speech & Signal Proc ., vol. ASSP-28, No. 6, pp. 672-680
  • [4] Yasser Hifny, Steve Renals, Speech Recognition Using Augmented Conditional Random Fields, IEEE Transactions on Audio, Speech, and Language Processing , vol. 17, no. 2, February 2009
  • [5] T. P. Zieliński, Cyfrowe przetwarzanie sygnałów. Od teorii do zastosowań, Wydawnictwa Komunikacji i Łączności , Warszawa 2009
  • [6] Thomas Pellegrini, Lori Lamel, Automatic Word Decompounding for ASR in a Morphologically Rich Language: Application to Amharic, IEEE Transactions on Audio, Speech, and Language Processing , vol. 17, no. 5, July 2009
  • [7] D. V. Dimitriadis, P. Maragos, A. Potamianos, “Robust AM-FM features for speech recognition,” IEEE Signal Process. Letters , vol. 12, no. 9, Sepember 2005
  • [8] Ran D. Zilca, Brian Kingsbury, J. Navrátil, Ganesh N. Ramaswamy, Pseudo Pitch Synchronous Analysis of Speech With Applications to Speaker Recognition, IEEE Transactions on Audio, Speech, and Language Processing , vol. 14, no. 2, March 2006
  • [9] Mari Ostendorf et al., Speech Segmentation and Spoken Document Processing, IEEE Signal Processing Magazine, May 2008
  • [10] Z. Ciota: „Metody przetwarzania sygnałów akustycznych w komputerowej analizie mowy”. Akademicka Oficyna Wydawnicza EXIT , Warszawa 2010
  • [11] Marco Grimaldi, Fred Cummins, Speaker Identification Using Instantaneous Frequencies , IEEE Transactions on Audio, Speech, and Language Processing , vol. 16, no. 6, August 2008
  • [12] S. Grocholewski, Two Stage Speaker Verification System, Speech and Language Technology , vol.6, 2005, pp.45-56
  • [13] Z. Ciota: "Audio-Haptic Feedback in Speech Processing". The 6th IEEE Int. Workshop on Haptic, Audio and Visual Environments and Games - HAVE 2007, 12-14 October 2007, Ottawa, Canada
  • [14] T. R. van de Water, Otolaryngology: Basic Science and Clinical Review, Stuttgart, Thieme Publishing Group, 2005
  • [15] J. C. Stemple, L. E. Glaze & B. Klaben Gerdemann, Clinical Voice Pathology Theory and Management, 3r d Edition, New Jersey, Thomson Delmar Learning, 2000.
Typ dokumentu
Bibliografia
Identyfikator YADDA
bwmeta1.element.baztech-article-LODD-0002-0025
JavaScript jest wyłączony w Twojej przeglądarce internetowej. Włącz go, a następnie odśwież stronę, aby móc w pełni z niej korzystać.