Article title

Lip-reading with discriminative deformable models

Authors
Selected full texts from this journal
Identifiers
Title variants
Conference
International Conference on Computer Vision and Graphics ICCVG 2006 (25-27.09.2006 ; Warsaw, Poland)
Publication languages
EN
Abstracts
EN
This paper describes a novel lip-reading method developed for isolated word recognition. The method is based on the concept of a discriminative deformable model, an image analysis technique derived from the deformable grid paradigm. The discriminative deformable model is used to characterize the lip shape in each frame of the video sequence. The information extracted from consecutive frames is then analyzed using Hidden Markov Models. The proposed visual speech recognition method is tested on a Polish digit recognition task.
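The recognition stage summarized in the abstract (per-frame lip-shape information scored by Hidden Markov Models, one word at a time) can be illustrated with a minimal sketch. It assumes, purely for illustration, that each frame has already been quantized to a discrete symbol and that every word has its own discrete HMM; the record does not state the paper's feature representation, HMM topology, or training procedure, and the names `forward_log_likelihood`, `recognize`, and the toy models below are hypothetical.

```python
import numpy as np


def forward_log_likelihood(obs, start, trans, emit):
    """Scaled forward algorithm: log P(obs | model) for a discrete HMM.

    obs   : sequence of integer symbols (assumed: one symbol per video frame)
    start : (N,) initial state probabilities
    trans : (N, N) state transition probabilities
    emit  : (N, M) symbol emission probabilities
    """
    log_lik = 0.0
    alpha = start * emit[:, obs[0]]
    for t in range(len(obs)):
        if t > 0:
            alpha = (alpha @ trans) * emit[:, obs[t]]
        scale = alpha.sum()
        if scale == 0.0:
            return -np.inf  # the model cannot produce this sequence
        log_lik += np.log(scale)
        alpha = alpha / scale  # rescale to avoid numerical underflow
    return log_lik


def recognize(obs, models):
    """Isolated word recognition: pick the word whose HMM scores obs highest."""
    return max(models, key=lambda w: forward_log_likelihood(obs, *models[w]))


# Toy left-to-right models for two Polish digits (all values are made up).
start = np.array([1.0, 0.0])
trans = np.array([[0.6, 0.4],
                  [0.0, 1.0]])
models = {
    "jeden": (start, trans, np.array([[0.8, 0.1, 0.1],
                                      [0.1, 0.8, 0.1]])),
    "dwa": (start, trans, np.array([[0.1, 0.1, 0.8],
                                    [0.8, 0.1, 0.1]])),
}

print(recognize([0, 0, 1, 1], models))
print(recognize([2, 2, 0, 0], models))
```

In this sketch each word model is scored independently and the best-scoring word is returned, which is the usual decision rule for isolated word recognition with HMMs; continuous features would need Gaussian or mixture emissions instead of the discrete table assumed here.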
Keywords
Year
Pages
567--575
Physical description
Bibliography: 15 items, illustrations, tables, charts.
Contributors
author
  • Institute of Electronics, Technical University of Lodz, Wolczanska 211/215 Str., 90-924 Lodz, Poland, nowakhub@p.lodz.pl
Bibliography
  • [1] Otsu N.: A threshold selection method from gray-level histograms. IEEE Trans. Systems, Man and Cybernetics, pp. 62-66, 1979.
  • [2] Deller J. R., Proakis J. G., Hansen J. H.: Discrete-time processing of speech signals, Prentice Hall, NJ, 1987.
  • [3] Petajan E. D.: Automatic lip-reading to enhance speech recognition, Proc. IEEE Communications Society Global Telecom Conf., Atlanta, 1984.
  • [4] Rabiner L. R.: A tutorial on hidden Markov models and selected applications in speech recognition. Proc. of the IEEE, Vol. 77, No. 2, pp. 257-286, 1989.
  • [5] Mase K., Pentland A.: Automatic lipreading by optical-flow analysis, Systems and Computers in Japan, 22(6), pp. 67-76, 1991.
  • [6] Leymarie F., Levine M. D.: Simulating the grassfire transform using an active contour model, IEEE Trans. on PAMI, Vol. 14, No. 1, January, pp. 56-75, 1992.
  • [7] Duchnowski P., Hunke M.: Toward movement-invariant automatic lip-reading and speech recognition, ICASSP, 1995.
  • [8] Sobottka K., Pitas I.: Looking for faces and facial features in color images. Pattern Recognition and Image Analysis, Advances in Mathematical Theory and Applications, Russian Academy of Sciences, 1996.
  • [9] Luettin J.: Towards speaker independent continuous speechreading. EUROSPEECH-1997, pp. 1991-1994, 1997.
  • [10] Dupont S., Luettin J.: Audio-visual speech modeling for continuous speech recognition. IEEE Transactions on Multimedia, Vol. 2, No. 3, 2000.
  • [11] Szczypiński P., Materka A.: Object tracking and recognition using deformable grid with geometrical templates. Int. Conf. on Signals and Electronic Systems, Poland, pp. 169-174, 2000.
  • [12] Szczypiński P.: Deformable models for quantitative analysis and object recognition in digital images, doctoral thesis, Technical University of Lodz, 2001.
  • [13] Nowak H., Ślot K.: Object classification with intermediate deformable models, Proc. of ECCTD, pp. 240-243, Kraków, 2003.
  • [14] Foo S. W., Lian Y., Dong L.: Recognition of visual speech elements using adaptively boosted hidden Markov models. IEEE Trans. on Circuits and Systems for Video Technology, 2004.
  • [15] Kubanek M.: The method of audio-visual Polish speech recognition based on hidden Markov models, doctoral thesis, Częstochowa University of Technology, 2005.
Document type
Bibliography
YADDA identifier
bwmeta1.element.baztech-article-BWA1-0026-0036