Lip-reading with discriminative deformable models

Nowak, H.

Artykuł - szczegóły

Tytuł artykułu

Lip-reading with discriminative deformable models

Autorzy

Nowak H.

Wybrane pełne teksty z tego czasopisma

https://mgv.sggw.edu.pl/

Identyfikatory

Warianty tytułu

Konferencja

International Conference on Computer Vision and Graphics ICCVG 2006 (25-27.09.2006 ; Warsaw, Poland)

Języki publikacji

Abstrakty

The following paper describes a novel lip-reading method developed for the purpose of isolated word recognition. The method is based on a concept of a discriminative deformable model, which represents an image analysis method derived from the deformable grid paradigm. The discriminative deformable model is used to characterize the lip shape at each frame of the video sequence. The information extracted from the consecutive frames is next analyzed using the Hidden Markov Models. The proposed visual speech recognition method is tested using the Polish digits recognition task.

Słowa kluczowe

lip-reading deformable grid speech recognition

Wydawca

Institute of Information Technology of the Warsaw University of Life Sciences – SGGW

Czasopismo

Machine Graphics and Vision

Rocznik

2006

Tom

Vol. 15, No. 3/4

Strony

567--575

Opis fizyczny

Bibliogr. 15 poz., il., tab., wykr.

Twórcy

autor

Nowak H.

Institute of Electronics, Technical University of Lodz, Wolczanska 211/215 Str., 90-924 Lodz, Poland, nowakhub@p.lodz.pl

Bibliografia

[1] Otsu N.: A threshold selection method from gray level Histograms. IEEE Trans. Systems, Man and Cybernetics, pp. 62-66, 1979.
[2] Deller J. R., Proakis J. G., Hansen J. H.: Discrete-time processing of speech signals, Prentice Hall, NJ, 1987.
[3] Petajan E. D.: Automatic lip-reading to enhance speech recognition, Proc. IEEE Communications Society Global Telecom Conf. Atlanta, 1984.
[4] Rabiner L. R.: A tutorial on hidden Markov Models and selected applications in speech recognition. Proc. of the IEEE, Vol. 77, No.2, pp. 257-286, 1989.
[5] Mase K., Pentland A.: Automatic lipreadingby optical-flow analysis, System and Computers in Japan, 22(6), pp.67-76, 1991.
[6] Leymarie F., Levine M. D.: Simulating the grassfire transform using an active contour model, IEEE Trans. on PAMI, Vol. 14, No. l, January, pp. 56-75, 1992.
[7] Duchnowski P., Hunke M.: Toward movement-invariant automatic lip-reading and speech recognition ICCASP, 1995.
[8] Sobottka K., Pitas I.: Looking for faces and facial features in color images. Pattern Recognition and Image Analysis, Advances in Mathematical Theory and Applications, Russian Academy of Sciences, 1996.
[9] Luettin J.: Towards speaker independent continuous speechreading. EUROSPEECH-1997, pp. 1991-1994, 1997.
[10] Dupont S., Luettin J.: Audio-visual speech modeling for continuous speech recognition. IEEE Transaction on Multimedia, Vol.2, No 3, 2000.
[11] Szczypiński P., Materka A.: Object tracking and recognition using deformable grid with geometrical template's. Int. Conf. on Signals and Electronic Systems, Poland, pp. 169-174, 2000.
[12] Szczypiński P.: Deformable models for quantitative analysis and object recognition in digital images, doctoral thesis, Technical Univeristy of Lodz, 2001.
[13] Nowak H., Ślot K.: Object classification with intermediate deformable models, Proc. of ECCTD, pp.240-243 Kraków, 2003.
[14] Foo S. W., Lian Y., Dong L.: Recognition of visual speech elements using adaptively boosted hidden Markov models. IEEE Trans. on Circuits and Systems for Video Technology, 2004.
[15] Kubanek M.: The method of audio-visual Polish Speech Recognition Based on Hidden Markov Models, doctoral thesis, Częstochowa University of Technology, 2005.

Typ dokumentu

Bibliografia

Identyfikator YADDA

bwmeta1.element.baztech-article-BWA1-0026-0036