Tytuł artykułu
Autorzy
Wybrane pełne teksty z tego czasopisma
Identyfikatory
Warianty tytułu
Języki publikacji
Abstrakty
This paper describes the development of the LVCSR (Large Vocabulary Continuous Speech Recognition) system for Polish, using the Janus system developed at the University Karlsruhe/Carnegie Mellon University. The system has been tested on the selected material from the SpeeCon database. Test results for sentences read by 16 speakers are given. The system shows good performance and can be used as a basis for further development of modern speech recognition technology for Polish.
Słowa kluczowe
Wydawca
Czasopismo
Rocznik
Tom
Strony
119--126
Opis fizyczny
Bibliogr. 12 poz., tab.
Twórcy
autor
- Polish-Japanese Institute of Information Technology, Koszykowa 86, 02-008 Warszawa, Poland, kmarasek@pjwstk.edu.pl
Bibliografia
- [1] DE MORI, Spoken dialogues with computers: Signal processing and its applications, Academic Press, Hardcover, 1998, ISBN 0122090551.
- [2] FINKE M., GEUTNER P., HILD H., KEMP T., RIES K., WESTPHAL M., The Karlsruhe. VERBMOBIL speech recognition engine, Proceedings of the IEEE ASSP Conf., vol. 1, pp. 83.86 Munich 1997.
- [3] HUANG, ACERO, HON, Spoken language processing, Prentice Hall, 2001, ISBN 0130226165.
- [4] MATSOUKAS S., SCHWARTZ R., JIN H., NGUYEN L., Practical implementations of speakeradaptive training, proceedings of the 1997 DARPA speech recognition workshop, Chantilly, Virginia, USA, February 2.5, 1997.
- [5] SOLTAU H., METZE F., FÜGEN CH., WAIBEL A., A one pass-decoder based on polymorphic linguistic context assignment, Proceedings of the ASRU Workshop, Madonna di Campiglio, Italy, pp. 214.217, 2001.
- [6] FRITSCH J., ROGINA I., The bucket box intersection (BBI) algorithm for fast approximative evaluation of diagonal mixture Gaussians, Proceedings of ICASSP 96, Atlanta, Vol. 2, pp. 837.840, 1996.
- [7] ROACH P. et al., Babel: An eastern european multi-language database, proceedings of ICSLP-96, Philadelphia, pp. 1982.1986, 1996.
- [8] MARASEK K., Large vocabulary continuous speech recognition system for Polish, Archives of Acoustics, 28, 4, 293.303 (2003).
- [9] MARASEK K., GUBRYNOWICZ R., Mutlilevel annotation in SpeeCon Polish speech database, IMTCI (International Workshop on Intelligent Media Technology for Communicative Inteligence), Warszawa, Lecture Notes in Computer Science, Springer, pp. 58.67, 2004.
- [10] IslSystem Documentation, Example Training/Testing Setup for Use with JRTk, The Ibis-gang, 22. November 2002, v0.1, Interactive Systems Labs, University of Karlsruhe, Germany, 2002.
- [11] STOLCKE A., SRILM . An extensible language modeling toolkit, Proc. of ICSLP 2002, Denver, Colorado, pp. 901.904, 2002.
- [12] WOODLAND P. C., ODELL J. J., VALTCHEV V., YOUNG S. J., Large vocabulary continuous speech recognition using HTK, Proceedings ICASSP'94, Adelaide, pp. 125.128, 1994.
Typ dokumentu
Bibliografia
Identyfikator YADDA
bwmeta1.element.baztech-article-BAT8-0003-0066