Article title

System dialogowy języka mówionego: przegląd problemów [Spoken language dialogue system: an overview of problems]

Title variants
EN
Spoken language dialogue system
Publication languages
PL
Abstracts
PL
The structure of a spoken language dialogue system is presented. The desired properties of the system's functional components are characterized: the speech recognizer, the language processor, the dialogue controller (manager), and the speech synthesizer. Example implementations of spoken language dialogue systems are characterized.
EN
This paper describes the structure of a spoken language dialogue system and the underlying human language technologies: automatic speech recognition, natural language understanding, dialogue management, and speech synthesis. It also presents recent progress in spoken dialogue systems and some of the ongoing research challenges.
Authors
  • Instytut Teleinformatyki i Automatyki, Wojskowa Akademia Techniczna, ul. gen. S. Kaliskiego 2, 00–908 Warszawa, awis@ita.wat.edu.pl
References
  • [1] BARNARD E., HALBERSTADT A., KOTELLY C., PHILLIPS M.: A Consistent Approach to Designing Spoken-Dialog Systems, Proc. ASRU Workshop, Keystone, CO, 1999.
  • [2] BEUTNAGEL M., CONKIE A., SCHROETER J., STYLIANOU Y., SYRDAL A.: The AT&T Next-Gen TTS System, Proc. ASA, Berlin, 1999.
  • [3] BILLI R., CAVANESIO R., RULLENT C.: Automation of Telecom Italia Directory Assistance Service: Field Trial Results, Proc. IVTTA, 1998.
  • [4] BOBROW R., INGRIA R., STALLARD D.: Syntactic and Semantic Knowledge in the DELPHI Unification Grammar, Proc. DARPA Speech and Natural Language Workshop, 1990.
  • [5] BOVES L., OS E.: Applications of Speech Technology: Designing for Usability, Proc. IEEE Workshop on ASR and Understanding, 1999.
  • [6] COHEN P., JOHNSON M., MCGEE D., OVIATT S., CLOW J., SMITH I.: The Efficiency of Multimodal Interaction: A Case Study, Proc. ICSLP, 1998.
  • [7] COLE R.A., MARIANI J., USZKOREIT H., ZAENEN A., ZUE V.W. (Editorial Board), VARILE G., ZAMPOLLI A. (Managing Editors): Survey of the State of the Art in Human Language Technology, 1996. URL: http://www.cse.ogi.edu/CSLU/HLTsurvey/.
  • [8] DAHL D.: Practical Spoken Dialog Systems, 2005.
  • [9] DOWDING J., GAWRON J., APPELT D., BEAR J., CHERNY L., MOORE R., MORAN D.: GEMINI: A Natural Language System for Spoken Language Understanding, Proc. ARPA Workshop on Human Language Technology, 1993.
  • [10] FLAMMIA G.: Discourse Segmentation of Spoken Dialogue: An Empirical Approach, Ph.D. Thesis, MIT, 1998.
  • [11] FANT G., LILJENCRANTS J., LIN Q.: A Four-parameter Model of Glottal Flow, STL-QPSR, 4, 1985.
  • [12] FANT G.: The LF-model Revisited. Transform and Frequency Domain Analysis. STL-QPSR, 2-3, 1995.
  • [13] GLASS J., FLAMMIA G., GOODINE D., PHILLIPS M., POLIFRONI J., SAKAI S., SENEFF S., ZUE V.: Multilingual Spoken-Language Understanding in the MIT Voyager System, Speech Communication, 17, 1995.
  • [14] GODDEAU D.: Using Probabilistic Shift-Reduce Parsing in Speech Recognition Systems, Proc. ICSLP, 1992.
  • [15] GORIN A., RICCARDI G., WRIGHT J.: How may I help you?, Speech Communication, 23, 1997.
  • [16] HETHERINGTON L., ZUE V.: New words: Implications for Continuous Speech Recognition, Proc. Eurospeech, 1991.
  • [17] LIPPMANN R.P.: Speech Perception by Humans and Machines, Speech Communication, 22(1), 1997.
  • [18] MCDONALD D., BOLC L. (Eds.): Natural Language Generation Systems (Symbolic Computation Artificial Intelligence), Springer Verlag, Berlin, 1998.
  • [19] MILLER S., SCHWARTZ R., BOBROW R., INGRIA R.: Statistical Language Processing Using Hidden Understanding Models, Proc. ARPA Speech and Natural Language Workshop, 1994.
  • [20] MOORE R., APPELT D., DOWDING J., GAWRON J., MORAN D.: Combining Linguistic and Statistical Knowledge Sources in Natural-Language Processing for ATIS, Proc. ARPA Spoken Language Systems Workshop, 1995.
  • [21] Nuance Communications, http://www.nuance.com
  • [22] OH A.: Stochastic Natural Language Generation for Spoken Dialog Systems, M.S. Thesis, CMU, May 2000.
  • [23] OS E., BOVES L., LAMEL L., BAGGIA P.: Overview of the ARISE project, Proc. Eurospeech, 1999.
  • [24] PAO C., SCHMID P., GLASS J.: Confidence Scoring for Speech Understanding Systems, Proc. ICSLP, 1998.
  • [25] PECKHAM J.: A New Generation of Spoken Dialogue Systems: Results and Lessons from the SUNDIAL Project, Proc. Eurospeech, 1993.
  • [26] PRICE P.: Evaluation of Spoken Language Systems: the ATIS Domain, Proc. DARPA Speech and Natural Language Workshop, 1990.
  • [27] RABINER L., JUANG B.-H.: Fundamentals of speech recognition, 1993.
  • [28] REITER E., DALE R.: Building Natural Language Generation Systems, Cambridge University Press, Cambridge, 2000.
  • [29] ROSENBERG A.E.: Effect of Glottal Pulse Shape on the Quality of Natural Vowels. Journal of The Acoustical Society of America vol. 49, 1970.
  • [30] ROSSET S., BENNACEF S., LAMEL L.: Design Strategies for Spoken Language Dialog Systems, Proc. Eurospeech, 1999.
  • [31] SENEFF S.: TINA, A natural language system for spoken language applications, Computational Linguistics, 18(1), 1992.
  • [32] SENEFF S., GODDEAU D., PAO C., POLIFRONI J.: Multimodal discourse modelling in a multi-user multi-domain environment, Proc. ICSLP, 1996.
  • [33] SENEFF S., LAU R., POLIFRONI J.: Organization, Communication, and Control in the Galaxy-II Conversational System, Proc. Eurospeech, 1999.
  • [34] SENEFF S.: Robust Parsing for Spoken Language Systems, Proc. ICASSP, 1992.
  • [35] SOUVIGNIER V., KELLNER A., RUEBER B., SCHRAMM H., SEIDE F.: The Thoughtful Elephant: Strategies for Spoken Dialogue Systems, IEEE Trans. SAP, 8(1), 2000.
  • [36] STALLARD D., BOBROW R.: Fragment Processing in the DELPHI System, Proc. DARPA Speech and Natural Language Workshop, 1992.
  • [37] SUTTON S., et al.: Universal Speech Tools: The CSLU Toolkit, Proc. ICSLP, 1998.
  • [38] TATHAM M., MORTON K.: Developments in Speech Synthesis, 2005.
  • [39] KUPPEVELT J.C. van, SMITH R.W.: Current and New Directions in Discourse and Dialogue, 2005.
  • [40] WARD W.: The CMU Air Travel Information Service: Understanding Spontaneous Speech, Proc. ARPA Workshop on Speech and Natural Language, 1990.
  • [41] YI J., GLASS J.: Natural-Sounding Speech Synthesis Using Variable Length Units, Proc. ICSLP, 1998.
  • [42] YOUNG S., BLOOTHOOFT G.: Corpus-based methods in Language and speech processing, 1997.
  • [43] ZUE V., SENEFF S., GLASS J., POLIFRONI J., PAO C., HAZEN T., HETHERINGTON L.: JUPITER, A Telephone-Based Conversational Interface for Weather Information, IEEE Trans. SAP, 8(1), 2000.
  • [44] ZUE V., SENEFF S., POLIFRONI J., PHILLIPS M., PAO C., GODDEAU D., GLASS J., BRILL E.: PEGASUS, A Spoken Language Interface for On-Line Air Travel Planning, Speech Communication, 15, 1994.
  • [45] ZUE V.W., GLASS J. R.: Conversational Interfaces: Advances and Challenges, Proceedings of the IEEE, vol. 88, no. 8, 2000.
Document type
Bibliography
YADDA identifier
bwmeta1.element.baztech-article-BWAK-0008-0006