PL EN


Preferencje help
Widoczny [Schowaj] Abstrakt
Liczba wyników
Tytuł artykułu

Perceptually motivated approaches to speech enhancement. Part 2, Psychoacoustic optimization of spectral weighting rules

Wybrane pełne teksty z tego czasopisma
Identyfikatory
Warianty tytułu
PL
Metody motywowane perceptualnie w uzdatnianiu sygnału mowy. Cz. 2, Psychoakustycan optymalizacja metod wag widmowych
Języki publikacji
EN
Abstrakty
EN
This paper focuses on the class of speech enhancement systems, which capitalize on psychoacoustic properties of the human ear. More advanced psychoacoustically motivated spectral weighting rules are described. Presented systems are analyzed and classified according to their similarity with a human auditory model. Especially, a comparison of improvements in musical noise cancellation and increasing speech intelligibility is performed. Moreover, advantages of the perceptual approaches over conventional ones are focused. Finally, perspectives of integrated psychoacoustically motivated speech enhancement and coding systems are discussed. Paper shows that integration of subband coder with speech enhancement system based on non-uniformly spaced filter bank leads to most promissing combined scheme.
PL
Dokonano przeglądu oraz porównania metod uzdatniania sygnału mowy motywowanych perceptualnie. Wskazano na niedoskonałość rozwiązań psychoakustycznych wykorzystujących klasyczne metody wag widmowych. Opierając się na literaturze zaprezentowano różne sposoby psychoakustycznej optymalizacji tych metod. Prezentowane systemy sklasyfikowano według stopnia zgodności z modelem słuchowym człowieka. Jednocześnie zestawiono wyniki zastosowań rozwiązań psychoakustycznych pod kątem możliwości tłumienia szumu środowiskowego i zapobiegabia zniekształceń sygnału mowy. W zestawieniu uwzględniono także połączone systemy eliminacji echa i redukcji szumów. Ostatecznie przedstawiono perspektywy integracji systemu uzdatniania sygnału mowy z systemem kodowania podpasmowego uwydatniając wykorzystanie modeli psychoakustycznych jako element wspólny obu systemów.
Rocznik
Strony
395--409
Opis fizyczny
Bibliogr. 37 poz., rys., wykr.
Twórcy
autor
  • Wydział Informatyki, Politechnika Białostocka ul. Wiejska 45A, 15-351 Białystok
  • Wydział Informatyki, Politechnika Białostocka ul. Wiejska 45A, 15-351 Białystok
Bibliografia
  • 1. K. Kroschel: Noisereduction: anold problem of actual concern. 8th International Symposium on Sound Engineering and Mastering (ISSEM'99), Gdańsk, 1999, pp. 3-8.
  • 2. R. Martin: Spectral subtraction based on minimum statistic. VII European Signal Processing Conference (EUSIPCO'94), Edinburg. 1994. pp. 1182-1185.
  • 3. S. F. Boll: Suppression of acoustic noise in speech using spectral subtraction. IEEE Transactions on Acoustic Speech and Signal Processing. 1979, vol. ASSP-27, no. 2, pp. 113-120.
  • 4. Y. Ephraim, D. Malah: Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator. IEEE Transactions on Acoustic Speech and Signal Processing, 1984, vol. ASSP-32, no. 6, pp. 1109-1121.
  • 5. Y. Ephraim, D. Malah: Speech enhancement using a minimum mean-square log-spectral amplitude estimator. IEEE Transactions on Acoustic Speech and Signal Processing, 1985, vol. ASSP-33, no. 2, pp. 443-445.
  • 6. C. Beaugeant, P. Scalart: Noise reduction using perceptual spectral change. 6th European Conference on Speech Communication and Technology (EUROSPEECH'99), Budapest, Hungary, 1999. pp. 2543-2546.
  • 7. O. Cappe: Elimination of the musical noise phenomenon with the Ephraim and Malah noise suppressor. IEEE Transactions on Speech and Audio Processing, 1994, vol. 2, pp. 345-349.
  • 8. E. Zwicker, H. Fasl: Psychoacoustic. Facts and models. Berlin, Springer Verlag, 1990.
  • 9. T. Painter, A. Spanias: Perceptual coding of digital audio. Proceedings of the IEEE, 2000, vol. 88, no. 4, pp. 451-513.
  • 10. P. Noll: Digital audio coding for visual communications. Proceedings of the IEEE, 1995, vol. 83, no. 6, pp. 925-943.
  • 11. N. Virag: Single channel speech enhancement based on masking properties of the human auditory system. IEEE Transactions on Speech and Audio Processing, 1999, vol. 7, no. 2, pp. 126-137.
  • 12. T. Haulick, K. Linhard, P. Schrogmeier: Residual noise suppression using psychoacoustic criteria. 5th European Conference on Speech Communication and Technology, Greece, 1997, pp. 1395-1398.
  • 13. D. E. Tsoukalas, J. N. Mourjopoulos, G. Kokkinakis: Improving the intelligibility of noisy speech using an audible noise suppression technique. 5th European Conference on Speech Communication and Technology, Greece, 1997, pp. 1415-1418.
  • 14. D. E. Tsoukalas, J. N. Mourjopoulos, G. Kokkinakis: Speech enhancement based on audible noise suppression. IEEE Transactions on Speech and Audio Processing, 1997, vol. 5, no. 6, pp. 497-514.
  • 15. K. Bielawski: System eliminacji echa ekustycznego i szumów w urządzeniach głośnomówiących wykorzystujący model psychoakustyczne. Rozprawa doktorska (promotor prof. A. A. Petrovsky), Politechnika Warszawska, 2001.
  • 16. K. Bielawski, A. A. Petrovsky: Acoustic echo control and noise reduction in non-uniform filter bank: an application of oversampling multirate systems and all-pass transformation. IEEE Nordic Signal Processing Symposium (NORSIG'2000), Kolmarden, Sweden, 2000, pp. 41-44.
  • 17. K. Bielawski, A. A. Petrovsky: Speech enhancement system for hands-free telephone based on the psychoacoustically motivated filter bank with all-pass frequency transformation. 6th European Conterence on Speech Communication and Technology (EROSPEECH'99), Budapest. Hungary, 1999, pp. 2555-2558.
  • 18. A. A. Petrovsky, K. Bielawski: Combined system for echo cancellation and noise reduction: subbands adaptive filtering approach. 3th International Conference, ,,Sampling Theory"·(SAMPTA'99), 1999, Loen. Norway, (invited paper) pp. 1-6.
  • 19. K. Bielawski, A. A. Petrovsky: Proposition of minimum bands multirate noise reduction system which exploits properties· of the human auditory system and all-pass transformed filter bank. IEEE Workshop Signal Processing, Poznań, Poland, 2001, pp. 65-70.
  • 20. A. A. Petrovsky, K. Bielawski: Auditory model based enhancement system for hands-free devices. XI European Signal Processing Conference (EUSIPCO'2002), Toulouse, France. 2002, vol. 1 pp. 487-490.
  • 21. M. Parfieniuk, A. A. Petrovsky: Struktury polifazowe w cyfrowych bankach filtrów. Elektronika. SEP Warszawa, Maj 2001 , pp. 17-21.
  • 22. A. A. Petrovsky, M. Parfieniuk. K. Bielawski: Psychoacousticall motivated nonuniform cosine modulated polyphase filter bank. International TICSP Workshop on Spectral Methods and Multirate Signal Processing (SMMSP'02), Toulouse, France, 2002. pp. 95-101.
  • 23. K. Bielawski, J. Baszun, A. A. Petrovsky: Cochlear spaced filter bank using allpass frequency transformation. EURASIP Conference DSP for Multimedia Communication and Services (ECMCS' 99), Krakow, Poland, 1999, CD.
  • 24. S. Gustafsson, P. Jax, P. Vary: A novel psychoacoustically motivated audio enhancement algorithm preserving background noise characteristic. IEEE International Conference on Acoustic, Speech, and Signal Processing (ICASSP'98), Seattle, USA, 1998, vol. 1, pp. 397-400.
  • 25. S. Gustafsson, P. Jax, A. Kamphausen, P. Vary: A postfilter for echo and noise reduction avoiding the problem of musical tones. IEEE International Conference on Acoustic, Speech and Signal Processing (ICASSP'99), Phoenix, USA, 1999, vol. 2, pp. 873-876.
  • 26. S. Gustafsson, R. Martin, P. Jax, P. Vary: A psychoacoustic approach to combined acoustic echo cancellation and noise reduction. IEEE Transactions on Speech and Audio Processing, 2002, vol. 10, no. 5, pp. 245-256.
  • 27. V. Turbin, A. Gilloire, P. Scalart, C. Beaugeant: Using psychoacoustic criteria in acoustic echo cancellation algorithms. IEEE Workshop on Acoustic Echo and Noise Control, London, 1997, pp. 53-56.
  • 28. S. Gustafsson, R. Martin: Combined acoustic echo control and noise reduction based on residual echo estimation. International Workshop on Acoustic Echo and Noise Control, London, U.K., 1997, pp. 160-163.
  • 29. G. P. M. Egelmeers: Decoupling of partition factors in partitioned block FDAF European Conference on Circuits Theory and Design (ECCTD'93), Davos, Switzerland, 1993, pp. 323-329.
  • 30. F. Capman, J. Boudy, P. Lockwood: Acoustic echo cancellation and noise reduction in the frequency domain: a global optimization. 8th European Signal Processing Conference (E SIPCO'96), Trieste, Italy, 1996, pp. 29-32.
  • 31. A. Petrovsky, K. Bielawski, A. Anoshenko: Hands-free radiotelephony communication devices with combine front end processing systems: global approaches in the time and frequency domain. Journal of the University of Applied Sciences, Mittweida (FH), Kommunikationstechnik, band C, no. 3, 1998, pp. 135-142.
  • 32. A. A. Petrovsky, A. E. Anoshenko: Combined system f or echo cancellation and noise reduction in frequency domain with psychoacoustic motivation. 2nd International Conference and Exhibition, “Digital Signal Processing and its Applications”, 1999, Moscow, Russia, pp. 166-169 (in Russian).
  • 33. D. Virette, P. Scalart, C. Lamblin: Analysis of background noise reduction techniques for robust speech coding. XI European Signal Processing Conference (EUSIPCO'2002), Toulouse, France, 2002, vol. 3, pp. 297-300.
  • 34. T. Agarwal, P. Kabal: Pre-processing of noisy speech for voice coders. IEEE Workshop on Speech Coding. Tsukaba, Japan, 2002. pp. 169-171.
  • 35. R. Martin. H. G. Kang. R. V. Cox: Low delay analysis/synthesis scheme for joint speech enhancement and low·bit rate speech coding. 6th European Conference on Speech Communication and Technology (EUROSPEECH' 99), Budapest, Hungary. 1999. vol. 3, pp. 1463-1466.
  • 36. B. Carnero. A. Drygajło: Perceptual Speech coding and enhancement using frame-synchronized fast wavelet packet transform algorithms. IEEE Transactions on Signal Processing, 1999, vol. 47, no. 6, pp. 1622-1635.
  • 37. A. Cichocki, S. Amari: Adaptive Blind Signal and Image Processing Wiley & Sons, Ltd., Chichester, UK. 2002.
Typ dokumentu
Bibliografia
Identyfikator YADDA
bwmeta1.element.baztech-article-BWA1-0005-0148
JavaScript jest wyłączony w Twojej przeglądarce internetowej. Włącz go, a następnie odśwież stronę, aby móc w pełni z niej korzystać.