Perceptually motivated approaches to speech enhancement. Part 1, Elimination of the musical noise phenomenon

Borowicz, A.; Petrovsky, A. A.

Article - details

Article title

Perceptually motivated approaches to speech enhancement. Part 1, Elimination of the musical noise phenomenon

Authors

Borowicz A., Petrovsky A. A.

Selected full texts from this journal

http://ijet.pl/index.php/ijet

Identifiers

Title variants

Metody motywowane perceptualnie w uzdatnianiu sygnału mowy. Cz. 1, Eliminacja zjawiska tonów muzycznych

Publication languages

Abstracts

This paper addresses the problem of noise reduction in speech communication devices. First, the basics of spectral weighting techniques and the problems associated with them are presented. Next, we describe psychoacoustic principles as a tool for improving speech enhancement systems. Conventional methods aimed at eliminating residual noise, and their psychoacoustic modifications, are discussed, and the differences between them are emphasized. Finally, the advantages of perceptual approaches over conventional ones are presented. The paper shows that exploiting psychoacoustic models in speech enhancement provides significant improvements, especially in residual noise reduction and in decreasing speech distortion.

The paper presents issues related to the design and use of speech transmission systems based on the spectral weighting method. The role of psychoacoustic phenomena such as masking, critical bands, and the absolute threshold of hearing in the speech enhancement process is discussed. Classical speech enhancement methods aimed at eliminating musical tones are compared with systems based on a psychoacoustic modification of the spectral weighting method. The results of applying psychoacoustic solutions are compared in terms of their ability to suppress environmental noise and to prevent the generation of speech signal distortion. The benefits of applying basic psychoacoustic properties of the human auditory system in spectral weighting methods are highlighted; the limitations of these methods and the possibilities for optimizing them are also indicated.

Keywords

noise reduction, echo cancellation, psychoacoustics

Publisher

Wydawnictwo Naukowe PWN SA

Journal

Kwartalnik Elektroniki i Telekomunikacji

Year

2004

Volume

Vol. 50, no. 3

Pages

379-394

Physical description

Bibliography: 30 items, figures, charts.

Contributors

author

Borowicz A.

borowicz@ii.pb.bialystok.pl

Wydział Informatyki, Politechnika Białostocka, ul. Wiejska 45A, 15-351 Białystok

author

Petrovsky A. A.

Wydział Informatyki, Politechnika Białostocka, ul. Wiejska 45A, 15-351 Białystok

Bibliography

1. K. Kroschel: Noise reduction: an old problem of actual concern. 8th International Symposium on Sound Engineering and Mastering (ISSEM'99), Gdańsk, 1999, pp. 3-8.
2. R. Martin: Spectral subtraction based on minimum statistics. VII European Signal Processing Conference (EUSIPCO'94), Edinburgh, 1994, pp. 1182-1185.
3. S. F. Boll: Suppression of acoustic noise in speech using spectral subtraction. IEEE Transactions on Acoustics, Speech, and Signal Processing, 1979, vol. ASSP-27, no. 2, pp. 113-120.
4. Y. Ephraim, D. Malah: Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator. IEEE Transactions on Acoustics, Speech, and Signal Processing, 1984, vol. ASSP-32, no. 6, pp. 1109-1121.
5. Y. Ephraim, D. Malah: Speech enhancement using a minimum mean-square error log-spectral amplitude estimator. IEEE Transactions on Acoustics, Speech, and Signal Processing, 1985, vol. ASSP-33, no. 2, pp. 443-445.
6. C. Beaugeant, P. Scalart: Noise reduction using perceptual spectral change. 6th European Conference on Speech Communication and Technology (EUROSPEECH'99), Budapest, Hungary, 1999, pp. 2543-2546.
7. O. Cappé: Elimination of the musical noise phenomenon with the Ephraim and Malah noise suppressor. IEEE Transactions on Speech and Audio Processing, 1994, vol. 2, pp. 345-349.
8. N. Virag: Single channel speech enhancement based on masking properties of the human auditory system. IEEE Transactions on Speech and Audio Processing, 1999, vol. 7, no. 2, pp. 126-137.
9. T. Haulick, K. Linhard, P. Schrogmeier: Residual noise suppression using psychoacoustic criteria. 5th European Conference on Speech Communication and Technology, Greece, 1997, pp. 1395-1398.
10. D. E. Tsoukalas, J. N. Mourjopoulos, G. Kokkinakis: Speech enhancement based on audible noise suppression. IEEE Transactions on Speech and Audio Processing, 1997, vol. 5, no. 6, pp. 497-514.
11. K. Bielawski: System eliminacji echa akustycznego i szumów w urządzeniach głośnomówiących wykorzystujące modele psychoakustyczne. PhD dissertation (supervisor: Prof. A. A. Petrovsky), Politechnika Warszawska, 2001.
12. S. Gustafsson, R. Martin, P. Jax, P. Vary: A psychoacoustic approach to combined acoustic echo cancellation and noise reduction. IEEE Transactions on Speech and Audio Processing, 2002, vol. 10, no. 5, pp. 245-256.
13. A. A. Petrovsky, A. E. Anoshenko: Combined system for echo cancellation and noise reduction in frequency domain with psychoacoustic motivation. 2nd International Conference and Exhibition "Digital Signal Processing and its Applications", 1999, Moscow, Russia, pp. 166-169 (in Russian).
14. E. Zwicker, H. Fastl: Psychoacoustics: Facts and Models. Berlin, Springer Verlag, 1990.
15. A. Borowicz, A. A. Petrovsky: The comparative study of voice activity detectors. VII International Conference of Modern Telecommunication Systems, Naroch, Belarus, 2002, pp. 148-152.
16. R. Martin: Noise power spectral density estimation based on optimal smoothing and minimum statistics. IEEE Transactions on Speech and Audio Processing, 2001, vol. 9, no. 5, pp. 504-512.
17. A. Akbari Azirani, R. Lebouquin, G. Faucon: Speech enhancement using a Wiener filtering under signal presence uncertainty. VIII European Signal Processing Conference (EUSIPCO'96), Trieste, Italy, 1996, pp. 971-974.
18. J. Lim, A. Oppenheim: Enhancement and Bandwidth Compression of Noisy Speech. Proceedings of the IEEE, 1979, vol. 67, no. 12, pp. 1586-1604.
19. K. Bielawski, A. A. Petrovsky: Joint system for acoustic echo and noise control in hands-free communication devices. Kwartalnik Elektroniki i Telekomunikacji, PWN Warszawa, 1998, vol. 44, no. 4, pp. 115-136.
20. S. Haykin: Adaptive filter theory. Third edition, Prentice-Hall, 1996.
21. B. Ayad, G. Faucon, R. Bouquin-Jeannes: Optimization of a noise reduction pre-processing in an acoustic echo and noise controller. International Conference on Acoustics, Speech and Signal Processing (ICASSP'96), Atlanta, USA, 1996, pp. 953-956.
22. R. Bouquin-Jeannes, P. Scalart, G. Faucon, C. Beaugeant: Combined noise and echo reduction in hands-free systems: a survey. IEEE Transactions on Speech and Audio Processing, 2001, vol. 9, no. 8, pp. 808-820.
23. S. Gustafsson, R. Martin: Combined acoustic echo control and noise reduction based on residual echo estimation. International Workshop on Acoustic Echo and Noise Control, London, U.K., 1997, pp. 160-163.
24. P. Lockwood, J. Boudy: Experiments with a nonlinear spectral subtractor (NSS). Speech Communication, 1992, vol. 11, pp. 215-228.
25. T. Painter, A. Spanias: Perceptual coding of digital audio. Proceedings of the IEEE, 2000, vol. 88, no. 4, pp. 451-513.
26. P. Noll: Digital audio coding for visual communications. Proceedings of the IEEE, 1995, vol. 83, no. 6, pp. 925-943.
27. Al. Petrovsky, D. Krahe, A. A. Petrovsky: Real-time wavelet packet-based low bit-rate audio coding on a dynamic reconfiguration system. AES Convention paper 5778, presented at the 114th Convention, 2003, Amsterdam, The Netherlands, pp. 22-27.
28. J. D. Johnston: Transform coding of audio signals using perceptual noise criteria. IEEE Journal on Selected Areas in Communications, 1988, vol. 6, no. 2, pp. 314-323.
29. ISO/IEC 11172-3, Information technology - Coding of moving pictures and associated audio for digital storage media at up to about 1,5 Mbit/s, Part 3: Audio, 1993.
30. Z. Goh, K. Tan, B. T. G. Tan: Postprocessing method for suppressing musical noise generated by spectral subtraction. IEEE Transactions on Speech and Audio Processing, 1998, vol. 6, no. 3, pp. 287-292.

Document type

Bibliography

YADDA identifier

bwmeta1.element.baztech-article-BWA1-0005-0147