A robust generalized sidelobe canceller employing speech leakage masking

Borowicz, A.

Artykuł - szczegóły

Tytuł artykułu

A robust generalized sidelobe canceller employing speech leakage masking

Autorzy

Borowicz A.

Treść / Zawartość

Pełne teksty:

Pobierz

Identyfikatory

Warianty tytułu

Skuteczny tłumik listków bocznych z wykorzystaniem maskowania przecieku mowy

Języki publikacji

Abstrakty

A novel speech enhancement method based on generalized sidelobe canceller (GSC) structure is presented. We show that it is possible to reduce audible speech distortions and preserve residual noise level under acoustic model uncertainties. It can be done by constraining a speech leakage power according to masking phenomena and conditional minimizing the residual noise power. We implemented the proposed approach using a simple delay-and-sum beamformer model. Finally a comparative evaluation of the selected methods is performed using objective speech quality measures. The results show that the novel method outperforms conventional one providing lower speech distortions.

Prezentowana jest nowa metoda uzdatniania mowy w oparciu o strukturę uogólnionego tłumika listków bocznych. Wykazujemy, ze możliwe jest zmniejszenie słyszalnych zniekształceń mowy przy zachowaniu stałego poziomu szumu rezydualnego, dla modeli przybliżonych środowiska akustycznego. Może to być dokonane poprzez uwarunkowanie poziomu mocy przecieku mowy zgodnie ze zjawiskiem maskowania oraz minimalizację warunkową mocy szumu rezydualnego. Proponowane podejście zaimplementowano w oparciu o prosty model beamformera opóźniająco-sumującego. Ostatecznie przeprowadzono ocenę porównawczą wybranych metod z wykorzystaniem obiektywnych miar jakości mowy. Wyniki pokazują, że nowa metoda przewyższa konwencjonalną zapewniając mniejsze zniekształcenia mowy.

Słowa kluczowe

GSC psychoacoustics speech enhancement

GSC psychoakustyka uzdatnianie mowy

Wydawca

Oficyna Wydawnicza Politechniki Białostockiej

Czasopismo

Advances in Computer Science Research

Rocznik

2014

Tom

Nr 11

Strony

17--29

Opis fizyczny

Bibliogr. 16 poz., rys., wykr.

Twórcy

autor

Borowicz A.

Faculty of Computer Science, Bialystok University of Technology, Białystok, Poland

Bibliografia

[1] J.B. Allen and D.A. Berkley. Image method for efficiently simulating smallroom acoustics. Journal Acoustic Society of America, 65(4):943, 1979.
[2] J. Benesty, J. Chen, Y. Huang, and J. Dmochowski. On microphonearray beamforming from a mimo acoustic signal processing perspective. IEEE Trans. Audio, Speech, Lang. Process., 15(3):1053–1065, 2007.
[3] A. Borowicz and A. Petrovsky. Signal subspace approach for psychoacoustically motivated speech enhancement. Speech Comm., 53(2):210–219, 2011.
[4] O.L. Frost. An algorithm for linearly constrained adaptive array processing. In Proc. IEEE, volume 60, pages 926–935, Aug 1972.
[5] S. Gannot, D. Burshtein, and E. Winstein. Signal enhancement using beamforming and nonstationarity with applications to speech. IEEE Trans. Signal Process., 49(8):1614–1626, 2001.
[6] J.S. Garofolo, L.F. Lamel, W.M. Fisher, J.G. Fiscus, D.S. Pallett, and N.L. Dahlgren. DARPA TIMIT Acoustic-Phonetic Continuous Speech Corpus. National Institute of Standards and Technology (NIST), CD-ROM, 1993.
[7] Y. Huang, J. Benesty, and J. Chen. Analysis and comparison of multichannel noise reduction methods in a common framework. IEEE Trans. Audio, Speech, Lang. Process., 16(5):957–968, 2008. [8] ITU-T. Perceptual evaluation of speech quality (PESQ). Rec. P.862, ITU, Geneva, 2001.
[9] J.D. Johnston. Transform coding of audio signals using perceptual noise criteria. IEEE J. on Selected Areas in Comm., 6:314–323, February 1988.
[10] R. Ratnam, D.L. Jones, and W.D. O’Brien. Fast algorithms for blind estimation of reverberation time. IEEE Signal Process. Lett., 11(6):537–540, 2004.
[11] M. Souden, J. Benesty, and S. Affes. A study of the LCMV and MVDR noise reduction filters. IEEE Trans. Sig. Process., 58(9):4925–4935, Sept 2010.
[12] R. Talmon, I. Cohen, and S. Gannot. Convolutive transfer function generalized sidelobe canceler. IEEE Trans. Speech, Lang. Process., 17(7):1420–1434, Sept 2009.
[13] A. Varga and H.J.M. Steeneken. Assessment for automatic speech recognition: II. NOISEX-92: A database and an experiment to study the effect of additive noise on speech recognition systems. Speech Communication, 12(3):247–251, 1993.
[14] D. Virette, P. Scalart, and C. Lamblin. Analysis of background noise reduction techniques for robust speech coding. In Proc. EUSIPCO, volume 3, pages 297– 300, 2002.
[15] S. Wang, A. Sekey, and A. Gersho. An objective measure for predicting subjective quality of speech coders. IEEE J. Sel. Areas Commun., 10:819–829, 1992.
[16] W. Yang, M. Benbouchta, and R. Yantorno. Performance of a modified bark spectral distortion measure as an objective speech quality measure. In Proc. ICASSP, pages 541–544, Seattle, USA, 1998.

Typ dokumentu

Bibliografia

Identyfikator YADDA

bwmeta1.element.baztech-71291ecc-3519-466e-9655-2da4e7575177