Wyniki wyszukiwania - BazTech

Ograniczanie wyników

2 Przegląd Elektrotechniczny

Znaleziono wyników: 2

Liczba wyników na stronie

Wyniki wyszukiwania

Wyszukiwano:
w słowach kluczowych: dźwięk mowy

Sortuj według:

Ogranicz wyniki do:

Sub-band VAD Based on Continuous Noise Estimation in Wavelet Domain

Rasoolzadeh S., Rahmani M.

Przegląd Elektrotechniczny

2012

R. 88, nr 2

311-314

Voice Activity Detectors (VADs) are widely used in speech processing applications such as speech enhancement and coding. In speech enhancement systems, we use VAD in order to obtain more accurate estimation of noise. Voice activity detection methods usually work in time or frequency domains. In this paper, we propose two approaches for voice activity detection in wavelet domain for continuous noise estimation: Subband VAD and Full-band VAD. We use the correct noise/speech classification rate to show the acceptable performance of our proposed VADs. In addition, we apply sub-band VAD to a speech enhancement system. Objective and subjective measures show that Sub-band VAD outperforms Fullband VAD in noise reduction applications.

Analizowano detektor aktywności głosowej VAD w zastosowaniu do poprawy jakości dźwięku mowy. Celem było określenie szumów towarzyszących. Zastosowano transformatę falkową do analizy szumów.

A Data Field method for speech enhancement incorporating Binary Time-Frequency Masking

Huang J., Zhang Y., Zhang X., Zhu T.

Przegląd Elektrotechniczny

2011

R. 87, nr 7

225-229

A data field approach coupled with binary time-frequency masking is presented for the speech enhancement problem. In this proposed approach, data field method is employed to model the time and frequency dependencies of speech. This formulation has proved to be very helpful in enhancing speech quality by exploiting the correlation of speech both in time and in frequency. The experimental results demonstrate that the proposed algorithm offers improved signal to noise ratio and less spectral distortion.

Do poprawy jakości dźwięku mowy zastosowano metodę pola danych (Data field) połączoną z binarnym maskowanie czasowoczęstotliwościowym. Pozwoliło to znacząco poprawić jakość dźwięku przez wykorzystanie korelacji czasowej i częstotliwościowej. Uzyskano poprawę stosunku sygnału do szumu i zmniejszenie poziomu zniekształceń.