Search results - BazTech

1

Enhancing speech signals based on an MEMS microphone array and temporal differences in the incoming signal

Felcyn Jan, Raszewski Michał

Vibrations in Physical Systems | 2022 | Vol. 33, no. 2 | art. no. 2022202

EN

The development of the Internet of Things and automation in everyday life also influences our homes. More and more devices on the market can be controlled remotely. One such form of control uses voice signals. This method tends to use microphone arrays and dedicated algorithms to enhance the speech signal and recognize the words in it. In this project, a small 5-microphone array was developed. To enhance the quality of the signal, dedicated software was written. It consists of several modules, including direction-of-arrival estimation, denoising, and differentiation between adults and children. The results showed that the custom algorithm can increase the signal-to-noise ratio by up to 6 dB.
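The abstract does not say how the direction-of-arrival module works. As a rough illustration of how temporal differences between microphones can be used, the sketch below estimates the delay between two channels from the peak of their cross-correlation and converts it to an angle under a far-field assumption. The function names, the two-microphone setup, and the spacing and sampling-rate parameters are assumptions made for this example, not details from the paper.

```python
import numpy as np

def estimate_delay(ref, other):
    """Return the lag (in samples) by which `other` trails `ref`."""
    # Full cross-correlation; output index 0 corresponds to lag -(len(ref) - 1).
    corr = np.correlate(other, ref, mode="full")
    return int(np.argmax(corr)) - (len(ref) - 1)

def delay_to_angle(delay_samples, fs, spacing, c=343.0):
    """Convert a delay to an arrival angle in degrees (far-field assumption).

    fs      -- sampling rate in Hz (hypothetical parameter)
    spacing -- distance between the two microphones in metres (hypothetical)
    c       -- speed of sound in m/s
    """
    # Path-length difference divided by spacing gives the sine of the angle.
    sin_theta = np.clip(c * delay_samples / fs / spacing, -1.0, 1.0)
    return float(np.degrees(np.arcsin(sin_theta)))
```

A delay of zero samples maps to a source directly in front of the microphone pair (0 degrees); larger delays map to larger off-axis angles until the physical limit set by the spacing is reached.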

2

A robust generalized sidelobe canceller employing speech leakage masking

Borowicz A.

Advances in Computer Science Research | 2014 | No. 11 | 17-29

EN

A novel speech enhancement method based on the generalized sidelobe canceller (GSC) structure is presented. We show that it is possible to reduce audible speech distortions and preserve the residual noise level under acoustic model uncertainties. This can be done by constraining the speech leakage power according to masking phenomena and conditionally minimizing the residual noise power. We implemented the proposed approach using a simple delay-and-sum beamformer model. Finally, a comparative evaluation of the selected methods is performed using objective speech quality measures. The results show that the novel method outperforms the conventional one, providing lower speech distortion.

PL

Prezentowana jest nowa metoda uzdatniania mowy w oparciu o strukturę uogólnionego tłumika listków bocznych. Wykazujemy, że możliwe jest zmniejszenie słyszalnych zniekształceń mowy przy zachowaniu stałego poziomu szumu rezydualnego, dla modeli przybliżonych środowiska akustycznego. Może to być dokonane poprzez uwarunkowanie poziomu mocy przecieku mowy zgodnie ze zjawiskiem maskowania oraz minimalizację warunkową mocy szumu rezydualnego. Proponowane podejście zaimplementowano w oparciu o prosty model beamformera opóźniająco-sumującego. Ostatecznie przeprowadzono ocenę porównawczą wybranych metod z wykorzystaniem obiektywnych miar jakości mowy. Wyniki pokazują, że nowa metoda przewyższa konwencjonalną zapewniając mniejsze zniekształcenia mowy.
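The delay-and-sum beamformer mentioned in this abstract can be sketched in a few lines. The version below is a generic textbook form that assumes known, non-negative integer sample delays per channel; it does not include the speech leakage masking constraint that the paper proposes, and the function name is chosen only for this example.

```python
import numpy as np

def delay_and_sum(channels, delays):
    """Align each channel by its integer delay (in samples) and average.

    channels -- array-like of shape (num_mics, num_samples)
    delays   -- non-negative integer delay of the target in each channel
    """
    channels = np.asarray(channels, dtype=float)
    n = channels.shape[1]
    out = np.zeros(n)
    for x, d in zip(channels, delays):
        # Advance the channel by d samples; the last d output samples stay zero.
        aligned = np.zeros(n)
        aligned[: n - d] = x[d:]
        out += aligned
    return out / len(channels)
```

After alignment the target adds coherently while uncorrelated noise averages out, so with M microphones and independent noise of equal power the noise power in the output drops by roughly a factor of M.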

3

Using auditory properties in multimicrophone speech enhancement

Borowicz A., Petrovsky A.

Elektronika : konstrukcje, technologie, zastosowania | 2012 | Vol. 53, no. 5 | 30-34

EN

In this article, a perceptually motivated multichannel speech enhancement system is presented. The proposed approach uses a generalized sidelobe canceler (GSC) method for speech dereverberation and noise suppression. The conventional GSC structure has been modified by introducing a weighting factor into the noise cancellation loop. This allows for a perceptually optimal shaping of the residual noise spectrum, which results in a decrease in speech distortion. A comparative evaluation of the selected methods has been performed using objective speech quality measures. Experimental results show that the proposed approach outperforms conventional ones, providing better speech quality.

PL

Artykuł przedstawia motywowany percepcyjnie wielokanałowy system uzdatniania mowy. Proponowane podejście wykorzystuje uogólnioną metodę tłumienia listków bocznych (ang. Generalised Sidelobe Canceller) do usuwania pogłosu i szumu. Zmodyfikowano konwencjonalną strukturę algorytmu GSC poprzez wprowadzenie współczynnika wagowego w pętli usuwania szumu. Umożliwia to optymalne, w sensie percepcyjnym, kształtowanie widma szumu resztkowego, co skutkuje zmniejszeniem zniekształceń mowy. Przeprowadzono ocenę porównawczą wybranych metod z wykorzystaniem obiektywnych miar jakości mowy. Wyniki eksperymentów pokazują, że proponowane podejście przewyższa metody konwencjonalne, zapewniając lepszą jakość mowy.
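For orientation, a conventional GSC can be sketched as follows for the simplest case of two microphones in which the target is already time-aligned. This is a plain textbook structure with an NLMS noise canceller, written under those assumptions; it deliberately omits the perceptual weighting factor that the article introduces, and the function name, filter length, and step size are illustrative choices.

```python
import numpy as np

def gsc_two_mics(x1, x2, taps=8, mu=0.05, eps=1e-8):
    """Two-channel GSC for a target that is time-aligned in both inputs.

    Returns (y, d): the GSC output and the fixed-beamformer output.
    """
    x1 = np.asarray(x1, dtype=float)
    x2 = np.asarray(x2, dtype=float)
    d = 0.5 * (x1 + x2)   # fixed beamformer: keeps the target, averages the noise
    u = x1 - x2           # blocking branch: cancels the aligned target, keeps noise
    w = np.zeros(taps)    # adaptive noise-canceller weights
    buf = np.zeros(taps)  # most recent blocking-branch samples, newest first
    y = np.zeros(len(d))
    for n in range(len(d)):
        buf = np.roll(buf, 1)
        buf[0] = u[n]
        y[n] = d[n] - w @ buf                     # subtract the noise estimate
        w += mu * y[n] * buf / (buf @ buf + eps)  # NLMS weight update
    return y, d
```

The blocking branch contains no target only as long as the alignment is exact. Any mismatch lets speech leak into the noise reference, which is the kind of distortion the modified structures in these records aim to control.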

4

Perceptually constrained signal subspace method for speech enhancement : approximate solutions

Borowicz A., Petrovsky A.

Elektronika : konstrukcje, technologie, zastosowania | 2008 | Vol. 49, no. 4 | 38-44

EN

This paper is concerned with the recently proposed perceptually constrained signal subspace (PCSS) method for speech enhancement. Two simplifications of the PCSS method are presented. The first approach is based on approximate diagonalization of the covariance matrix of noise energies in the transformed domain. The approximate solution is presented in a new form which provides perceptually optimal residual noise shaping and does not require a whitening transformation. The second approach is a realization of the PCSS method in the frequency domain. This is done under the assumption that the covariance matrices are circulant. The resulting estimator is almost identical to the well-known IND (Inaudible Noise Distortion) rule. An evaluation of selected methods is performed using objective speech quality measures and informal listening tests. The results show that the sub-optimal methods offer speech quality comparable to that of the exact solution in common situations.

PL

Artykuł dotyczy zaproponowanej ostatnio metody podprzestrzeni sygnału z ograniczeniami percepcyjnymi (PCSS). Prezentowane są dwa uproszczenia metody PCSS. Pierwsze podejście opiera się na przybliżonej diagonalizacji macierzy kowariancji energii szumu w dziedzinie transformaty. Rozwiązanie przybliżone umożliwia optymalne w sensie percepcyjnym kształtowanie widma szumu resztkowego i nie wymaga transformacji wybielających. Drugie podejście stanowi realizację metody PCSS w dziedzinie częstotliwości. Osiąga się to wykorzystując założenie, że macierze kowariancji są macierzami okresowymi. Uzyskany estymator okazuje się niemal identyczny z dobrze znaną regułą IND. Przeprowadzana jest ocena wybranych metod przy użyciu obiektywnych miar jakościowych oraz nieformalnych testów odsłuchowych. Wyniki wskazują, że metody przybliżone oferują porównywalną jakość mowy do metody dokładnej w typowych warunkach.
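The circulant assumption matters because every circulant matrix is diagonalized by the discrete Fourier transform, which is what allows a subspace method to be carried out per frequency bin. The short check below illustrates this general property only; it is not the PCSS estimator, and the helper names and example values are made up for the illustration.

```python
import numpy as np

def circulant(first_col):
    """Build the circulant matrix whose first column is `first_col`."""
    c = np.asarray(first_col, dtype=float)
    # Column k is the first column cyclically shifted down by k positions.
    return np.column_stack([np.roll(c, k) for k in range(len(c))])

def dft_diagonalize(C):
    """Return F C F^{-1}, where F is the DFT matrix of matching size."""
    n = C.shape[0]
    F = np.fft.fft(np.eye(n))       # DFT matrix
    return F @ C @ F.conj().T / n   # uses F^{-1} = F^H / n

# Hypothetical symmetric first column, as for a covariance matrix.
c = np.array([2.0, 0.5, 0.1, 0.5])
D = dft_diagonalize(circulant(c))
# The result is diagonal and its diagonal equals the DFT of the first column.
is_diagonalized = np.allclose(D, np.diag(np.fft.fft(c)))
```

Real covariance matrices of finite speech frames are Toeplitz, not exactly circulant, so the frequency-domain form is an approximation, consistent with the abstract describing it as a simplification.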