Wyniki wyszukiwania - BazTech

1

Investigation of the Lombard effect based on a machine learning approach

Korvel Gražina, Treigys Povilas, Kąkol Krzysztof, Kostek Bożena

International Journal of Applied Mathematics and Computer Science

|

2023

|

Vol. 33, no. 3

479--492

EN

The Lombard effect is an involuntary increase in the speaker’s pitch, intensity, and duration in the presence of noise. It makes it possible to communicate in noisy environments more effectively. This study aims to investigate an efficient method for detecting the Lombard effect in uttered speech. The influence of interfering noise, room type, and the gender of the person on the detection process is examined. First, acoustic parameters related to speech changes produced by the Lombard effect are extracted. Mid-term statistics are built upon the parameters and used for the self-similarity matrix construction. They constitute input data for a convolutional neural network (CNN). The self-similarity-based approach is then compared with two other methods, i.e., spectrograms used as input to the CNN and speech acoustic parameters combined with the k-nearest neighbors algorithm. The experimental investigations show the superiority of the self-similarity approach applied to Lombard effect detection over the other two methods utilized. Moreover, small standard deviation values for the self-similarity approach prove the resulting high accuracies.

2

Koncepcja systemu wspomagającego koordynację pracy służb ratownictwa medycznego

Długosz T., Kubiak P.

Przegląd Telekomunikacyjny + Wiadomości Telekomunikacyjne

|

2014

|

nr 2-3

42--44

PL

Przedstawiono koncepcję systemu wspomagającego i usprawniającego działanie służb ratownictwa medycznego. Zaproponowano moduł rozpoznawania głosu i automatycznego uzupełniania kart udzielonej kwalifikowanej pierwszej pomocy System ma na celu sprawniejszą obsługę poszkodowanego i unikanie pomyłek w dokumentacji medycznej.

EN

A concept of system aiding and improving the action of emergency medical services is presented in this paper. A module of voice recognition and automatic replenishment of patient card is described. The main purpose of this system is efficient handling of the victim and avoiding mistakes in medical records.

3

Audio features for speech detection in adverse conditions

Mąka T.

Elektronika : konstrukcje, technologie, zastosowania

|

2010

|

Vol. 51, nr 4

38-40

EN

The paper presents an analysis of the audio features for speech processing systems, where speech signal is contaminated by background noise. To determine robustness of speech features for different audio environments, a comparison between feature contours in clean and noisy conditions using mean-square error criterion was performed. The obtained results have been exploited to simple, low-complexity speech detection algorithm. Experimental results show that accurate determination of speech regions is highly dependent on recording conditions and speaker characteristics. However, such approach is suitable for automatic detection of sentence boundaries for speech processing systems.

PL

W pracy przedstawiono analizę cech wykorzystywanych w systemach przetwarzania sygnału mowy w kontekście jego detekcji w niekorzystnych warunkach rejestracji. W wyniku przeprowadzonej analizy określono zbiór cech, których kontury ulegają najmniejszym zniekształceniom na podstawie pomiaru błędu średniokwadratowego dla sygnału bez zakłóceń i zdegradowanego. Z użyciem tych cech zaproponowano prosty algorytm detekcji sygnału mowy o niskiej złożoności. Wyniki przeprowadzonych badań pokazują, że określenie dokładnych granic poszczególnych słów jest ściśle uzależnione od warunków akwizycji oraz rodzaju mówcy. Pomimo tego, proponowane podeście umożliwia określenie w sposób automatyczny granic wypowiedzi w systemach przetwarzania sygnału mowy.

4

Moduł wykrywania sygnału głosowego o zróżnicowanej amplitudzie w systemie automatycznego rozpoznawania mowy

Binkowski M.

Zeszyty Naukowe. Automatyka / Politechnika Śląska

|

2003

|

z. 138

31-46

PL

W pracy opisano przykładowy moduł wykrywania sygnału głosowego, wspomagający system rozpoznawania mowy. Jednym z bloków funkcjonalnych opisywanego modułu jest funkcja wyznaczająca obwiednię sygnału. Zaproponowano działanie modułu przy zastosowaniu różnych funkcji obwiedni: wartości maksymalnej sygnału w oknie, średniej wartości bezwzględnej sygnału w oknie, odchylenia standardowego sygnału w oknie, energii sygnału w oknie oraz entropii sygnału w oknie. W końcowej części pracy porównano rezultaty otrzymane dla przykładowej wypowiedzi, zarejestrowanej wraz z zakłóceniami i szumem otoczenia. Wskazano również możliwości dalszych badań i sposoby poprawy skuteczności wykrywania głosu.

EN

This work describes an example of voice detection module, supporting speech recognition system. One of functional blocks of described module is an envelope function. Voice detection efficiency has been examined using various envelope functions: maximum value of signal in a window, mean of absolute values of signal in a window, standard deviation of signal in a window, energy of signal in a window and entropy of signal in a window. In last part of the work the results of analysis of an exemplary signal have been presented. The signal has been recorded with accompanying ambient noise and disturbances. Also means of improvements and areas of further exploration has been proposed.