Wyniki wyszukiwania - Biblioteka Nauki

1

Automatic speech signal segmentation based on the innovation adaptive filter

100%

Makowski R. , Hossa R.

International Journal of Applied Mathematics and Computer Science

|

2014

|

tom 24

|

nr 2

259-270

EN

Speech segmentation is an essential stage in designing automatic speech recognition systems and one can find several algorithms proposed in the literature. It is a difficult problem, as speech is immensely variable. The aim of the authors' studies was to design an algorithm that could be employed at the stage of automatic speech recognition. This would make it possible to avoid some problems related to speech signal parametrization. Posing the problem in such a way requires the algorithm to be capable of working in real time. The only such algorithm was proposed by Tyagi et al., (2006), and it is a modified version of Brandt's algorithm. The article presents a new algorithm for unsupervised automatic speech signal segmentation. It performs segmentation without access to information about the phonetic content of the utterances, relying exclusively on second-order statistics of a speech signal. The starting point for the proposed method is time-varying Schur coefficients of an innovation adaptive filter. The Schur algorithm is known to be fast, precise, stable and capable of rapidly tracking changes in second order signal statistics. A transfer from one phoneme to another in the speech signal always indicates a change in signal statistics caused by vocal track changes. In order to allow for the properties of human hearing, detection of inter-phoneme boundaries is performed based on statistics defined on the mel spectrum determined from the reflection coefficients. The paper presents the structure of the algorithm, defines its properties, lists parameter values, describes detection efficiency results, and compares them with those for another algorithm. The obtained segmentation results, are satisfactory.

2

Automatyczna segmentacja sygnałów mowy w oparciu o metodę siatek o zmiennych parametrach

75%

Dulas J.

Przegląd Elektrotechniczny

|

2010

|

tom R. 86, nr 1

229-232

PL

Artykuł przedstawia nową metodę segmentacji sygnału mowy opracowaną przez autora i przetestowaną na zbiorze 50-ciu nagrań pochodzących od osób różnej płci i w różnym wieku. Metoda ta bazuje na rozpoznawaniu obrazów uzyskanych z analizy charakterystyk czasowych tych nagrań.

EN

The article describes a new method of the speech signal segmentation. This method was worked out by the author and tested on 50 records come from people different age and different sex. It is based on the time characteristic’s image recognition.

3

Automatic speech signal segmentation based on the innovation adaptive filter

75%

Makowski R. , Hossa R.

International Journal of Applied Mathematics and Computer Science

|

2014

|

tom Vol. 24, no. 2

259--270

EN

Speech segmentation is an essential stage in designing automatic speech recognition systems and one can find several algorithms proposed in the literature. It is a difficult problem, as speech is immensely variable. The aim of the authors’ studies was to design an algorithm that could be employed at the stage of automatic speech recognition. This would make it possible to avoid some problems related to speech signal parametrization. Posing the problem in such a way requires the algorithm to be capable of working in real time. The only such algorithm was proposed by Tyagi et al., (2006), and it is a modified version of Brandt’s algorithm. The article presents a new algorithm for unsupervised automatic speech signal segmentation. It performs segmentation without access to information about the phonetic content of the utterances, relying exclusively on second-order statistics of a speech signal. The starting point for the proposed method is time-varying Schur coefficients of an innovation adaptive filter. The Schur algorithm is known to be fast, precise, stable and capable of rapidly tracking changes in second order signal statistics. A transfer from one phoneme to another in the speech signal always indicates a change in signal statistics caused by vocal track changes. In order to allow for the properties of human hearing, detection of inter-phoneme boundaries is performed based on statistics defined on the mel spectrum determined from the reflection coefficients. The paper presents the structure of the algorithm, defines its properties, lists parameter values, describes detection efficiency results, and compares them with those for another algorithm. The obtained segmentation results, are satisfactory.

4

Automatyczne rozpoznawanie cyfr w języku polskim - identyfikacja fonemów szumowych

63%

Dulas J.

Przegląd Elektrotechniczny

|

2011

|

tom R. 87, nr 1

280-283

PL

Artykuł opisuje kolejny etap badań prowadzonych przez autora, zmierzających do stworzenia systemu umożliwiającego automatyczną identyfikację i sterowanie za pomocą cyfr wypowiadanych w języku polskim. Przedstawiono tu nową metodę odnajdywania fonemów szumowych. W trakcie badań wykorzystywana jest baza nagrań Corpora opracowana na Politechnice Poznańskiej uzupełniona o własne nagrania.

EN

The article describes next steps of the author's research which goal is building the automatic speech recognition and control system for Polish. The new method of noisy phonemes finding is presented. In the author's research the Corpora data base, made by Poznan Technical University scientists and own records are used.

5

Pitch period’s properties and the new method used for finding them

63%

Dulas J.

Przegląd Elektrotechniczny

|

2012

|

tom R. 88, nr 7a

297-300

EN

This article describes the pitch’s periods interesting properties. These periods are included in each vowel and voiced consonant. It also describes the new method of pitch period finding and their duration counting. These parameters are very important elements of the automatic speech recognition algorithm worked out by the author.

PL

Artykuł przedstawia interesujące właściwości okresów podstawowych tonu krtaniowego występującego we wszystkich samogłoskach i spółgłoskach dźwięcznych oraz nową metodę ich odnajdywania i wyznaczania ich długości. Poprawne odnajdywanie okresów podstawowych i wyznaczanie czasu ich trwania jest ważnym elementem algorytmu automatycznej identyfikacji słów opracowanego przez autora.

6

Szybka metoda identyfikacji fonemów szumowych występujących w cyfrach wypowiadanych w języku polskim

63%

Dulas J.

Przegląd Elektrotechniczny

|

2011

|

tom R. 87, nr 2

242-245

PL

Niniejszy artykuł jest sprawozdaniem z zakończonego, kolejnego etapu prac autora nad stworzeniem systemu automatycznej identyfikacji cyfr wypowiadanych w języku polskim. Przedstawia on metodę automatycznego rozpoznawania fonemów szumowych przetestowaną na 100 nagraniach cyfr "trzy" i "cztery" pochodzących od mówców różnej płci i w różnym wieku.

EN

This article is the coverage from the last, finished author's research aiming to build automatic speech recognition system for digits spoken in polish. It describes the method of automatic noisy phonemes recognition which was tested on 100 records of digit 3 and 4 received from speakers of different sex and age.

7

The new method of the inter-phonemes transitions finding

63%

Dulas J.

Przegląd Elektrotechniczny

|

2012

|

tom R. 88, nr 10a

135-138

EN

This article describes the new method of the inter-phonemes transition finding based on the image recognition. Automatic borders between phonemes finding is the same as the number of phonemes finding. This is an important factor used in Automatic Speech Recognition systems.

PL

Artykuł przedstawia nową metodę lokalizacji przejść międzyfonemowych opartą o analizę obrazów. Automatyczne określenie miejsc przejść międzyfonemowych jest równoznaczne z określeniem liczby fonemów występujących w danym wyrazie. Jest to ważny parametr wykorzystywany w systemach automatycznej identyfikacji sygnałów mowy. (Nowa metoda lokalizacji przejść międzyfonemowych).

8

Automatyczna identyfikacja cyfr dla mówców polskojęzycznych

63%

Dulas J.

Przegląd Elektrotechniczny

|

2010

|

tom R. 86, nr 5

15-18

PL

Artykuł przedstawia aktualny stan prac autora nad wdrożeniem systemu automatycznej identyfikacji cyfr wypowiadanych w języku polskim. Pokazuje również obecnie wykorzystywane techniki rozpoznawania mowy w innych językach oraz osiągane tam rezultaty. W trakcie badań wykorzystywana jest baza nagrań Corpora opracowana na Politechnice Poznańskiej.

EN

The article describes current progress in implementation of the automatic digits recognition system for polish design by the author. It also shows different ways of speech recognition used for the other languages and results of their applications. In the author’s research the Corpora data base, made by Poznan Technical University scientists, is used.

9

Automatic word's identification algorithm used for digits classification

51%

Dulas J.

Przegląd Elektrotechniczny

|

2011

|

tom R. 87, nr 11

230-233

EN

This article describes the results of some years of research into automatic digits' identification algorithm for Polish. The new method based on the image recognition received from time characteristics gives better results than well known frequency domain analyses.

PL

Artykuł przedstawia efekt kilkuletnich prac autora nad stworzeniem algorytmu automatycznej identyfikacji cyfr wypowiadanych w języku polskim. Nowatorska metoda wykorzystująca analizę obrazów otrzymanych z charakterystyk czasowych wypowiedzi pozwala na osiągnięcie lepszych rezultatów niż stosowane powszechnie analizy widmowe.