Wyniki wyszukiwania - BazTech

1

Voice Controlled Games – The approach and challenges of implementing speech recognition and voice control in games

Strzałko Dominik

Annals of Computer Science and Information Systems

|

2021

|

Vol. 26

229--230

EN

The subject of voice controlled games is quite underrated and exciting. Even though there are not that many papers focusing on that particular subject, we can find many papers describing both, concept of voice recognition, and controlling the game using natural language. The most illustrative example of usage of that kind of control are games in VR/AR/MR and games for people with disabilities. Moreover, almost every game can benefit from voice commands e.g. for controlling the user interface or units in strategy game. Therefore, voice controlled games are an interesting and innovative concept for game designers and developers.

2

Performance Analysis of Developed Multimodal Biometric Identity Verification System

Czyżewski A., Bratoszewski P., Hoffmann P., Kurowski A., Lech M., Szczodrak M.

Elektronika : konstrukcje, technologie, zastosowania

|

2018

|

Vol. 59, nr 4

35--42

EN

The bank client identity verification system developed in the course of the IDENT project is presented. The total number of five biometric modalities including: dynamic handwritten signature proofing, voice recognition, face image verification, face contour extraction and hand blood vessels distribution comparison have been developed and studied. The experimental data were acquired employing multiple biometric sensors installed at engineered biometric terminals. The biometric portraits of more than 10 000 bank clients were registered and stored in the database during the presented study and then verified experimentally. Problem-specific survey was done on the basis of questionnaires completed by the subjects in order to assess the look and feel of the developed biometric system as well as to collect opinions concerning its implementation in banking outlets. A discussion concerning the quality of registered data and results achieved in the study is included.

PL

W artykule przedstawiono system weryfikacji tożsamości klienta bankowego opracowany w ramach projektu IDENT. Opracowano i przebadano pięć metod biometrycznych, w tym: rozpoznawanie dynamicznej reprezentacji podpisu odręcznego, weryfikację głosową, weryfikację obrazu twarzy, rozpoznawanie ekstrahonego konturu twarzy i porównywanie rozkładu naczyń krwionośnych dłoni. Przedstawione w artykule dane badawcze pozyskano za pomocą wielu czujników biometrycznych zainstalowanych w skonstruowanych stanowiskach biometrycznych. Łącznie z wykorzystaniem skonstruowanych stanowisk zarejestrowano próbki biometryczne pochodzące od ponad 10 000 klientów banku. W trakcie badania uczestnicy, tzn. klienci i doradcy bankowi byli proszeni o wypełnienie ankiet w celu ułatwienia oceny wyglądu i sposobu działania opracowanego systemu biometrycznego oraz zebrania opinii na temat jego przyszłego wdrożenia w placówkach bankowych. W artykule przedstawiono wyniki analiz zgromadzonych danych, z uwzględnieniem wzajemnej korelacji poszczególnych modalności oraz semantycznej analizy ankiet wypełnionych przez uczestników badania.

3

Voice recognition through the use of Gabor transform and heuristic algorithm

Woźniak M., Połap D.

International Journal of Electronics and Telecommunications

|

2017

|

Vol. 63, No. 2

159--164

EN

Increasingly popular use of verification methods based on specific characteristics of people like eyeball, fingerprint or voice makes inventing more accurate and irrefutable methods of that urgent. In this work we present voice verification based on Gabor transformation. Proposed approach involves creation of spectrogram, which serves as a habitat for the population in selected heuristic algorithm. The use of heuristic allows for feature extraction to enable identity verification using classical neural network. The results of the research are presented and discussed to show efficiency of the proposed methodology.

4

System rozpoznawania komend głosowych z zastosowaniem systemu wbudowanego

Tomasik P., Barański P.

Elektronika : konstrukcje, technologie, zastosowania

|

2014

|

Vol. 55, nr 9

116-118

PL

W artykule przedstawiony został system do rozpoznawania komend głosowych zbudowany w oparciu o 32-bitowy mikrokontroler rodziny ARM7. Zadaniem systemu jest rozpoznanie komendy w oparciu o zadany słownik komend, a następnie wysterowanie odpowiedniego wyjścia. Wypowiedziana komenda zostaje zamieniona na wektor cech wypracowany w oparciu o analizę cepstralną. Ten wektor cech jest porównywany z wektorami cech komend wzorcowych. Porównywanie wykonywane jest metodą nieliniowej transformacji czasowej (ang. Dynamic Time Warping, DTW), w celu eliminacji problemu różnej szybkości wypowiadanych komend.

EN

The article presents a voice recognition system implemented on a 32-bit microcontroller of ARM7 core. The system recognizes a spoken commend from a prepared dictionary and drives the corresponding output. A spoken command is recorded and then converted into a representative vector of cepstrum features. This vector is compared against feature vectors of the commands from the dictonary. The comparison is carried out by means of Dynamic Time Warping algorithm (DTW) which remedies the problem of different duration of spoken commands.

5

Voice control of visual supervision system based on spectral analysis using mean square error measure

Pęksiński J., Mikołajczak G.

Poznan University of Technology Academic Journals. Electrical Engineering

|

2010

|

No. 63

89-94

EN

The authors presented in the study human voice recognition algorithm, which functioning is based on analysis of spectrograms with MSE (mean square error) quality measure utilization, which is employed for digital images comparing. The algorithm of human voice recognition proposed in the article was practically used for voice control of visual supervision system.

6

Word extraction method in human speech processing

Porwik P., Proksa R.

Journal of Medical Informatics & Technologies

|

2008

|

Vol. 12

209--216

EN

A major problem in isolated-word speech recognition systems is detection of the beginning and ending boundaries of the word. It is an essential of speech recognition algorithms, where signal speech segments should be reliably separated. During speech recognition background noise is also recorded, hence the word isolation is difficult. The parametric representation of the speech must provide enough information to characterize the words and to differentiate between acoustically similar words. In this paper the method of words extraction from human speech will be considered.

7

Text-independent speaker recognition

Bobulski J.

Computing, Multimedia and Intelligent Techniques

|

2007

|

Vol. 3, nr 1

101-107

EN

In this paper, text-independent speaker recognition method based on Wavelet Transform and melcepstrum is presented. The results of experiments point the best parameters of Wavelet Transform for speaker identification, and can be useful for design speaker identification systems. This kind method of person identification is useful in services such as banking by telephone, access authorization to resources and for forensic purpose.

8

Spoken digit recognition using generalized Brain-State-in-a-Box Model

Starczewski K.

Przegląd Elektrotechniczny

|

2006

|

R. 82, nr 1

43-46

EN

In this paper an approach to spoken digit recognition using generalized Brain-State-in-a-Box (gBSB) model is shown. The author presents several methods of feature extraction from sampled human voice signals and some major problems of implementation. Experimental results and working examples are included.

PL

Praca przedstawia nowatorskie podejście do problemu rozpoznawania cyfr z mowy ludzkiej z wykorzystaniem modelu Brain-State-in-a-Box (gBSB). Autor prezentuje również kilka metod ekstrakcji unikalnego wektora cech dla danej cyfr z przygotowanych wcześniej próbek dźwiękowych. Omówione są pewne problemy implementacyjne. Całość jest zilustrowana przykładami.

9

Systemy rozpoznawania mowy w zastosowaniach telekomunikacyjnych

Korzec Z., Jędrzejewski D., Kluszczyński K.

Elektronika : prace naukowe

|

1998

|

nr 3

149-161

PL

Systemy rozpoznawania mowy znajdują w ostatnich latach coraz szersze zastosowania w wielu dziedzinach, a wprowadzanie ich do telekomunikacji przebiega szczególnie dynamiczne. W artykule przedstawiony jest syntetyczny zbiór informacji dotyczących realizacji tego rodzaju systemów a następnie podane zostały przykłady ich zastosowań w automatycznych centralach telefonicznych, centrach informacji, i innych. Na tle aktualnego stanu wiedzy w tej dziedzinie omówione zostały prace projektowe i badawcze z tego zakresu prowadzone w Instytucie Elektroniki Politechniki Łódzkiej.

EN

Voice processing has been a subject of research for several decades, but in the past few years the technology was developed sufficiently to make the voice recognition systems ready for wide range of applications. In this paper, after some general remarks on speech recognition systems technology, the main applications to telecommunications are reviewed. There are two broad categories of speech recognition applications to telecommunications; namely, those which provide cost reduction, and those which generate revenue. Cost reduction applications are those which replace human attendants by speech recognition systems. The second wide category include applications which provide services that were previously not available: touch tone replacement by voice recognition, voice access information systems, speaker verification, etc. In the final sections some glimpses on the works carried in this field in Institute of Electronics are incuded.

10

Rozpoznawanie głosów na podstawie dyskryminacji amplitudowej sygnału mowy

Shomali A., Kapusta M.

Automatyka / Akademia Górniczo-Hutnicza im. Stanisława Staszica w Krakowie

|

1998

|

T. 2, z. 1

91--103

PL

Artykuł przedstawia nowy sposób opisu sygnałów mowy oraz jego wykorzystanie dla celów rozpoznawania głosu. Doświadczenie z rozpoznawaniem, opisane w artykule, oparto na nagraniach siedmiu głosów (żeńskie i męskie) w trzech różnych językach (polski, arabski i angielski). Głosy były zarejestrowane w warunkach laboratoryjnych (w komorze bezechowej za pomocą wysokiej klasy aparatury rejestrującej) i w pokojowych, za pomocą karty dźwiękowej komputera osobistego i zwykłego mikrofonu (wyższy poziom hałasu). Zaproponowana w artykule nowa metoda rozpoznawania jest wyjątkowo prostą (nieporównywalnie mniejsza złożoność obliczeniowa niż w przypadku innych metod) i skuteczną metodą rozpoznawania. Metoda ta nakłada zdecydowanie mniej ograniczeń co do treści (jak i języka) badanych wypowiedzi niż inne metody. Wykazano bowiem, że za jej pomocą można rozpoznać głos niezależnie od treści wypowiedzi, jak i od jej, mimo iż rozpoznawane wypowiedzi były rozdzielone w sposób «ślepy» (tj. niezależnie od początku i końca słów), to metoda ta daje zadawalające wyniki. Ponadto wypowiedzi nagrywane w warunkach pokojowych były zdecydowanie dobrze rozpoznawane. Pokazano również, że ewentualna realizacja sprzętowa (bądź programowa w języku maszynowym) obliczeń związanych z przedstawioną metodą dyskryminacji amplitudowej jest prosta i co zatem idzie mało kosztowna. W artykule zasygnalizowano jednocześnie przydatność metody dyskryminacji amplitudowej dla celów rozpoznawania języka. Zaproponowana w artykule metoda (jak większość metod rozpoznawania głosu i mowy) jest zależna od właściwości kanału transmisyjnego sygnału, zwłaszcza od jego wzmocnienia. W związku z tym przedstawiono łatwy sposób wyrównania wpływu wzmocnienia kanału poprzez dopasowania poziomu mocy (lub średniej amplitudy) porównywanych sygnałów.

EN

In this correspondence a new method for speech signal presentation applying amplitude discrimination is illustrated, the appliance of this method to voice recognition is discussed. The voice recognition experiment described here is established on seven different voices (female and male) that were registered in three languages (Polish, Arabie and English). Two variants of registration apparatus and environments were considered: anechoic chamber (noise free, high elass microphone and recording apparatus) and room conditions (noisy, PC multimedia were used for registration). Amplitude discrimination method is particularly simple (incomparably lower computing complexity in contrast with other methods) and quite efficient. The presented method definitively imposes fewer limitations on utterances' text and language for it proved to be almost text and language independent operating well and accomplishing good results. Moreover, despite the fact that utterances were «blindly» segmented (i.e. independently of words' boundaries) adequate results were achieved. Furthermore, utterances registered in noisy environment were also defmitely well elassified. In addition, it was shown that hardware implementation (of software using machine language) of computations associated with amplitude discrimination would be simple and therefore inexpensive. Besides, the usability of amplitude discrimination for automatic language identification was also signalized. The introduced method (just like most other voice and speech recognition methods) is sensitive to channel characteristies specially to gain. Therefore, an easy technique for gain influence compensation applying power level or mean amplitude matching (gain normalization) of compared signals is suggested.