Nowa wersja platformy, zawierająca wyłącznie zasoby pełnotekstowe, jest już dostępna.
Przejdź na https://bibliotekanauki.pl
Preferencje help
Widoczny [Schowaj] Abstrakt
Liczba wyników

Znaleziono wyników: 9

Liczba wyników na stronie
first rewind previous Strona / 1 next fast forward last
Wyniki wyszukiwania
help Sortuj według:

help Ogranicz wyniki do:
first rewind previous Strona / 1 next fast forward last
|
|
tom Vol. 3, nr 1
87-99
EN
This document contains the novel approach that use other orthogonal sources of information to the acoustic input that not only considerably improve the performance in severely degraded conditions, but also are independent to the type of noise and reverberation. Visual speech is one such source not perturbed by the acoustic environment and noise. It was proposed own approach to lip-tracking for audio-visual speech recognition system and novel audio-visual fusion technique. It was presented video analysis of visual speech for extraction visual features from a talking person in color video sequences. I was developed a method for automatically face, eyes, region of lips, region of corners and detection of contour of lips. Finally, the paper will show results of audio -visual speech recognition in noisy environments.
|
2006
|
tom Vol. 2, nr 1
55-64
EN
The purpose of this work is to explain the theoretical issues and implementational techniques related to the fascinating field of speech recognition. The topic of discussion are focused on some of the well-established and widely used speech coding standards, required to speech recognition and speaker identification. By studying the most successful standards and understanding their principles, performance and limitations, it is possible to apply a particular technique to a given situation according to the underlying constraints - with the ultimate goal being the development of next-generation algorithms, with improvements in all aspects. This document contains own created methods to determine the beginning and end of isolated words in audio speech. To extraction of the audio features of person's speech, in this work it was applied the mechanism of cepstral speech analysis. Finally, the paper will show results of speech coding.
|
|
tom Vol. 1 nr 1
181-190
EN
Mainstream automatic speech recognition has focused almost exclusively on the acoustic signal. The performance of these systems degrades considerably in the real word in the presence of noise. It was needed novel approaches that use other orthogonal sources of information to the acoustic input that not only considerably improve the performance in severely degraded conditions, but also are independent to the type of noise and reverberation. Visual speech is one such source not perturbed by the acoustic environment and noise. In this paper, it was presented own approach to lip-tracking for audio-visual speech recognition system. It was presented video analysis of visual speech for extraction visual features from a talking person in color video sequences. It was developed a method for automatically face, eyes, lip's region, lip's corners and lip's contour de-tection. Finally, the paper will show results of lip-tracking depending on various factors (lighting, beard).
4
Content available remote Person identification system using an identikit picture of the suspect
63%
|
|
tom Vol. 42, nr 4
865--873
EN
The article presents a person identification system, which may work with an identikit picture. The identikit picture (sketch) is often used in practice as an investigative tool to search for the perpetrators of an unknown identity. With a portrait of the perpetrator of a crime, one may identify the criminal. When the face database for comparisons is large, this is labour-absorbing. With the help of a computer system of face identification, this process becomes quick and easy.
EN
In a person identification or verification, the prime interest is not in recognizing the words but determining who is speaking the words. In systems of person identification, a test of signal from an unknown speaker is compared to all known speaker signals in the set. The signal that has the maximum probability is identified as the unknown speaker. In security systems based on person identification and verification, faultless identification has huge meaning for safety. In systems of person verification, a test of signal from a known speaker is compared to recorded signals in the set, connected with a known tested persons label. There are more than one recorded signals for every user in the set. In aim of increasing safety, in this work it was proposed own approach to person verification, based on independent speech and facial asymmetry. Extraction of the audio features of person's speech is done using mechanism of cepstral speech analysis. The idea of improvement of effectiveness of face recognition technique was based on processing information regarding face asymmetry in the most informative parts of the face the eyes region.
|
|
tom Nr 1(5)
15-39
PL
Praca dotyczy wykorzystania cech biometrycznych, wynikających z wyglądu twarzy, do celów weryfikacyjnych. Opisano w niej różne metody doboru i analizy cech podczas rozpoznawania twarzy. Zamieszczony opis zawiera przede wszystkim możliwość analizy, a w późniejszych etapach – weryfikacji tożsamości na podstawie cech asymetrycznych twarzy. Zaproponowano nową metodę weryfikacji na podstawie wyznaczonych punktów charakterystycznych, bazującą na odpowiednim zakodowaniu informacji o asymetrii twarzy w postaci wektorów obserwacji i rozpoznawaniu z wykorzystaniem ukrytych modeli Markowa.
EN
The research refers to making use of biometrical features, beeing result of the look of the face, for the verification purposes. Different methods of collecting and analysing of features during face recognition, have been described. Presented description conveys first of all the analyse possibility and in the further stages – identity verification on the basis of face assymetry. The new method of verification has been proposed, taking into account chosen characteristic poins, being based on proper information code concerning face assymetry, represented by both – observation vectors and recognition, with the use of hidden Markow’s models.
7
Content available Biometric Systems Based on Palm Vein Patterns
63%
EN
The work covers issues related to the design of biometric systems based on the hand vascular pattern. The study includes analysis of various stages of biometric systems design ranging from acquisition, feature extraction and biometric pattern creation for verification methods. The extraction methods based on two-dimensional density function and the extraction of the characteristic points - minutiae are presented. The article features the results of tests carried out on two different bases of blood vessels in a hand.
EN
This paper focuses on combining audio-visual signals for Polish speech recognition in conditions of the highly disturbed audio speech signal. Recognition of audio-visual speech was based on combined hidden Markov models (CHMM). The described methods were developed for a single isolated command, nevertheless their effectiveness indicated that they would also work similarly in continuous audiovisual speech recognition. The problem of a visual speech analysis is very difficult and computationally demanding, mostly because of an extreme amount of data that needs to be processed. Therefore, the method of audio-video speech recognition is used only while the audiospeech signal is exposed to a considerable level of distortion. There are proposed the authors’ own methods of the lip edges detection and a visual characteristic extraction in this paper. Moreover, the method of fusing speech characteristics for an audio-video signal was proposed and tested. A significant increase of recognition effectiveness and processing speed were noted during tests – for properly selected CHMM parameters and an adequate codebook size, besides the use of the appropriate fusion of audio-visual characteristics. The experimental results were very promising and close to those achieved by leading scientists in the field of audio-visual speech recognition.
first rewind previous Strona / 1 next fast forward last
JavaScript jest wyłączony w Twojej przeglądarce internetowej. Włącz go, a następnie odśwież stronę, aby móc w pełni z niej korzystać.