Wyniki wyszukiwania - BazTech

1

Discrimination between patients with CVDs and healthy people by voiceprint using the MFCC and pitch

Bourouhou Abdelhamid, Jilbab Abdelilah, Cherti Mohammed, Bourouhou Zaineb, Nacir Chafik

Diagnostyka

|

2021

|

Vol. 22, No. 4

9--16

EN

Heart diseases cause many deaths around the world every year, and his death rate makes the leader of the killer diseases. But early diagnosis can be helpful to decrease those several deaths and save lives. To ensure good diagnose, people must pass a series of clinical examinations and analyses, which make the diagnostic operation expensive and not accessible for everyone. Speech analysis comes as a strong tool which can resolve the task and give back a new way to discriminate between healthy people and person with cardiovascular diseases. Our latest paper treated this task but using a dysphonia measurement to differentiate between people with cardiovascular disease and the healthy one, and we were able to reach 81.5% in prediction accuracy. This time we choose to change the method to increase the accuracy by extracting the voiceprint using 13 Mel-Frequency Cepstral Coefficients and the pitch, extracted from the people's voices provided from a database which contain 75 subjects (35 has cardiovascular diseases, 40 are healthy), three records of sustained vowels (aaaaa…, ooooo… .. and iiiiiiii….) has been collected from each one. We used the k-near-neighbor classifier to train a model and to classify the test entities. We were able to outperform the previous results, reaching 95.55% of prediction accuracy.

2

Classification of cardiovascular diseases using dysphonia measurement in speech

Bourouhou Abdelhamid, Jilbab Abdelilah, Nacir Chafik, Hammouch Ahmed

Diagnostyka

|

2021

|

Vol. 22, No. 1

31--37

EN

Cardiovascular disease is the leading cause of death worldwide. The diagnosis is made by non-invasive methods, but it is far from being comfortable, rapid, and accessible to everyone. Speech analysis is an emerging non-invasive diagnostic tool, and a lot of researches have shown that it is efficient in speech recognition and in detecting Parkinson's disease, so can it be effective for differentiating between patients with cardiovascular disease and healthy people? This present work answers the question posed, by collecting a database of 75 people, 35 of whom suffering from cardiovascular diseases, and 40 are healthy. We took from each one three vocal recordings of sustained vowels (aaaaa…, ooooo… .. and iiiiiiii… ..). By measuring dysphonia in speech, we were able to extract 26 features, with which we will train three types of classifiers: the k-near-neighbor, the support vectors machine classifier, and the naive Bayes classifier. The methods were tested for accuracy and stability, and we obtained 81% accuracy as the best result using the k-near-neighbor classifier.

3

Speech and tremor tester - monitoring of neurodegenerative diseases using smartphone technology

Chronowski Maurycy, Kłaczyński Maciej, Dec-Ćwiek Małgorzata, Porębska Karolina, Sawczyńska Katarzyna

Diagnostyka

|

2020

|

Vol. 21, No. 2

31--39

EN

One of the most frequently diagnosed neurodegenerative disorders, along with Alzheimer’s disease, is Parkinson’s disease. It is a slowly progressing disease of the central nervous system that affects parts of the brain which are responsible for one’s motor functions. Despite the frequency of its occurrence among the elderly population, there has not yet been established a universal approach towards its certain diagnostics ante mortem. The study presents a pilot experiment regarding the assessment of the usefulness of simultaneous processing and analysis of speech signal and hand tremor accelerations for patient’s screening and monitoring of the progress in healing, using the data acquired with a mid-range Android smartphone. During the study, a mobile device of this kind was used to record the patients of the Department of Neurology, University Hospital of the Jagiellonian University in Kraków and a control group of healthy persons over the age of 50. The samples were then analysed and an attempt towards classification was made using statistical methods and machine learning techniques (PCA, SVM, LDA). It was shown that even for a limited population, the classifier reaches about 85% accuracy. Another topic discussed in the study is the possibility of implementing a fully automated mobile system for the monitoring of the disease’s progression. Propositions of further research were also drawn.

PL

Jednym z najczęściej diagnozowanych zaburzeń neurodegeneracyjnych, obok choroby Alzheimera, jest choroba Parkinsona. To wolno postępująca choroba zwyrodnieniowa ośrodkowego układu nerwowego, która zajmuje obszary mózgu odpowiedzialne za motorykę. Pomimo powszechności choroby wśród osób starszych, do tej pory nie została opisana uniwersalna metoda jej pewnego zdiagnozowania. Praca przedstawia pilotażowe badanie dotyczące określenia przydatności i możliwości wykorzystania metod jednoczesnego przetwarzania i analizy sygnału mowy oraz sygnału przyspieszenia drgań kończyny górnej w kontekście badań przesiewowych lub obiektywnego monitorowania postępu leczenia chorób neurodegeneracyjnych, z wykorzystaniem danych pozyskanych za pomocą średniej klasy smartfonu z systemem Android. W ramach badania wykonano za pomocą urządzenia mobilnego nagrania pacjentów Oddziału Neurologii Szpitala Uniwersyteckiego w Krakowie ze zdiagnozowaną chorobą Parkinsona oraz osób zdrowych powyżej 50 roku życia. Próbki poddano analizie i wstępnej klasyfikacji z wykorzystaniem metod statystycznych oraz technik uczenia maszynowego (PCA, SVM, LDA). Pokazano, że skuteczność klasyfikacji już dla niewielkiej populacji sięga około 85%. W pracy omówiono również możliwość implementacji w pełni automatycznego systemu mobilnego monitorowania przebiegu choroby, a także przedstawiono propozycję dalszych badań w tym kierunku.

4

Fusing the electromagnetic articulograph, high-speed video cameras and a 16-channel microphone array for speech analysis

Mik Ł., Lorenc A., Król D., Wielgat R., Święciński R., Jędryka R.

Bulletin of the Polish Academy of Sciences. Technical Sciences

|

2018

|

Vol. 66, nr 3

257--266

EN

Electromagnetic articulography (EMA) is one of the instrumental phonetic research methods used for recording and assessing articulatory movements. Usually, articulographic data are analysed together with standard audio recordings. This paper, however, demonstrates how coupling the articulograph with devices providing other types of information may be used in more advanced speech research. A novel measurement system is presented that consists of the AG 500 electromagnetic articulograph, a 16-channel microphone array with a dedicated audio recorder and a video module consisting of 3 high-speed cameras. It is argued that synchronization of all these devices allows for comparative analyses of results obtained with the three components. To complement the description of the system, the article presents innovative data analysis techniques developed by the authors as well as preliminary results of the system’s accuracy.

5

Artificial intelligence in medical diagnosis of some brain dysfunctions

Ciota Z., Napieralski A.

International Journal of Microelectronics and Computer Science

|

2015

|

Vol. 6, nr 1

1--5

EN

The paper presents an analysis of objective evaluation of speech quality. As an example, the recovering process after stroke for patients with vascular lesion of a central nervous system has been taken into account. Application of neural networks gives possibility for objective evaluation of speech quality of patient suffering from disorder of speech motor.

6

Baza danych nagrań mowy dla analizy porównawczej różnojęzycznych fonemów

Mąsior M., Igras M., Ziółko M., Kacprzak S.

Studia Informatica

|

2013

|

Vol. 34, nr 2B

79--87

PL

Artykuł prezentuje system gromadzenia, archiwizacji i akustycznej analizy wielojęzycznych próbek mowy. Głównym celem badań jest analiza porównawcza fonemów dla kilkuset języków i stworzenie drzewa genealogicznego języków świata. Opisana została implementacja systemu, jako bazy danych z portalem internetowym. Przedstawiono informacje dotyczące zawartości i formy bazy, perspektyw rozwoju i zastosowań w lingwistyce komputerowej.

EN

The paper presents a system of collecting and analyzing multilanguage speech samples for research on characteristics of phonemes in several hundred world languages. We describe the implementation: database and webpage. The content and form of the database and applications for development of the new methods of speech analysis are presented.

7

Methods of deformed voice signal evaluation after larynx surgery

Wszołek W., Kłaczyński M.

Mechanics / AGH University of Science and Technology

|

2009

|

Vol. 28, no. 1

31-37

EN

In the work has been shown from studies concerning the application of modified acoustic signal processing methods to the task of evaluation and classification of larynx surgery effects. The goal of the standard speech recognition studies is to reveal the semantic aspects of the pronounced text. In the tasks of medical diagnosis employing the speech signal analysis the semantic aspects are insignificant. The required signal characteristics should be as sensitive as possible to small deformations of the layers directly related to the voice functioning and the structure of vocal tract. The goal of the work is presentation of voice quality after various surgical treatments, performed in the ENT area. The research subject is the speech articulation process itself and all its pathological deformations, which determines both the used signal analysis tools as well as the techniques of the selected objects recognition, which are the forms of the particular ill person speech deformation forms in comparison to the speech of the whole sound people population. The evaluation has been carried out both for voice quality after larynx surgery as well as voice quality after surgical treatment of resonance cavities (nose, paranasal sinussis). The study was oriented towards the construction of systems based on the analysis of objectively registered acoustic signals of deformed speech.

PL

W pracy przedstawiono badania dotyczące metod przetwarzania sygnału akustycznego do oceny i klasyfikacji mowy po zabiegach w obrębie kanału głosowego. W zagadnieniach rozpoznawania mowy, problem dotyczy ujawniania semantycznych aspektów wypowiedzi. Natomiast w zagadnieniach diagnostyki medycznej przy wykorzystaniu sygnału mowy, cechy semantyczne są nieistotne. Poszukiwane cechy sygnału mowy winny być wrażliwe na małe deformacje, które mogą wystąpić w poszczególnych warstwach kanału głosowego. Celem pracy jest ocena jakości głosu po różnorodnych zabiegach chirurgicznych wykonanych w obszarze kanału głosowego. Tematem badań jest zarówno sam proces artykulacji mowy, jak i jego patologiczne deformacje. Diagnostykę narządu głosu można określić jako jednoznaczne rozpoznanie cech aktualnego stanu źródła głosu na podstawie zespołu istotnych cech akustycznych, zwartych w sygnale akustycznym. Ocena jakości głosu została przeprowadzona dla osób po chirurgicznym leczeniu krtani, nosa oraz zatok przynosowych. Badania zostały ukierunkowane na stworzenie systemu analizy umożliwiającego obiektywne rozpoznawanie deformacji sygnału mowy.

8

Isolated word descriptors as control parameters of the computer applications

Porwik P.

Journal of Medical Informatics & Technologies

|

2006

|

Vol. 10

35--46

EN

This paper is an extended version of the MIT'06 conference contribution. During the conference, many inquiries about the used techniques were performed. Hence, in the paper some parts of investigations were explained and discussed, with greater accuracy. It is shown that the computer applications can be controlled by a human voice. The computer controlling processes are available by means of utterance of isolated words, where application events with the aid of user's voice can be serviced. The voice usage can be convenient for blind or partially sighted users or for persons with limb paresis. The Microsoft application events, by means of the practicable Microsoft Windows firmware MSAA® technology can be analysed. Such technology, together with isolated word descriptors, as voice recognition system, has been presented.

9

Proces komunikacji słownej i jego teorie

Wykowska M., Bieda R.

Mechanika / Akademia Górniczo-Hutnicza im. St. Staszica

|

2004

|

T. 23, z. 1

75-95

PL

Artykuł stanowi zbiór treści dotyczących procesu komunikacji słownej, jaka zachodzi między ludźmi. Podana jest w nim krótka charakterystyka procesu słyszenia z anatomiczną, funkcjonalną klasyfikacją narządu słuchu i jego właściwościami oraz procesu mówienia z nakreśleniem pracy narządu mowy z rodzajem artykulacji. Omawiane są rodzaje sygnału mowy, metody analizy i syntezy mowy, jej cechy binarne i dystynktywne, zagadnienia percepcji głosek (fonemów) polskich i ich klasyfikacja. Zwrócono także uwagę na konieczność znajomości relacji między formą optyczną i akustyczną oraz sposobu transponowania z jednej formy na drugą, zwłaszcza w audiologii słownej ze względu na występowanie błędów interpretacyjnych. Całość zamykają teorie precepcji mowy uwzględniające specyfikę mowy w porównaniu z pozostałymi rodzajami dźwięku z rejestracją wypowiedzi w postaci oscylogramów i spektogramów, których analiza może być pomocna w audiometrii słownej.

EN

The paper consists of a collection of issues concerning process of interpersonal verbal communication. A short descriphon of hearing process together with anatomical and functional classification of the hearing organ and its properties as well as description of the speech organ's functioning with the type of articulation is also included. Types of speech signals, methods of speech analysis and synthesis, binary and distinctive language and their classification are all discussed. Attention is drawn to the necessity of being aware of the relation between the optic and acoustic form and the means of transponing from one to the other, especially due to verbal audiology with respect to the interpretational errors. Concluding remarks arę concerned with those theories of speech perception which pay respect to speech as being specific compared to other types of sound with speech recording in form of oscilograms and spectograms, analysis of which may be helpful in verbal audiometrics.

10

Analiza czasowo-częstotliwościowa sygnałów niestacjonarnych z przykładami zastosowań

Lenort F.

Prace Instytutu Lotnictwa

|

2001

|

Nr 1 (164)

1-76

PL

W praktyce spotykamy się często z koniecznością analizy krótkotrwałych sygnałów o zmiennych własnościach spektralnych w czasie. Odpowiedzi impulsowe obiektów zanikają często w czasie krótszym od jednej sekundy. Analiza modalna takich sygnałów jest utrudniona i wymaga specjalnych metod. Prowadzone sa intensywne prace nad metodami analizy mowy przez komputer. W pracy przytoczono podstawowe informacje o metodach analizy czasowo-częstotliwościowej, spotykanych w literaturze. Zwrócono uwagę na ich zalety i ograniczenia. stosowana jest też do tych celów szybka transformata Fouriera. W rozprawie zaproponowano algorytm szybkiego obliczania transformaty Fouriera o zwiększonej rozdzielczości w dolnym zakresie częstotliwości i metodę rekurencyjną obliczania transformaty Fouriera dla potrzeb analizy czasowo-częstotliwościowej. Wskazano na zalety i wady opracowanych metod. Opracowane algorytmy szybkiego obliczania transformaty Fouriera zastosowano do analizy modalnej odpowiedzi impulsowych samolotu celem określenia odporności na drgania flatterowe. Wyprowadzono układ równań, opisano zasady określania parametrów składowych postaci drgań. Metodę przetestowano na danych modelowych i rzeczywistych. Poddano też analizie odcinki zanikających drgań buffetingowych celem oceny własności flatterowych samolotu. Przedstawiono wyniki analizy czasowo-częstotliwościowej sygnałów akustycznych, muzycznych i mowy. Zwrócono uwagę na zastosowanie praktyczne metody do rozpoznawania słów przez komputer. Przytoczono wyniki wykonanych prac w tym zakresie, opisano problemy do rozwiązania.

EN

This paper presents a method of the time-frequency analysis and its application to modal analysis and numerical computing of the non-parametric time-frequency representation for non-stationary signals. To achieve this aim, an efficient algotithm for calculation discrete Fourier transform was developed. This algorithm offers good frequency resolution at low spectral frequencies. High resolution is obtained by increasing the density of sampling for the analyzed signals. In research practice it has been proved that the formula for the vibration damping coefficients can be invented, emploing the Fourier transform. Fourier transform was also used to the modal frequencies identification. The recurrence procedure for calculating of the discrete Fourier transform, which enables very fast calculations especially in the case of precise time-frequency representations approach, has been established. These methods have been tested both on the model and the experimental data. The decomposition of the structure aircraft impulse response in flight is presented togother with time-Frequency representations (contour maps) for acoustic signals.