Search results - BazTech

1

Comparison of the efficiency of time and frequency domain descriptors for the classification of selected wind instruments

Tyburek Krzysztof, Namli Ömer Bora

Studia i Materiały Informatyki Stosowanej | 2022 | T. 14, nr 3 | 13-19

EN

By analyzing the physical features of an audio signal in the time domain and the frequency domain, it is possible to determine its source and to classify it automatically with appropriate algorithms. Sound indexing deals with the analysis of different classes and sources, including signals from musical instruments. By calculating descriptor values and classifying them, we obtain information about the type of instrument and its construction, most often the material from which it was made. The research showed that one composition of the feature vector is used to describe brass instruments and a different one to describe woodwind instruments. In this case, the key feature may be the higher harmonic components in the frequency domain. The experiments attempt to parameterize wind instruments (aerophones) in order to compare the classification effectiveness of time and spectral descriptors. Sounds from a tuba, a flute and a soprano saxophone were used in the research. The sample population for each instrument was 21.

PL (translated from Polish)

By analyzing the physical features of the time domain and the frequency domain of an audio signal, one can determine its source and classify it automatically using suitable algorithms. Sound indexing concerns the analysis of various classes and sources, including signals originating from musical instruments. By computing descriptor values and classifying them, we obtain information about the type of instrument and its construction, most often the material from which it was made. During the research it turned out that one composition of the feature vector is used to describe brass instruments and another for woodwind instruments. In this case the key feature may be the higher harmonic components in the frequency representation of the sound. The experiments attempt to parameterize wind instruments (aerophones) in order to compare the classification effectiveness of time-domain and spectral descriptors. Sounds from a tuba, a flute and a soprano saxophone were used in the research. The sample population for each instrument was 21.
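As a rough illustration of what a time-domain and a frequency-domain descriptor can look like, the sketch below computes a zero-crossing rate and a spectral centroid for a synthetic tone. The abstract does not name the descriptors used in the paper, so both are assumed here as common examples; the function names, the test tone and the naive DFT are illustrative only.

```python
import math

def zero_crossing_rate(signal):
    """Time-domain descriptor: fraction of adjacent sample pairs that change sign."""
    crossings = sum(1 for a, b in zip(signal, signal[1:]) if (a < 0) != (b < 0))
    return crossings / (len(signal) - 1)

def spectral_centroid(signal, sample_rate):
    """Frequency-domain descriptor: magnitude-weighted mean frequency (naive DFT)."""
    n = len(signal)
    weighted, total = 0.0, 0.0
    for k in range(n // 2 + 1):  # non-negative frequency bins only
        re = sum(x * math.cos(2 * math.pi * k * i / n) for i, x in enumerate(signal))
        im = -sum(x * math.sin(2 * math.pi * k * i / n) for i, x in enumerate(signal))
        mag = math.hypot(re, im)
        weighted += (k * sample_rate / n) * mag
        total += mag
    return weighted / total if total else 0.0

# Hypothetical input: a 1000 Hz sine sampled at 8000 Hz (not data from the paper).
sample_rate = 8000
tone = [math.sin(2 * math.pi * 1000 * i / sample_rate + 0.1) for i in range(512)]

# A two-element feature vector that a classifier could consume.
feature_vector = (zero_crossing_rate(tone), spectral_centroid(tone, sample_rate))
```

For a pure tone the centroid sits at the tone frequency; a sound with strong higher harmonics would pull it upward, which is one way spectral descriptors can separate instrument families.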

2

An Expert System for Automatic Classification of Sound Signals

Tyburek Krzysztof, Kotlarz Piotr

Journal of Telecommunications and Information Technology | 2020 | nr 2 | 86-90

EN

In this paper, we present the results of research focusing on methods for recognition/classification of audio signals. We consider the results of the research project to serve as a basis for the main module of a hybrid expert system currently under development. In our earlier studies, we conducted research on the effectiveness of three classifiers: fuzzy classifier, neural classifier and WEKA system for reference data. In this project, a particular emphasis was placed on fine-tuning the fuzzy classifier model and on identifying neural classifier applications, taking into account new neural networks that we have not studied so far in connection with sound classification methods.

3

Video summarization using color features and efficient adaptive threshold technique

Cvetkovic S., Jelenkovic M., Nikolic S. V.

Przegląd Elektrotechniczny | 2013 | R. 89, nr 2a | 247-250

EN

Most methods for video summarization rely on complicated clustering algorithms, which makes them too computationally complex for real-time applications. In this paper we propose an efficient approach to video summary generation that does not rely on complex clustering algorithms and does not require frame length as a parameter. Our method combines MPEG-7 Color Layout Descriptors with an adaptive threshold technique to detect shot boundaries. For each shot a keyframe is extracted, and similar keyframes are eliminated in a simple manner. A MOS measure evaluation on a standard dataset shows that the method produces video summaries of the highest visual quality.

PL (translated from Polish)

The paper proposes an uncomplicated algorithm for creating summaries of video material. The method combines the MPEG-7 Color Layout Descriptor with an adaptive threshold technique, which makes it possible to detect shot boundaries. Of many identical or similar frames, only one is kept.
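The shot-boundary step described in this abstract can be sketched as follows. This is a minimal illustration and not the authors' method: the abstract does not specify the adaptive threshold rule, so a sliding-window "mean plus k standard deviations" rule with a floor is assumed; the per-frame vectors stand in for Color Layout Descriptor coefficients; and the middle frame of each shot is assumed as the keyframe. All function and parameter names are hypothetical, and the elimination of similar keyframes is left out.

```python
import math
from statistics import mean, pstdev

def shot_boundaries(descriptors, window=5, k=3.0, min_jump=0.1):
    """Return indices of frames that start a new shot.

    descriptors: one feature vector per frame (stand-in for CLD coefficients).
    Assumed rule: a cut is declared when the distance to the previous frame
    exceeds max(mean + k * std of the last `window` distances, min_jump).
    """
    dists = [math.dist(a, b) for a, b in zip(descriptors, descriptors[1:])]
    boundaries = []
    for i, d in enumerate(dists):
        recent = dists[max(0, i - window):i]
        if not recent:
            continue  # no history yet for the first transition
        threshold = max(mean(recent) + k * pstdev(recent), min_jump)
        if d > threshold:
            boundaries.append(i + 1)  # frame i + 1 opens the new shot
    return boundaries

def keyframes(boundaries, n_frames):
    """Pick the middle frame of every shot (an assumed selection rule)."""
    starts = [0] + boundaries
    ends = boundaries + [n_frames]
    return [(s + e - 1) // 2 for s, e in zip(starts, ends)]

# Hypothetical input: 6 dark frames followed by 6 bright frames, with slight flicker.
frames = [[(0.1 if j < 6 else 0.9) + 0.01 * (j % 2)] * 3 for j in range(12)]
cuts = shot_boundaries(frames)
keys = keyframes(cuts, len(frames))
```

Because the threshold follows recent frame-to-frame distances, no global cut-off has to be tuned per video. One limitation of this particular rule is that a large jump stays in the window for a few frames and can mask a second cut that follows closely.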

4

Etykietowanie danych dźwiękowych do celów przeszukiwania multimedialnych baz danych [Labeling audio data for searching multimedia databases]

Tyburek K., Garlicki K.

Studia Informatica | 2011 | Vol. 32, nr 2A | 553-563

PL (translated from Polish)

The MPEG-7 standard deals with the classification and aggregation of multimedia data and provides many basic descriptors that describe sound. Following the existing MPEG-7 standard, descriptors were selected that recognize specific guitar effects. The main task set in the research is to select spectral-domain descriptors that, combined with specific search algorithms, allow correct interpretation of the sound source, taking into account the guitar effect applied. Electric guitars and effects from well-known manufacturers were used in the research.

EN

The classification and aggregation of multimedia data are addressed by the MPEG-7 standard, which provides many definitions of descriptors that describe features of sound. Following the MPEG-7 standard, groups of descriptors that recognize specific guitar effects were selected. The selection of groups of frequency-domain descriptors was the main subject of this paper. Combined with a specific search algorithm, these groups of descriptors make it possible to recognize the guitar effects. Electric guitars and guitar effects from prominent manufacturers were used in the experiments.

5

Manipulation of compressed data using MPEG-7 low level audio descriptors

Lukasiak J., Stirling D., Perrow S., Harders N.

Journal of Telecommunications and Information Technology | 2003 | nr 2 | 83-91

EN

This paper analyses the consistency of a set of MPEG-7 low level audio descriptors when the input audio stream has previously been compressed with a lossy compression algorithm. The analysis results show that lossy compression has a detrimental effect on the integrity of practical search and retrieval schemes that utilize the low level audio descriptors. Methods are then proposed to reduce the detrimental effects of compression in searching schemes. These proposed methods include improved searches, switched adaptive scalar and vector prediction, and other prediction schemes based on machine learning principles. Of the proposed schemes the results indicate that searching which incorporates previous and future frames combined with machine learning based prediction best nullifies the effects of compression. However, future scope is identified to further improve the reliability of the MPEG-7 audio descriptors in practical search environments.

6

Optimal intervals for fuzzy categories of color temperature with application to image browsing

Skarbek W., Kukiełka G.

Machine Graphics and Vision | 2002 | Vol. 11, No. 2/3 | 297-310

EN

This paper presents the results of experiments for a color temperature browsing descriptor. We consider the problem of optimally converting an objective value (color temperature) into a subjective category (Hot, Warm, Neutral, and Cold). Situations in which subjective categories are based on an objective attribute of an object appear to be common when comparing the interpretation of human sensors with that of physical sensors. The proposed optimal procedure for partitioning the color temperature range into four disjoint intervals is described, together with the experimental results.
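The conversion from an objective value to one of four disjoint categories can be sketched as a lookup over three interval boundaries. The boundary values below are placeholders chosen for illustration; they are not the optimal intervals reported in the paper, and the function name is hypothetical. The ordering assumes that low color temperatures (reddish light) are perceived as hotter.

```python
from bisect import bisect_left

CATEGORIES = ("Hot", "Warm", "Neutral", "Cold")

def categorize(kelvin, boundaries=(2250.0, 4170.0, 8060.0)):
    """Map a color temperature in kelvin to one of four disjoint categories.

    `boundaries` holds three ascending upper limits (placeholder values);
    a value equal to a limit falls into the lower interval.
    """
    return CATEGORIES[bisect_left(boundaries, kelvin)]

labels = [categorize(t) for t in (2000, 3000, 6500, 10000)]
```

Replacing the placeholder tuple with boundaries fitted to viewer judgments would turn the same lookup into the kind of browsing categorization the abstract describes.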