Wyznaczanie podzbioru atrybutów w procesie eksploracji danych

Romański, P.; Rosiak, J.; Mulawka, J. J.

Powiadomienia systemowe

Sesja wygasła!
Sesja wygasła!

Artykuł - szczegóły

Tytuł artykułu

Wyznaczanie podzbioru atrybutów w procesie eksploracji danych

Autorzy

Romański P. , Rosiak J. , Mulawka J. J.

Identyfikatory

Warianty tytułu

Feature subset selection in the knowledge discovery process

Języki publikacji

Abstrakty

Celem niniejszej pracy jest selekcja najlepszego podzbioru atrybutów wykorzystywanego dalej do klasyfikacji i aproksymacji w procesie eksploracji danych dokonanej przez filtrowanie atrybutów z wykorzystaniem różnych funkcji oceny przydatności. W szczególności dokonane zostanie porównanie jakości klasyfikatorów i aproksymatorów budowanych przed i po selekcji. Jako badane zbiory danych wykorzystane będą klasyczne bazy dostępne w repozytorium UCI. Do przeprowadzania oceny jakości proponowanych rozwiązań zostaną użyte klasyfikatory i aproksymatory znajdujące się w gotowych pakietach języka R.

The aim of this publication is to select the best feature subsets applied further to the classification and approximation problems as part of the knowledge discovery process via filtering the attributes using different evaluation functions. In particular. a comparison of the classifiers as well as approximators' quality before and after selection will be provided. In order to test the effectiveness of the presented approaches, classic data sets available at the UCI repository will be used. The algorithms existing in the R packages will be applied to measure the quality of the proposed solutions.

Słowa kluczowe

odkrywanie wiedzy sztuczna inteligencja selekcja atrybutów

knowledge discovery artificial intelligence feature selection

Wydawca

Wydawnictwo SIGMA-NOT

Czasopismo

Elektronika : konstrukcje, technologie, zastosowania

Rocznik

2009

Tom

Vol. 50, nr 8

Strony

271--275

Opis fizyczny

Bibliogr. 12 poz., tab.

Twórcy

autor

Romański P.

autor

Rosiak J.

autor

Mulawka J. J.

Politechnika Warszawska, Instytut Systemów Elektronicznych

Bibliografia

[1] Cichosz P.: Systemy uczące się. Warszawa, Poland, WNT, 2000.
[2] Mulawka J. J.: Systemy ekspertowe. Warszawa, Poland, WNT, 1997.
[3] Quinlan J. R.: Induction of decision trees. Machine Learning 1 (March 1986): 81-106.
[4] Witten I. H., Eibe Frank.: Data Mining: Practical Machine Learning Tools and Techniques. Second edition. Morgan Kaufmann Series in Data Management Systems. Morgan Kaufmann, June 2005.
[5] Holte R. C.: Very Simple Classification Rules Perform Well on Most Commonly Used Datasets. Mach. Learn. 11 (April 1993): 63-90.
[6] Dash M., Huan Liu. Consistency-based search in feature selection. Artif. Intell. 151 (2003): 155-176.
[7] Hall M. A.: Correlation-based Feature Selection for Discrete and Numeric Class Machine Learning. ICML'00: Proceedings of the Seventeenth International Conference on Machine Learning. San Francisco, CA, USA: Morgan Kaufmann Publishers Inc., 2000, 359-366.
[8] Kononenko I.: Estimating Attributes: Analysis and Extensions of RELIEF. European Conference on Machine Learning, 1994, 171-182.
[9] Sikonja M. R., Kononenko I.: An adaptation of Relief for attribute estimation in regression. ICML'97: Proceedings of the Fourteenth International Conference on Machine Learning. San Francisco, CA, USA: Morgan Kaufmann Publishers Inc., 1997, 296-304.
[10] Asuncion, Newman D. J.: UCI Machine Learning Repository, UCI Machine Learning Repository.2007. Available: http://archive.ics.uci.edu/ml [Accessed: 15.04.2009J.
[11] Hall M. A., Holmes G.: Benchmarking attribute selection techniques for discrete class data mining. Knowledge and Data Engineering. IEEE Transactions on 15 (2003): 1437-1447.
[12] Ullman J. D., Widom J.: A First Course in Database Systems. Prentice Hall, 2008.

Typ dokumentu

Bibliografia

Identyfikator YADDA

bwmeta1.element.baztech-article-BWA9-0031-0054