Proper classification of cancer is a crucial aspect in diagnosis and choosing optimal medical therapy. It has been suggested, in recent years, that classification process of cancer can be done using gene expression monitoring. Usefulness of this approach has increased due to the new technique of gene expression monitoring – using so called "expression chips". Recently in [1, 3] a heuristic method of cancer classification, called weighted voting (WV) method, based on gene expression levels has been proposed and tested on a set of samples of acute myeloid leukemia (AML) and acute lymphoblastic leukemia (ALL). Here a more traditional approach to feature selection and classification is presented and tested on the same data set. Feature selection is performed using modified Sebestyen criterion and classification is done using linear classifying function trained by modified perception algorithm. Obtained results are better than results of the WV method. In cross-validation of initial set all 38 samples were classified correctly (WV – 1 incorrect) and only one sample from independent set was classified incorrectly (WV – 2 incorrect).
Microarrays are new technique of gene expression measurements that attracted a great deal of research interest in recent years. It has been suggested that gene expression data from microarrays (biochips) can be utilized in many biomedical areas, for example in cancer classification. Whereas several, new and existing, methods of classification has been tested, a selection of proper (optimal) set of genes, which expression serves during classification, is still an open problem. In this paper we propose a heuristic method of choosing suboptimal set of genes by using support vector machines (SVMs). Obtained set of genes optimizes one-leave-out cross-validation error. The method is tested on microarray gene expression data of samples of two cancer types: acute myeloid leukemia (AML) and acute lymphoblastic leukemia (ALL). The results show that quality of classification of selected set of genes is much better than for sets obtained using another methods of feature selection.
JavaScript jest wyłączony w Twojej przeglądarce internetowej. Włącz go, a następnie odśwież stronę, aby móc w pełni z niej korzystać.