The simplest classification task is to divide a set of objects into two classes, but most of the problems we find in real life applications are multi-class. There are many methods of decomposing such a task into a set of smaller classification problems involving two classes only. Among the methods, pairwise coupling proposed by Hastie and Tibshirani (1998) is one of the best known. Its principle is to separate each pair of classes ignoring the remaining ones. Then all objects are tested against these classifiers and a voting scheme is applied using pairwise class probability estimates in a joint probability estimate for all classes. A closer look at the pairwise strategy shows the problem which impacts the final result. Each binary classifier votes for each object even if it does not belong to one of the two classes which it is trained on. This problem is addressed in our strategy. We propose to use additional classifiers to select the objects which will be considered by the pairwise classifiers. A similar solution was proposed by Moreira and Mayoraz (1998), but they use classifiers which are biased according to imbalance in the number of samples representing classes.
2
Dostęp do pełnego tekstu na zewnętrznej witrynie WWW
This paper describes a study of emotion recognition based on speech analysis. The introduction to the theory contains a review of emotion inventories used in various studies of emotion recognition as well as the speech corpora applied, methods of speech parametrization, and the most commonly employed classification algorithms. In the current study the EMO-DB speech corpus and three selected classifiers, the k-Nearest Neighbor (k-NN), the Artificial Neural Network (ANN) and Support Vector Machines (SVMs), were used in experiments. SVMs turned out to provide the best classification accuracy of 75.44% in the speaker dependent mode, that is, when speech samples from the same speaker were included in the training corpus. Various speaker dependent and speaker independent configurations were analyzed and compared. Emotion recognition in speaker dependent conditions usually yielded higher accuracy results than a similar but speaker independent configuration. The improvement was especially well observed if the base recognition ratio of a given speaker was low. Happiness and anger, as well as boredom and neutrality, proved to be the pairs of emotions most often confused.
This paper concerns the analysis of experimental data, verifying the applicability of signal analysis techniques for condition monitoring of a packaging machine. In particular, the activity focuses on the cutting process that divides a continuous flow of packaging paper into single packages. The cutting process is made by a steel knife driven by a hydraulic system. Actually, the knives are frequently substituted, causing frequent stops of the machine and consequent lost production costs. The aim of this paper is to develop a diagnostic procedure to assess the wearing condition of blades, reducing the stops for maintenance. The packaging machine was provided with pressure sensor that monitors the hydraulic system driving the blade. Processing the pressure data comprises three main steps: the selection of scalar quantities that could be indicative of the condition of the knife. A clustering analysis was used to set up a threshold between unfaulted and faulted knives. Finally, a Support Vector Machine (SVM) model was applied to classify the technical condition of knife during its lifetime.
It has become crucial to have an early prediction model that provides accurate assurance for users about the financial situation of consumers. Recent studies have focused on predicting corporate bankruptcies and credit defaults, not personal bankruptcies. Due to this situation, the present study fills the literature gap by comparing different machine learning algorithms to predict personal bankruptcy. The main objective of the study is to examine the usefulness of machine learning models such as SVM, random forest, AdaBoost, XGBoost, LightGBM, and CatBoost in forecasting personal bankruptcy. The study relies on two samples of households (learning and testing) from the Survey of Consumer Finances, which was conducted in the United States. Among the models estimated, LightGBM, CatBoost, and XGBoost showed the highest effectiveness. The most important variables used in the models are income, refusal to grant credit, delays in the repayment of liabilities, the revolving debt ratio, and the housing debt ratio.
JavaScript jest wyłączony w Twojej przeglądarce internetowej. Włącz go, a następnie odśwież stronę, aby móc w pełni z niej korzystać.