We present a method capable of detecting potentially anomalous cosmic particle tracks acquired with complementary metal-oxide-semiconductor (CMOS) sensors. We apply a principal components analysis-based feature extraction method and rough k-means clustering for outlier detection. We evaluated our approach on more than 104 images acquired by the Cosmic Ray Extremely Distributed Observatory (CREDO). The method presented in this work proved to be an effective solution. The analysis of the behavior of the rough k-means clustering-based algorithm presented here and the method of selecting its parameters showed that the algorithm performs as expected and demonstrates efficiency, stability, and repeatability of results for the test data set. The results included in this work are very relevant to the international CREDO project and the broader problem of anomaly analysis in image data sets. We plan to deploy the presented methodology in the image processing pipeline of the large data set we are working on in the CREDO project. The results can be reproduced using our source code, which is published in an open repository.
2
Dostęp do pełnego tekstu na zewnętrznej witrynie WWW
With the gaining popularity of rough clustering, soft computing research community is studying relationships between rough and fuzzy clustering as well as their relative advantages. Both rough and fuzzy clustering are less restrictive than conventional clustering. Fuzzy clustering memberships are more descriptive than rough clustering. In some cases, descriptive fuzzy clustering may be advantageous, while in other cases it may lead to information overload. Many applications demand use of combined approach to exploit inherent strengths of each technique. Our objective is to examine correlation between these two techniques. This paper provides an experimental description of how rough clustering results can be correlated with fuzzy clustering results. We illustrate procedural steps to map fuzzy membership clustering to rough clustering. However, such a conversion is not always necessary, especially if one only needs lower and upper approximations. Experiments also show that descriptive fuzzy clustering may not always (particularly for high dimensional objects) produce results that are as accurate as direct application of rough clustering. We present analysis of the results from both the techniques.
JavaScript jest wyłączony w Twojej przeglądarce internetowej. Włącz go, a następnie odśwież stronę, aby móc w pełni z niej korzystać.