PL EN


Preferencje help
Widoczny [Schowaj] Abstrakt
Liczba wyników
Tytuł artykułu

Tracing cluster transitions for different cluster types

Treść / Zawartość
Identyfikatory
Warianty tytułu
Języki publikacji
EN
Abstrakty
EN
Clustering algorithms detect groups of similar population members, like customers, news or genes. In many clustering applications the observed population evolves and changes over time, subject to internal and external factors. Detecting and understanding changes is important for decision support. In this work, we present the MONIC+ framework for cluster-type-specific transition modeling and detection. MONIC+ encompasses a typification of clusters and cluster-type-specific transition indicators, by exploiting cluster topology and cluster statistics for the transition detection process. Our experiments on both synthetic and real datasets demonstrate the usefulness and applicability of our framework.
Rocznik
Strony
239--259
Opis fizyczny
Bibliogr. 20 poz., rys.
Twórcy
autor
Bibliografia
  • AGGARWAL, C. (2005) On Change Diagnosis in Evolving Data Streams. IEEE Trans, on Knowledge and Data Engineering 17 (5), 587-600.
  • AGGARWAL, C., HAN, J., WANG, J. and Yu, P. (2003) A Framework for Clustering Evolving Data Streams. Proc. of the 29th International Conference on Very Large Data Bases. VLDB Endowment, 81-92.
  • ALLAN, J. (2002) Introduction to Topic Detection and Tracking. In: J. Allan, ed., Topic Detection and Tracking: Event-Based Information Organization. Kluwer Academic Publishers, 1-16.
  • BARON, S., SPILIOPOULOU, M. and GÜNTHER, O. (2003) Efficient Monitoring of Patterns in Data Mining Environments. Proc. of the 7th East European Conference on Advances in Databases and Information Systems. Springer, Berlin-Heidelberg, 253-265.
  • BARTOLINI, I., CIACCIA, P., NTOUTSI, I., PATELLA, M. And THEODORIDIS, Y. (2004) A Unified and Flexible Framework for Comparing Simple and Complex Patterns. Proc. of the 8th European Conf. on Principles and Practice of Knowledge Discovery in Databases. Springer, New York, 496-499.
  • ESTER, M., KRIEGEL, H.-P., SANDER, J., WIMMER, M. and XU, X. (1998) Incremental Clustering for Mining in a Data Warehousing Environment. Proc. of the 24th International Conference on Very Large Data Bases. Morgan Kaufmann, 322-333.
  • GANTI, V., GEHRKE, J., and RAMAKRISHNAN, R. (1999) A Framework for Measuring Changes in Data Characteristics. Proc. of the 18th ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems. ACM Press, New York, 126-137.
  • HAN, J. and KAMBER, M. (2000) Data Mining: Concepts and Techniques. Morgan Kaufmann.
  • KALNIS. P., MAMOULIS, N. and BAKIRAS, S. (2005) On Discovering Moving Clusters in Spatio-Temporal Data. Proc. of the 9th International Symposium on Advances in Spatial and Temporal Databases. Springer, 364-381.
  • MEI, Q. and ZHAI, C. (2005) Discovering Evolutionary Theme Patterns from Text: An Exploration of Temporal Text Mining. KDD ‘05: Proceedings of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining. ACM Press, New York, 198-207.
  • MEILA, M. (2002) Comparing Clusterings. Technical Report, Department of Statistics, University of Washington.
  • MEILA, M. (2003) Comparing Clusterings by the Variation of Information. Proc. of the 16th Annual Conference on Computational Learning Theory. Springer, 173-187.
  • MORINAGA, S. and YAMANISHI, K. (2004) Tracking Dynamics of Topic Trends Using a Finite Mixture Model. Proc. of the 10th ACM SIKGKDD International Conference on Knowledge Discovery and Data Mining. ACM Press, New York, 811-816.
  • NTOUTSI, I., PELEKIS, N. and THEODORIDIS, Y. (2007) Pattern Comparison in Data Mining: A Survey. Research and Trends in Data Mining Technologies and Applications (Advances in Data Warehousing and Mining). Idea Group Publishing, 86-120.
  • O’CALLAGHAN, L., MISHRA, N., MEYERSON, A., GUHA, S. and MOTWANI, R. (2002) Streaming-Data Algorithms for High-Quality Clustering. Proc. of the 18th International Conference on Data Engineering. IEEE Computer Society, 685.
  • SPILIOPOULOU, M., NTOUTSI, I., THEODORIDIS, Y. and SCHULT, R. (2006) MONIC: Modeling and Monitoring Cluster Transitions. Proc. of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, New York, 706-711.
  • SPILIOPOULOU, M., NTOUTSI, I. and THEODORIDIS, Y. (2007) Cluster Transitions for Different Cluster Types. Proc. of the 3rd ADBIS Workshop On Data Mining And Knowledge Discovery (ADMKD), Varna, Bulgaria.
  • WITTEN, I. and FRANK, E. (2005) Data Mining: Practical Machine Learning Tools and Techniques. Morgan Kaufmann.
  • YANG, H., PARTHASARATHY, S. and MEHTA, S. (2005) A Generalized Framework for Mining Spatio-Temporal Patterns in Scientific Data. Proc. of the 11th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM Press, New York, 716-721.
  • ZHOU, D., LI, J. and ZHA, H. (2005) A New Mallows Distance Based Metric for Comparing Clusterings. Proc. of the 22nd International Conference on Machine Learning. ACM Press, New York, 1028-1035.
Typ dokumentu
Bibliografia
Identyfikator YADDA
bwmeta1.element.baztech-article-BAT5-0036-0034
JavaScript jest wyłączony w Twojej przeglądarce internetowej. Włącz go, a następnie odśwież stronę, aby móc w pełni z niej korzystać.