PL EN


Preferencje help
Widoczny [Schowaj] Abstrakt
Liczba wyników
Tytuł artykułu

Semantic information within the BEATCA framework

Treść / Zawartość
Identyfikatory
Warianty tytułu
Języki publikacji
EN
Abstrakty
EN
In this paper we investigate the impact of semantic information on the quality of hierarchical, fuzzy-based clustering of a collection of textual documents. We show that via a relevant tagging of a part of the documents one can improve the quality of overall clustering, both of tagged and un-tagged documents.
Rocznik
Strony
377--399
Opis fizyczny
Bibliogr. 35 poz., wykr.
Twórcy
autor
  • Institute of Computer Science, Polish Academy of Sciences, Ordona 21, 01-237 Warszawa, Poland
Bibliografia
  • BAEZA-YATES, R. and RIBEIRO-NETO, B. (1999) Modern Information Retrieval. ACM Press.
  • BECKS, A. (2001) Visual Knowledge Management with Adaptable Document Maps. GMD Research Series, 15.
  • BEZDEK, J.C. and PAL, S.K. (1991) Fuzzy Models for Pattern Recognition: Methods that Search for Structures in Data. IEEE Press, New York.
  • BJÖRNEBORN, L. (2004) Small-world link structures across an academic web space: A library and information science approach. PhD dissertation. www.db.dk/LB.
  • BONING, D., CORNO, F. and FARINETTI, L. (2003) DOSE: a Distributed Open Semantic Elaboration Platform. In: ICTAI 2003, The 15th IEEE International Conference on Tools with Artificial Intelligence, Sacramento, California. IEEE Computer Society.
  • CHEUNG, Y.M. and ZENG, H. (2007) A Maximum Weighted Likelihood Approach to Simultaneous Model Selection and Feature Weighting in Gaussian Mixture. In: Artificial Neural Networks - ICANN 2007, LNCS 4668, Springer, 78-87.
  • CIESIELSKI, K. and KŁOPOTEK, M.A. (2006) Text data clustering by contextual graphs. In: L. Todorovski, N. Lavrac and K.P. Jantke, eds., Discovery Science (DS-2006). LNAI 4265, Springer-Verlag, 65-76.
  • CIESIELSKI, K. and KŁOPOTEK, M.A. (2007) Towards Adaptive Web Mining: Histograms and Contexts in Text Data Clustering. In: M.R. Berthold and J. Shawe-Taylor, eds., Intelligent Data Analysis (IDA-2007). LNCS 4723, Springer-Verlag, 284-295.
  • CIESIELSKI, K., WIERZCHOŃ, S.T. and KŁOPOTEK, M.A. (2006) An Immune Network for Contextual Text Data Clustering. In: H. Bersini and J. Carneiro, eds., 5th International Conference on Artificial Immune Systems (ICARIS-2006). LNCS 4163, Springer-Verlag, 432-445.
  • CORBY, O., DIENG-KUNTZ, R. and FARON-ZUCKER, C. (2004) Querying the Semantic Web With Corese Search Engine. In: R. Lopez de Mantaras and L. Saitta, eds., Proc. 16th ECAI/PAIS, Valencia, Spain. IOS Press, 705-709.
  • DAVIES, J., WEEKS, R. and KROHN, U. (2002) QuizRDF: Search Technology for the Semantic Web. In: WWW2002 Workshop on RDF and Semantic Web Applications. Hawaii, USA.
  • DE CASTRO, L.N. and TIMMIS, J. (2002) Artificial Immune Systems: A New Computational Intelligence Approach. Springer.
  • DING, L., FININ, T., JOSHI, A., PAN, R., COST, R.S., PENG, Y., REDDIVARI, P., DOSHI, V.C. and SACHS, J. (2004) Swoogle: A Search and Metadata Engine for the Semantic Web. In: Proceedings of the Thirteenth ACM Conference on Information and Knowledge Management. ACM Press.
  • ESMAILI, K.S. and ABOLHASSANI, H. (2006) A Categorization Scheme for Semantic Web Search Engines. In: Proceedings of the 4th ACS/IEEE International Conference on Computer Systems and Applications (AICCSA-06), Sharjah, UAE, 171-178.
  • FLORIDI, L. (2005) Semantic Conceptions of Information. In: Stanford Encyclopedia of Philosophy, http://plato.stanford.edu/entries/information-semantic/.
  • FRIGUI, H. and NASRAOUI, O. (2004) Simultaneous Clustering and Dynamic Keyword Weighting for Text Documents. In: M. Berry, ed. Survey of Text Mining. Springer, 45-70.
  • FRITZKE, B. (1997) A Self-Organizing Network that Can Follow Non-stationary Distributions. In: ICANN ‘97: Proceedings of the 7th International Conference on Artificial Neural Networks, Springer-Verlag, 613-618.
  • GRISHMAN, R. (1997) Information extraction: Techniques and challenges. In: M.T. Pazienza, ed., Information Extraction A Multidisciplinary Approach to an Emerging Information Technology. LNCS 1299, Springer, 10-27.
  • GUHA, R., MCCOOL, R. and MILLER, E. (2003) Semantic Search. In: Proc. of the 12th International Conference on World Wide Web. ACM, New York, NY, 700-709.
  • HAN, J. and KAMBER, M. (2001) Date Mining: Concepts and Techniques. Morgan Kaufmann.
  • HEFLIN, J. and HENDLER, J. (2000) Searching the Web with SHOE. In: Proc. 17th National Conference on Artificial Intelligence (AAAI-2000). AAAI/MIT Press, Menlo Park, 443-449.
  • JING, L., NG, M.K. and HUANG, J.ZH. (2007) An Entropy Weighting k-Means Algorithm for Subspace Clustering of High-Dimensional Sparse Data. IEEE Trans, on Knowl. and Data Eng., 19(8), 1026-1041, doi:http://dx.doi.org/10.1109/TKDE.2007.1048.
  • KŁOPOTEK, M.A., WIERZCHOŃ, S.T., CIESIELSKI, K., DRAMIŃSKI, M. and CZERSKI, D. (2007) Conceptual Maps of Document Collections in Internet and Intranet. Coping with the Technological Challenge. Institute of Computer Science of Polish Academy of Sciences, Warsaw.
  • KOHONEN, T., KASKI, S., SOMERVUO, P., LAGUS, K., OJA, M. and PAATERO, V. (2003) Self-organization of very large document collections. Technical Report, University of Technology, Helsinki, Finland.
  • MOBASHER, B. (2005) Practical Handbook of Internet Computing. Chapter: Web Usage Mining and Personalization, CRC Press, 342-380.
  • PRIEBE, T., SCHLAEGER, C. and PERNUL, G. (2004) A Search Engine for RDF Metadata. In: Proc. of the DEXA 2004 Workshop on Web Semantics (WebS 2004), Zaragoza, Spain.
  • SHETH, A., BERTRAM. C., AVANT, D., HAMMOND, B., KOCHUT, K. and WARKE, Y. (2002) Managing Semantic Content for the Web. IEEE Internet Computing, 6(4), 80-87.
  • SPARCK, J. (1972) A statistical interpretation of term specifity and its application in retrieval. Journal of Documentation, 28, 111-121.
  • SPYNS, P., OBERLE, D., VOLZ, R., ZHENG, J., JARRAR, M., SURE, Y., STUDER, R. and MEERSMAN, R. (2002) OntoWeb - A Semantic Web Community Portal. In: D. Karagiansis and U. Reiner, eds., Proceedings of the 4th International Conference on Practical Aspects of Knowledge Management. LNAI 2569, Springer, 189-200.
  • TAMMA, V., BLACOE, I., SMITH, B. and WOOLDRIDGE, M. (2004) SERSE: searching for semantic web content. In: Proceedings of the 16th European Conference on Artificial Intelligence, ECAI 2004, Valencia, Spain. IOS Press.
  • TERZIYAN, W., KAYKOVA, O., KLOCHKO, O., TARANOV, A., KHRIYENKO, O., KONONENKO, O. and ZHARKO, A. (2004) Semantic Search Facilitator. Technical Report, Industrial Ontologies Group.
  • TJONG, E., SANG, K. and HOFMANN, K. (2007) Automatic Extraction of Dutch Hypernym-Hyponym Pairs, taalunieversum.org/taal/technologie/ stevin/documenten/clin2007_cornetto.pdf.
  • WINSTON, M.E., CHAFPIN, R. and HERRMANN, D. (1987) A taxonomy of part-whole relations. Cognitive Science, 11(4), 417-444.
  • ZHANG, T., RAMAKRISHNAN, R. and LIVNY, M. (1996) BIRCH: An efficient data clustering method for very large databases. In: Proceedings of the 1996 ACM SIGMOD International Conference on Management of Data. ACM Press, 103-114.
  • ZHU, H., ZHONG, J., LI, J. and YU, Y. (2002) An Approach for Semantic Search by Matching RDF Graphs. In: Proceedings of the 15th International Florida Artificial Intelligence Research Society Conference. AAAI Press, 450-454.
Typ dokumentu
Bibliografia
Identyfikator YADDA
bwmeta1.element.baztech-article-BAT5-0055-0008
JavaScript jest wyłączony w Twojej przeglądarce internetowej. Włącz go, a następnie odśwież stronę, aby móc w pełni z niej korzystać.