

Article title

Feeding Syntactic Versus Semantic Knowledge to a Knowledge-lean Unsupervised Word Sense Disambiguation Algorithm with an Underlying Naive Bayes Model

Authors
Identifiers
Title variants
Publication languages
EN
Abstracts
EN
The present paper concentrates on the issue of feature selection for unsupervised word sense disambiguation (WSD) performed with an underlying Naïve Bayes model. It introduces dependency-based feature selection which, to our knowledge, is used for the first time in conjunction with the Naïve Bayes model acting as clustering technique. Construction of the dependency-based semantic space required for the proposed task is discussed. The resulting disambiguation method, representing an extension of the method introduced in [15], lies at the border between unsupervised and knowledge-based techniques. Syntactic knowledge provided by dependency relations (and exemplified in the case of adjectives) is hereby compared to semantic knowledge offered by the semantic network WordNet (and examined in [15]). Our conclusion is that the Naïve Bayes model reacts well in the presence of syntactic knowledge of this type and that dependency-based feature selection is a reliable alternative to the WordNet-based semantic one.
Publisher
Year
Pages
61-86
Physical description
Bibliography: 44 items, tables.
Contributors
author
author
  • Department of Computer Science, Faculty of Mathematics and Computer Science, University of Bucharest, 14, Academiei Str., Bucharest, Sector 1, C.P. 010014, Romania, fhristea@fmi.unibuc.ro
Bibliography
  • [1] Agirre, E., Edmonds, P. G., (Eds.): Word Sense Disambiguation: Algorithms and Applications, Springer, 2006.
  • [2] Banerjee, S., Pedersen, T.: An Adapted Lesk Algorithm for Word Sense Disambiguation Using WordNet, in: Proceedings of the Third International Conference on Computational Linguistics and Intelligent Text Processing, CICLing '02, 2002, ISBN 3-540-43219-1, 136-145.
  • [3] Banerjee, S., Pedersen, T.: Extended Gloss Overlaps as a Measure of Semantic Relatedness, in: Proceedings of the 18th International Joint Conference on Artificial Intelligence, 2003, 805-810.
  • [4] Bruce, R., Wiebe, J., Pedersen, T.: The Measure of a Model, in: Proceedings of the Conference on Empirical Methods in Natural Language Processing, 1996, 101-112.
  • [5] Collins, A. M., Quillian, M. R.: Retrieval time from semantic memory, Journal of Verbal Learning and Verbal Behavior, 8, 1969, 240-247.
  • [6] Dempster, A., Laird, N., Rubin, D.: Maximum likelihood from incomplete data via the EM algorithm, Journal of the Royal Statistical Society B, 39(1), 1977, 1-38.
  • [7] Domingos, P., Pazzani, M.: On the optimality of the simple Bayesian classifier under zero-one loss, Machine Learning, 29, 1997, 103-130.
  • [8] Eberhardt, F., Danks, D.: Confirmation in the Cognitive Sciences: The Problematic Case of Bayesian Models, Minds and Machines, 21, 2011, 389-410.
  • [9] Fellbaum, C.: WordNet, in: Theory and Applications of Ontology: Computer Applications (R. Poli, M. Healy and A. Kameas Eds.), Dordrecht, London: Springer, 2010, 231-243.
  • [10] Fellbaum, C., (Ed.): WordNet: an Electronic Lexical Database, The MIT Press, Cambridge, MA, 1998.
  • [11] Gale, W., Church, K., Yarowsky, D.: A method for disambiguating word senses in a large corpus, Computers and the Humanities, 26(5-6), 1992, 415-439, ISSN 0010-4817.
  • [12] Grefenstette, G.: Explorations in Automatic Thesaurus Discovery, Dordrecht: Kluwer Academic Publishers, 1994.
  • [13] Gross, D., Fischer, U., Miller, G. A.: The organization of adjectival meanings, Journal of Memory and Language, 28, 1989, 92-106.
  • [14] Hristea, F.: Recent Advances Concerning the Usage of the Naïve Bayes Model in Unsupervised Word Sense Disambiguation, International Review on Computers and Software, 4(1), 2009, 58-67.
  • [15] Hristea, F., Popescu, M.: Adjective Sense Disambiguation at the Border Between Unsupervised and Knowledge-Based Techniques, Fundamenta Informaticae, 91(3-4), 2009, 547-562, ISSN 0169-2968.
  • [16] Hristea, F., Popescu, M., Dumitrescu, M.: Performing word sense disambiguation at the border between unsupervised and knowledge-based techniques, Artificial Intelligence Review, 30(1-4), December 2008, 67-86, ISSN 0269-2821.
  • [17] Hudson, R.: Word Grammar, Oxford: Blackwell, 1984.
  • [18] Justeson, J. S., Katz, M. K.: Principled disambiguation: Discriminating adjective sense with modified nouns, Unpublished manuscript, IBM Thomas J. Watson Research Center, NY, 1993.
  • [19] Kay, M.: The concrete lexicon and the abstract dictionary, in: Proceedings of the Fifth Annual Conference of the UW Centre for the New Oxford English Dictionary, 1989, 35-41.
  • [20] Klein, D., Manning, C. D.: Accurate unlexicalized parsing, in: Proceedings of the 41st Meeting of the Association for Computational Linguistics (ACL 2003), 2003, 423-430.
  • [21] Lee, L.: Measures of distributional similarity, in: Proceedings of the 37th Annual Meeting of the Association for Computational Linguistics, 1999, 25-32.
  • [22] Levin, B.: English Verb Classes and Alternations: A Preliminary Investigation, Chicago, IL: University of Chicago Press, 1993.
  • [23] Lin, D.: Automatic retrieval and clustering of similar words, in: Proceedings of the Joint Annual Meeting of the Association for Computational Linguistics and International Conference on Computational Linguistics, 1998, 768-774.
  • [24] Manning, C., Schütze, H.: Foundations of Statistical Natural Language Processing, Cambridge, MA: The MIT Press, 2003.
  • [25] de Marneffe, M.-C., MacCartney, B., Manning, C. D.: Generating Typed Dependency Parses from Phrase Structure Parses, in: Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC-06), 2006, 449-454.
  • [26] de Marneffe, M.-C., Manning, C. D.: Stanford typed dependencies manual, Technical report, Stanford University, 2008.
  • [27] Miller, G. A.: Nouns in WordNet: a lexical inheritance system, International Journal of Lexicography, 3(4), 1990, 245-264.
  • [28] Miller, G. A.: WordNet: a lexical database for English, Communications of the ACM, 38(11), November 1995, 39-41.
  • [29] Miller, G. A.: Nouns in WordNet, in: WordNet: An Electronic Lexical Database (C. Fellbaum Ed.), Cambridge, MA: The MIT Press, 1998, 23-46.
  • [30] Miller, G. A., Beckwith, R., Fellbaum, C., Gross, D., Miller, K.: WordNet: An on-line lexical database, International Journal of Lexicography, 3, 1990, 235-244.
  • [31] Miller, G. A., Hristea, F.: WordNet Nouns: Classes and Instances, Computational Linguistics, 32(1), 2006, 1-3.
  • [32] Miller, G. A., Johnson-Laird, P. N.: Language and perception, Cambridge, MA: Harvard University Press, 1976.
  • [33] Miller, K.: Modifiers in WordNet, in: WordNet: An Electronic Lexical Database (C. Fellbaum Ed.), Cambridge, MA: The MIT Press, 1998, 47-67.
  • [34] Murphy, G. L., Andrew, J. M.: The conceptual basis of antonymy and synonymy in adjectives, Journal of Memory and Language, 32, 1993, 301-319.
  • [35] Năstase, V.: Unsupervised All-words Word Sense Disambiguation with Grammatical Dependencies, in: Proceedings of the Third International Joint Conference on Natural Language Processing, 2008, 757-762.
  • [36] Padó, S., Lapata, M.: Dependency-Based Construction of Semantic Space Models, Computational Linguistics, 33(2), 2007, 161-199.
  • [37] Pedersen, T.: Unsupervised Corpus-Based Methods for WSD, in: Word Sense Disambiguation: Algorithms and Applications, Springer, 2006, 133-166.
  • [38] Pedersen, T., Bruce, R.: Distinguishing Word Senses in Untagged Text, in: Proceedings of the Second Conference on Empirical Methods in Natural Language Processing, 1997, 197-207.
  • [39] Pedersen, T., Bruce, R.: Knowledge Lean Word-Sense Disambiguation, in: Proceedings of the 15th National Conference on Artificial Intelligence, AAAI Press, 1998, 800-805.
  • [40] Ponzetto, S. P., Navigli, R.: Knowledge-rich Word Sense Disambiguation rivaling supervised systems, in: Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, ACL '10, 2010, 1522-1531.
  • [41] Schütze, H.: Automatic word sense discrimination, Computational Linguistics, 24(1), 1998, 97-123.
  • [42] Sleator, D. D. K., Temperley, D.: Parsing English with a Link Grammar, Technical report CMU-CS-91-196, Carnegie Mellon University, Pittsburgh, PA, 1991.
  • [43] Sleator, D. D. K., Temperley, D.: Parsing English with a Link Grammar, in: Proceedings of the Third International Workshop on Parsing Technologies (IWPT93), 1993, 277-292.
  • [44] Tesnière, L.: Éléments de syntaxe structurale, Paris: Klincksieck, 1959.
Document type
Bibliography
YADDA identifier
bwmeta1.element.baztech-article-BUS8-0027-0015