Article title
Authors
Identifiers
Title variants
Publication languages
Abstracts
The present paper concentrates on the issue of feature selection for unsupervised word sense disambiguation (WSD) performed with an underlying Naïve Bayes model. It introduces dependency-based feature selection which, to our knowledge, is used here for the first time in conjunction with the Naïve Bayes model acting as a clustering technique. The construction of the dependency-based semantic space required for the proposed task is discussed. The resulting disambiguation method, an extension of the method introduced in [15], lies at the border between unsupervised and knowledge-based techniques. Syntactic knowledge provided by dependency relations (exemplified in the case of adjectives) is compared with the semantic knowledge offered by the semantic network WordNet (examined in [15]). Our conclusion is that the Naïve Bayes model reacts well in the presence of syntactic knowledge of this type and that dependency-based feature selection is a reliable alternative to the WordNet-based semantic one.
Publisher
Journal
Year
Volume
Pages
61-86
Physical description
Bibliography: 44 items, tables.
Contributors
author
author
- Department of Computer Science, Faculty of Mathematics and Computer Science, University of Bucharest, 14, Academiei Str., Bucharest, Sector 1, C.P. 010014, Romania, fhristea@fmi.unibuc.ro
Bibliography
- [1] Agirre, E., Edmonds, P. G., (Eds.): Word Sense Disambiguation: Algorithms and Applications, Springer, 2006.
- [2] Banerjee, S., Pedersen, T.: An Adapted Lesk Algorithm for Word Sense Disambiguation Using WordNet, in: Proceedings of the Third International Conference on Computational Linguistics and Intelligent Text Processing, CICLing '02, 2002, ISBN 3-540-43219-1, 136-145.
- [3] Banerjee, S., Pedersen, T.: Extended Gloss Overlaps as a Measure of Semantic Relatedness, in: Proceedings of the 18th International Joint Conference on Artificial Intelligence, 2003, 805-810.
- [4] Bruce, R., Wiebe, J., Pedersen, T.: The Measure of a Model, in: Proceedings of the Conference on Empirical Methods in Natural Language Processing, 1996, 101-112.
- [5] Collins, A. M., Quillian, M. R.: Retrieval time from semantic memory, Journal of Verbal Learning and Verbal Behavior, 8, 1969, 240-247.
- [6] Dempster, A., Laird, N., Rubin, D.: Maximum likelihood from incomplete data via the EM algorithm, Journal of the Royal Statistical Society B, 39(1), 1977, 1-38.
- [7] Domingos, P., Pazzani, M.: On the optimality of the simple Bayesian classifier under zero-one loss, Machine Learning, 29, 1997, 103-130.
- [8] Eberhardt, F., Danks, D.: Confirmation in the Cognitive Sciences: The Problematic Case of Bayesian Models, Minds and Machines, 21, 2011, 389-410.
- [9] Fellbaum, C.: WordNet, in: Theory and Applications of Ontology: Computer Applications (R. Poli, M. Healy and A. Kameas Eds.), Dordrecht, London: Springer, 2010, 231-243.
- [10] Fellbaum, C., (Ed.): WordNet: An Electronic Lexical Database, The MIT Press, Cambridge, MA, 1998.
- [11] Gale, W., Church, K., Yarowsky, D.: A method for disambiguating word senses in a large corpus, Computers and the Humanities, 26(5-6), 1992, 415-439, ISSN 0010-4817.
- [12] Grefenstette, G.: Explorations in Automatic Thesaurus Discovery, Dordrecht: Kluwer Academic Publishers, 1994.
- [13] Gross, D., Fischer, U., Miller, G. A.: The organization of adjectival meanings, Journal of Memory and Language, 28, 1989, 92-106.
- [14] Hristea, F.: Recent Advances Concerning the Usage of the Naïve Bayes Model in Unsupervised Word Sense Disambiguation, International Review on Computers and Software, 4(1), 2009, 58-67.
- [15] Hristea, F., Popescu, M.: Adjective Sense Disambiguation at the Border Between Unsupervised and Knowledge-Based Techniques, Fundamenta Informaticae, 91(3-4), 2009, 547-562, ISSN 0169-2968.
- [16] Hristea, F., Popescu, M., Dumitrescu, M.: Performing word sense disambiguation at the border between unsupervised and knowledge-based techniques, Artificial Intelligence Review, 30(1-4), December 2008, 67-86, ISSN 0269-2821.
- [17] Hudson, R.: Word Grammar, Oxford: Blackwell, 1984.
- [18] Justeson, J. S., Katz, M. K.: Principled disambiguation: Discriminating adjective sense with modified nouns, Unpublished manuscript, IBM Thomas J. Watson Research Center, NY, 1993.
- [19] Kay, M.: The concrete lexicon and the abstract dictionary, in: Proceedings of the Fifth Annual Conference of the UW Centre for the New Oxford English Dictionary, 1989, 35-41.
- [20] Klein, D., Manning, C. D.: Accurate unlexicalized parsing, in: Proceedings of the 41st Meeting of the Association for Computational Linguistics (ACL 2003), 2003, 423-430.
- [21] Lee, L.: Measures of distributional similarity, in: Proceedings of the 37th Annual Meeting of the Association for Computational Linguistics, 1999, 25-32.
- [22] Levin, B.: English Verb Classes and Alternations: A Preliminary Investigation, Chicago, IL: University of Chicago Press, 1993.
- [23] Lin, D.: Automatic retrieval and clustering of similar words, in: Proceedings of the Joint Annual Meeting of the Association for Computational Linguistics and International Conference on Computational Linguistics, 1998, 768-774.
- [24] Manning, C., Schütze, H.: Foundations of Statistical Natural Language Processing, Cambridge, MA: The MIT Press, 2003.
- [25] de Marneffe, M.-C., MacCartney, B., Manning, C. D.: Generating Typed Dependency Parses from Phrase Structure Parses, in: Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC-06), 2006, 449-454.
- [26] de Marneffe, M.-C., Manning, C. D.: Stanford typed dependencies manual, Technical report, Stanford University, 2008.
- [27] Miller, G. A.: Nouns in WordNet: a lexical inheritance system, International Journal of Lexicography, 3(4), 1990, 245-264.
- [28] Miller, G. A.: WordNet: a lexical database for English, Communications of the ACM, 38(11), November 1995, 39-41.
- [29] Miller, G. A.: Nouns in WordNet, in: WordNet: An Electronic Lexical Database (C. Fellbaum Ed.), Cambridge, MA: The MIT Press, 1998, 23-46.
- [30] Miller, G. A., Beckwith, R., Fellbaum, C., Gross, D., Miller, K.: WordNet: An on-line lexical database, International Journal of Lexicography, 3, 1990, 235-244.
- [31] Miller, G. A., Hristea, F.: WordNet Nouns: Classes and Instances, Computational Linguistics, 32(1), 2006, 1-3.
- [32] Miller, G. A., Johnson-Laird, P. N.: Language and perception, Cambridge, MA: Harvard University Press, 1976.
- [33] Miller, K.: Modifiers in WordNet, in: WordNet: An Electronic Lexical Database (C. Fellbaum Ed.), Cambridge, MA: The MIT Press, 1998, 47-67.
- [34] Murphy, G. L., Andrew, J. M.: The conceptual basis of antonymy and synonymy in adjectives, Journal of Memory and Language, 32, 1993, 301-319.
- [35] Năstase, V.: Unsupervised All-words Word Sense Disambiguation with Grammatical Dependencies, in: Proceedings of the Third International Joint Conference on Natural Language Processing, 2008, 757-762.
- [36] Padó, S., Lapata, M.: Dependency-Based Construction of Semantic Space Models, Computational Linguistics, 33(2), 2007, 161-199.
- [37] Pedersen, T.: Unsupervised Corpus-Based Methods for WSD, in: Word Sense Disambiguation: Algorithms and Applications, Springer, 2006, 133-166.
- [38] Pedersen, T., Bruce, R.: Distinguishing Word Senses in Untagged Text, in: Proceedings of the Second Conference on Empirical Methods in Natural Language Processing, 1997, 197-207.
- [39] Pedersen, T., Bruce, R.: Knowledge Lean Word-Sense Disambiguation, in: Proceedings of the 15th National Conference on Artificial Intelligence, AAAI Press, 1998, 800-805.
- [40] Ponzetto, S. P., Navigli, R.: Knowledge-rich Word Sense Disambiguation rivaling supervised systems, in: Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, ACL '10, 2010, 1522-1531.
- [41] Schütze, H.: Automatic word sense discrimination, Computational Linguistics, 24(1), 1998, 97-123.
- [42] Sleator, D. D. K., Temperley, D.: Parsing English with a Link Grammar, Technical report CMU-CS-91-196, Carnegie Mellon University, Pittsburgh, PA, 1991.
- [43] Sleator, D. D. K., Temperley, D.: Parsing English with a Link Grammar, in: Proceedings of the Third International Workshop on Parsing Technologies (IWPT93), 1993, 277-292.
- [44] Tesnière, L.: Éléments de syntaxe structurale, Paris: Klincksieck, 1959.
Document type
Bibliography
YADDA identifier
bwmeta1.element.baztech-article-BUS8-0027-0015