PL EN


Preferencje help
Widoczny [Schowaj] Abstrakt
Liczba wyników
Tytuł artykułu

Categorization of persons based on their mentions in Polish news texts

Treść / Zawartość
Identyfikatory
Warianty tytułu
Języki publikacji
EN
Abstrakty
EN
Our goal described in this paper was to design, imple‐ ment and test a method of categorization of mentions of persons in Polish news texts. We gathered and classified the input data in order to measure the accuracy of the method. Train and test data were constructed by using lists of persons collected from YAGO knowledge base and Polish Wikipedia. During tests the efficiency of categori‐ zation depending on different representations of a per‐ son was studied. Experiments were executed on our and a chosen solution from literature. The results are shown and discussed in the paper.
Twórcy
  • Warsaw University of Technology, Faculty of Mathematics and Information Science, ul. Koszykowa 75, Warsaw, Poland
  • Warsaw University of Technology, Faculty of Mathematics and Information Science, ul. Koszykowa 75, Warsaw, Poland, www: www.ii.pw.edu.pl/~awroblew
Bibliografia
  • [1] M. Fleischman and E. Hovy, “Fine Grained Classification of Named Entities”. In: COLING 2002: The 19th International Conference on Computational Linguistics, 2002.
  • [2] V. Ganti, A. C. Kö nig, and R. Vernica, “Entity categorization over large document collections”. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, New York, NY, USA, 2008, 274–282, 10.1145/1401890.1401927.
  • [3] C. Giuliano, “Fine‑Grained Classification of Named Entities Exploiting Latent Semantic Kernels”. In: Proceedings of the Thirteenth Conference on Computational Natural Language Learning (CoNLL‑2009), Boulder, Colorado, 2009, 201–209.
  • [4] A. Ekbal, E. Sourjikova, A. Frank, and S. P. Ponzetto, “Assessing the Challenge of Fine‑Grained Named Entity Recognition and Classification”. In: Proceedings of the 2010 Named Entities Workshop, Uppsala, Sweden, 2010, 93–101.
  • [5] W. Li, J. Li, Y. Tian, and Z. Sui, “Fine‑Grained Classification of Named Entities by Fusing MultiFeatures”. In: Proceedings of COLING 2012: Posters, Mumbai, India, 2012, 693–702.
  • [6] E. Alfonseca and S. Manandhar, “An Unsupervised Method for General Named Entity Recognition and Automated Concept Discovery”. In: Proceedings of the 1 st International Conference on General WordNet, Mysore, India, 2002, 34–43.
  • [7] P. Cimiano and J. Völker, “Towards large‑scale, open‑domain and ontology‑based named entity classification”. In: Proceedings of the International Conference on Recent Advances in Natural Language Processing RANLP’05, 2005, 166–172.
  • [8] F. M. Suchanek, G. Kasneci, and G. Weikum, “YAGO: a core of semantic knowledge”. In: Proceedings of the 16th international conference on World Wide Web, New York, NY, USA, 2007, 697–706, 10.1145/1242572.1242667.
  • [9] A. Radziszewski. “A Tiered CRF Tagger for Polish”. In: R. Bembenik, L. Skonieczny, H. Rybinski, M. Kryszkiewicz, and M. Niezgodka, eds., Intelligent Tools for Building a Scientific Information Platform: Advanced Architectures and Solutions, Studies in Computational Intelligence, 215–230. Springer, Berlin, Heidelberg, 2013.
  • [10] M. Maziarz, M. Piasecki, and S. Szpakowicz, “Approaching plWordNet 2.0”. In: C. Fellbaum and P. Vossen, eds., Proceedings of 6th International Global Wordnet Conference, Matsue, Japan, 2012, 189–196, Book: http://www.globalwordnet.org/gwa/proceedings/gwc2012.pdf.
  • [11] P. F. Brown, V. J. Della Pietra, P. V. deSouza, J. C. Lai, and R. L. Mercer, “Class‑Based n‑gram Models of Natural Language”, Computational Linguistics, vol. 18, no. 4, 1992, 467–480.
  • [12] M. Hall, E. Frank, G. Holmes, B. Pfahringer, P. Reutemann, and I. H. Witten, “The WEKA data mining software: an update”, ACM SIGKDD Explorations Newsletter, vol. 11, no. 1, 2009, 10–18, 10.1145/1656274.1656278.
Uwagi
Opracowanie rekordu ze środków MNiSW, umowa Nr 461252 w ramach programu "Społeczna odpowiedzialność nauki" - moduł: Popularyzacja nauki i promocja sportu (2020).
Typ dokumentu
Bibliografia
Identyfikator YADDA
bwmeta1.element.baztech-9e870b0a-a6f6-48cd-8016-c325c4b38b55
JavaScript jest wyłączony w Twojej przeglądarce internetowej. Włącz go, a następnie odśwież stronę, aby móc w pełni z niej korzystać.