In the paper problem of searching basic forms for words in the Polish language is discussed. Polish language has a very extensive inflection and effective method for finding base form is important in many NLP tasks for example text indexing. The method for searching, based on open-source dictionary of Polish language, is presented. In this method it is important to design a structure for storing all words from dictionary, in such a way that it allows to quickly find basic words forms. Two dictionary structures: ternary search tree and associative table are presented and discussed. Tests are performed on the six actual and three crafted artificial texts and results are compared with other possible dictionary structures. At the end conclusions about structures effectiveness are formulated.
Ontologies recently have important role especially in knowledge management systems dedicated for agriculture. In the paper, issues related to indexing documents against the ontology, are presented and discussed. Problems with indexing documents in Polish language which has an extensive inflection are described. There are presented and discussed examples of ontologies and thesauri in the field of life sciences, in particular possible to use to describe aspects of plant production. We have tested Agrotagger the existing tool for indexing agricultural texts with publication in Polish. Original software developed for indexing web pages in Polish against potato ontology is described. In the final part some conclusions and plans for further research are formulated.
JavaScript jest wyłączony w Twojej przeglądarce internetowej. Włącz go, a następnie odśwież stronę, aby móc w pełni z niej korzystać.