Nowa wersja platformy, zawierająca wyłącznie zasoby pełnotekstowe, jest już dostępna.
Przejdź na https://bibliotekanauki.pl
Preferencje help
Widoczny [Schowaj] Abstrakt
Liczba wyników

Znaleziono wyników: 2

Liczba wyników na stronie
first rewind previous Strona / 1 next fast forward last
Wyniki wyszukiwania
Wyszukiwano:
w słowach kluczowych:  Corpus
help Sortuj według:

help Ogranicz wyniki do:
first rewind previous Strona / 1 next fast forward last
1
Content available remote Combined Machine-Learning Approach to PoS-Tagging of Middle English Corpora
100%
|
2018
|
tom 21
EN
This paper considers the problem of part-of-speech tagging in Middle English corpora (as well as historical corpora in general). Whereas PoS-tagging in general is now considered a solved problem for Modern English and is mainly achieved via hidden Markov models (HMM) and matrix-based word-to-vector conversions with every word in the dictionary being embedded into a single dimension, this approach relies on recurrent syntactic structures and context-free generative grammars and is therefore not applicable to older iterations of the English language due to irregular word order. As such, we believe that Middle English could be better handled by a morphographemic encoding and instance-based machine learning algorithms like SVM, random forests, kNN, etc. Using a moving-average method to generate multidimensional vectors giving a reliable numeric representation of character composition and sequences, we have achieved a precision and recall of 87.5% in classifying Middle English words by their part of speech while using a simplistic combined voting-based binary classifier. This result could be deemed satisfactory and encourages further research in the area.
|
|
tom 27
37-46
EN
nimal names or zooappellatives constitute an important part of the idioms and proverbs in the majority of natural languages. The Dutch idioms with an animal element were described from various perspectives by Kowalska-Szubert (1996). The Czech idioms with animal names were the subject of various studies carried out e.g. by Mrhačová (Mrhačová et al. 2000, Mrhačová-Kouptsevitch 2004). In this article we compare by the means of corpus research the use of animal names in the adjective-based similes in the current written Dutch and in the current written Czech. This research is based on the excerpta of the largest written corpora available for the Dutch and the Czech, namely the Corpus Contemporary Dutch and the Corpus Contemporary Czech SYN version 5.
first rewind previous Strona / 1 next fast forward last
JavaScript jest wyłączony w Twojej przeglądarce internetowej. Włącz go, a następnie odśwież stronę, aby móc w pełni z niej korzystać.