Title variants
Internet corpus as a source of linguistic information: some limitations
Publication languages
Abstracts
The aim of the present analysis is to show the use of an Internet corpus in syntactic studies of Slavic languages (especially Russian). Corpus analysis is treated as a research tool useful for describing both the linguistic system and linguistic activity. Information drawn from the corpus makes it possible to determine the frequency of occurrence of units and their combinations in texts, as well as the regularity with which features or properties occur in paradigmatic classes. Corpus analysis also makes it possible to verify whether a particular valence property is characteristic of a given word. The author shows that the use of an Internet corpus in syntactic research has its limitations. Corpus analysis is effective for frequent phenomena, but it does not always allow less typical phenomena to be documented (for example, occasional and potential combinations of tokens). One of the author’s conclusions is that corpus analysis should be combined with introspection and qualitative analysis.
Journal
Year
Issue
Pages
75-97
Physical description
Bibliography
Document type
Bibliography
Identifiers
YADDA identifier
bwmeta1.element.mhp-5b90c815-a2a0-4306-a49e-5a4bd29d6691