This article presents a number of historiographic and lexicographic facts in order to recall how the journal Naše řeč (Our Speech) looked during the first half of the 20th century, what was published in it, and, in particular, how closely it was tied to the Czech Dictionary Office during the period when extensive lexical data were collected and the nine-volume Příruční slovník jazyka českého (Desk Dictionary of Czech) was prepared and published.
This article discusses some aspects of searching for grammatical information in corpora. It argues that any search procedure must consist of at least three fundamentally different steps. First, a hypothesis regarding some grammatical property of the language system must be formulated in terms of the available “tagging” menu. Second, general instructions concerning the sample size, the relevant context size, etc. must be stated. Only then can the third step be taken, i.e. the search proper and the interpretation of the attested data. Examples from the Czech National Corpus are offered to show that the boundary between grammaticality and non-grammaticality of a phenomenon or category is not a binary opposition but a probability scale with more than two values, and that the corpus may serve as an important tool for locating the most probable (preferred) point on that scale. The issue of zero versus non-zero occurrence of a phenomenon is discussed in greater detail. It is argued that if no example of a phenomenon is attested in the corpus, it does not necessarily follow that the corpus is too small, or that turning to a larger corpus is necessary or meaningful.
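The point about zero occurrence can be illustrated with a small calculation. The sketch below is not taken from the article; it assumes, as a strong simplification, that each corpus token is an independent trial in which the phenomenon occurs with a fixed relative frequency `p`. The function names and the corpus size in the usage lines are hypothetical. Under that assumption, zero hits in a corpus of `n` tokens still bounds `p` from above (roughly `3 / n` at 95% confidence), which suggests that an empty result can be informative rather than merely a sign that the corpus is too small.

```python
import math


def prob_zero_hits(p: float, corpus_size: int) -> float:
    """Probability of seeing no occurrence in corpus_size tokens,
    assuming independent tokens and a per-token frequency p (0 <= p < 1)."""
    if not 0.0 <= p < 1.0:
        raise ValueError("p must be in [0, 1)")
    if corpus_size <= 0:
        raise ValueError("corpus_size must be positive")
    # (1 - p) ** corpus_size, computed via logs for numerical stability.
    return math.exp(corpus_size * math.log1p(-p))


def zero_hit_upper_bound(corpus_size: int, confidence: float = 0.95) -> float:
    """Largest per-token frequency p still compatible with zero hits
    at the given confidence level (same independence assumption)."""
    if corpus_size <= 0:
        raise ValueError("corpus_size must be positive")
    if not 0.0 < confidence < 1.0:
        raise ValueError("confidence must be in (0, 1)")
    # Solve (1 - p) ** corpus_size = 1 - confidence for p.
    return -math.expm1(math.log1p(-confidence) / corpus_size)


if __name__ == "__main__":
    n = 100_000_000  # hypothetical corpus size in tokens
    bound = zero_hit_upper_bound(n)
    print(f"zero hits in {n} tokens: p is below about {bound:.3e}")
    print(f"rule-of-three approximation: {3 / n:.3e}")
```

Real corpus tokens are not independent (texts cluster by author, genre, and topic), so the bound is best read as an order-of-magnitude guide, not a precise estimate.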