The purpose of the paper is twofold. First, to describe the already implemented idea of DjVu corpora, i.e. corpora which consist of both scanned images and a transcription of the texts with the words associated with their occurrences in the scans. Secondly, to present a case study of a corpus consisting of almost 5 000 pages of Polish historical texts dating from 1570 to 1756 (it is practically the very first corpus of historical Polish). The tools described have universal character and are freely available under the GNU GPL license, hence they can be used also for other purposes.
2
Dostęp do pełnego tekstu na zewnętrznej witrynie WWW
This article is intended to be a position paper on advantages of free and open software for statistics and its applications to biometrics and biostatistics. Especially, the authors focus on the R package viewed as a new and still insufficiently recognized or received by the scientists, researchers, students, etc. Sample statistical computations and tests in biometrics are presented, and the most common functions and procedures are analysed and compared. Although this is a position paper from the point of view of applied computer science, the authors briefly present some original results within applied statistics (in particular: biometrics) and related computational methods using dedicated software.
JavaScript jest wyłączony w Twojej przeglądarce internetowej. Włącz go, a następnie odśwież stronę, aby móc w pełni z niej korzystać.