The paper deals with the Russian Cyrillic character ⟨ё⟩, which stands for the sound [ʲo]/[jo], and its transcription into Czech. This character is used only facultatively and is usually replaced by the Cyrillic character ⟨е⟩. This causes potentially troublesome homography since the character ⟨е⟩ could stand either for [ʲo]/[jo] (⟨е⟩ replacing ⟨ё⟩) or for [ʲe]/[je] (the “usual” ⟨е⟩). The paper investigates how chosen Russian surnames containing this facultative character (Потёмкин, Рублёв etc.) are transcribed into the Czech Latin alphabet and discusses the reasons for transcribing it into Czech (“appropriately”) as ⟨o⟩, ⟨jo⟩ or (“inappropriately”) as ⟨e⟩, ⟨je⟩, ⟨ě⟩.
The paper presents a classification of the types of morphological ambiguity and the types of homophony and homography in contemporary Czech occurring in the material of the SYN and SYN2013PUB corpora of the Czech National Corpus. The classification of homonymy and homography constitutes a data base for the rule-based automatic morphological disambiguation of written Czech performed in the Institute of Theoretical and Computational Linguistics at the Faculty of Arts, Charles University. As for homophony, the types presented in the paper and mainly the sets of word forms associated with these types, can be used for the disambiguation of spoken texts.
JavaScript jest wyłączony w Twojej przeglądarce internetowej. Włącz go, a następnie odśwież stronę, aby móc w pełni z niej korzystać.