Powiadomienia systemowe
- Sesja wygasła!
- Sesja wygasła!
- Sesja wygasła!
Tytuł artykułu
Treść / Zawartość
Pełne teksty:
Identyfikatory
Warianty tytułu
Języki publikacji
Abstrakty
We present a finite-state technology (FST) based system capable of performing metrical scansion of verse written in English. Scansion is the traditional task of analyzing the lines of a poem, marking the stressed and non-stressed elements and dividing the line into metrical feet. The system’s workflow is composed of several subtasks designer around finite-state machines that analyze verse by performing tokenization, part-of-speech tagging, stress placement, and stress-pattern prediction for unknown words. The scanner also classifies poems according to the predominant type of metrical foot found. We present a brief evaluation of the system using a gold standard corpus of humanscanned verse, on which a per-syllable accuracy of 86.78% is achieved. The program uses open-source components and is released under the GNU GPL license.
Słowa kluczowe
Wydawca
Czasopismo
Rocznik
Tom
Strony
3--28
Opis fizyczny
Bibliogr. 42 poz., rys., tab., wykr.
Twórcy
autor
- University of the Basque Country (UPV/EHU), Department of Computer Science, 20018 Donostia, Spain
autor
- University of the Basque Country (UPV/EHU), Department of Computer Science, 20018 Donostia, Spain
autor
- University of the Basque Country (UPV/EHU), Department of Computer Science, 20018 Donostia, Spain
autor
- University of Colorado Boulder, Department of Linguistics, Boulder, Colorado (USA)
Bibliografia
- [1] Manex Agirrezabal, Bertol Arrieta, Aitzol Astigarraga, and Mans Hulden (2013), POS-tag based poetry generation with WordNet, Proceedings of the 2013 European Workshop on Natural Language Generation, pp. 162-166.
- [2] Manex Agirrezabal, Jeffrey Heinz, Mans Hulden, and Bertol Arrieta (2014), Assigning stress to out-of-vocabulary words: three approaches, International Conference on Artificial Intelligence, 27: 105-110.
- [3] Wystan H. Auden (1960), The more loving one, https://www.poets.org/poetsorg/poem/more-loving-one.
- [4] Kenneth R. Beesley and Lauri Karttunen (2003), Finite-state morphology: Xerox tools and techniques, CSLI.
- [5] Terry V. F. Brogan (1981), English versification, 1570-1980: a reference guide with a global appendix, Johns Hopkins University Press.
- [6] Lewis Carroll (1869), Phantasmagoria and Other Poems, London: Macmillan, https://archive.org/details/phantasmagoriaot00carrrich.
- [7] Lewis Carroll (1916), Alice’s Adventures in Wonderland and Through the Looking Glass, George W. Jacobs & Company, Philadelphia.
- [8] Chih-Chung Chang and Chih-Jen Lin (2011), LIBSVM: A library for suport vector machines, ACM Transactions on Intelligent Systems and Technology, 2:27:1-27:27, software available AT http://www.csie.ntu.edu.tw/~cjlin/libsvm.
- [9] Noam Chomsky and Morris Halle (1968), The sound pattern of English, Harper & Row.
- [10] Alfred Corn (1997), The poem’s heartbeat: a manual of prosody, Copper Canyon Press.
- [11] Rong-En Fan, Kai-Wei Chang, Cho-Jui Hsieh, Xiang-Rui Wang, and Chih-Jen Lin (2008), LIBLINEAR: A library for large linear classification, The Journal of Machine Learning Research, 9:1871-1874, software available at http://www.csie.ntu.edu.tw/~cjlin/liblinear.
- [12] Robert Frost (1979), The poetry of Robert Frost: the collected poems, complete and unabridged, Henry Holt and Company.
- [13] Paul Fussell (1965), Poetic meter and poetic form, McGraw Hill.
- [14] Natalie Gerber (2013), Stress-based metrics revisited: a comparative exercise in scansion systems and their implications for iambic pentameter, Thinking Verse, III: 131-168.
- [15] Erica Greene, Tugba Bodrumlu, and Kevin Knight (2010), Automatic analysis of rhythmic poetry with applications to generation and translation, in Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, pp. 524-533.
- [16] Peter L. Groves (1998), Strange music: the metre of the English heroic line, volume 74 of ELS Monograph Series, English Literary Studies, University of Victoria.
- [17] Péter Halácsy, András Kornai, and Csaba Oravecz (2007), HunPos: an open source trigram tagger, in Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics (Interactive Poster and Demonstration Sessions), pp. 209-212.
- [18] Morris Halle and Jean-Roger Vergnaud (1987), An essay on stress, MIT Press.
- [19] Michael Hart (1971), Project Gutenberg, https://www.gutenberg.org/.
- [20] Charles O. Hartman (1996), Virtual muse: experiments in computer poetry, Wesleyan University Press.
- [21] Charles O. Hartman (2005), The Scandroid 1.1, http://oak.conncoll.edu/cohar/Programs.htm.
- [22] Mans Hulden (2006), Finite-state syllabification, in Anssi Yli-Jyrä, Lauri Karttunen, and Juhani Karhumäki, editors, Finite-State Methods and Natural Language Processing, volume 4002 of Lecture Notes in Computer Science, pp. 86-96, Springer.
- [23] Mans Hulden (2009), Foma: a finite-state compiler and library, in Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics (Demonstrations Session), pp. 29-32.
- [24] Harry M. Logan (1988), Computer analysis of sound and meter in poetry, College Literature, 15 (1): 19-24.
- [25] Henry W. Longfellow (1855), The Song of Hiawatha, David Bogue, https://archive.org/details/songhiawathathe00longrich.
- [26] Mitchell P. Marcus, Mary Ann Marcinkiewicz, and Beatrice Santorini (1993), Building a large annotated corpus of English: the Penn Treebank, Computational Linguistics, 19 (2): 313-330.
- [27] William G. M. McAleese (2007), Improving scansion with syntax: an investigation into the effectiveness of a syntactic analysis of poetry by computer using phonological scansion theory, Technical report, Department of Computing Faculty of Mathematics, Computing and Technology The Open University.
- [28] Harriet Monroe (1917), The new poetry: an anthology, The Macmillan Company, https://archive.org/details/newpoetryananth01hendgoog.
- [29] Augustus T. Murray (1925), Homer: The Iliad, Heinemann, https://archive.org/details/iliadmurray01homeuoft.
- [30] Josef Novak, Nobuaki Minematsu, and Keikichi Hirose (2012), WFST-based grapheme-to-phoneme conversion: open source tools for alignment, model-building and decoding, in Proceedings of the 10th Internaltional Workshop on Finite-State Methods and Natural Language Processing, pp. 45-49.
- [31] Marc R. Plamondon (2006), Virtual verse analysis: analysing patterns in poetry, Literary and Linguistic Computing, 21 (1): 127-141.
- [32] Margaret Robertson, editor (2007), Poems published in 1820 by John Keats, Project Gutenberg, http://www.gutenberg.org/ebooks/23684.
- [33] Emmanuel Roche and Yves Schabes (1995), Deterministic part-of-speech tagging with finite-state transducers, Computational Linguistics, 21 (2): 227-253.
- [34] David E. Rumelhart and James L. McClelland (1985), On learning the past tenses of English verbs, Technical report, Institute for Cognitive Science, University of California, San Diego.
- [35] Terrence J. Sejnowski and Charles R. Rosenberg (1987), Parallel networks that learn to pronounce English text, Complex Systems, 1 (1): 145-168.
- [36] William Shakespeare (1806), Romeo and Juliet, John Cawthorn.
- [37] William Shakespeare (1904), The tragedy of Hamlet, Cambridge University Press.
- [38] William Shakespeare (2011), Shakespeare’s sonnets, Project Gutenberg, https://archive.org/details/shakespearessonn01041gut.
- [39] Timothy Steele (1999), All the fun’s in how you say a thing: an explanation of meter and versification, Ohio University Press.
- [40] Wallace Stevens (1923), Harmonium, Academy of American Poets.
- [41] Herbert F. Tucker (2011), Poetic data and the news from poems: a *For Better for Verse* memoir, Victorian Poetry, 49 (2): 267-281.
- [42] Robert L. Weide (1998), The CMU pronunciation dictionary, release 0.6, Carnegie Mellon University.
Uwagi
Opracowanie rekordu w ramach umowy 509/P-DUN/2018 ze środków MNiSW przeznaczonych na działalność upowszechniającą naukę (2018).
Typ dokumentu
Bibliografia
Identyfikator YADDA
bwmeta1.element.baztech-ca6efb02-dfd3-498a-b7f4-ca33197d8ea0