Open this publication in new window or tab >>2023 (English)In: Proceedings of the 24th Nordic Conference on Computational Linguistics (NoDaLiDa) / [ed] Tanel Alumäe; Mark Fishel, Tartu: University of Tartu, 2023, p. 335-346Conference paper, Published paper (Refereed)
Abstract [en]
In this study, we aim to find a parser for accurately identifying different types of subordinate clauses, and related phenomena, in 19th–20th-century Swedish literature. Since no test set is available for parsing from this time period, we propose a lightweight annotation scheme for annotating a single relation of interest per sentence. We train a variety of parsers for Swedish and compare evaluations on standard modern test sets and our targeted test set. We find clear trends in which parser types perform best on the standard test sets, but that performance is considerably more varied on the targeted test set. We believe that our proposed annotation scheme can be useful for complementing standard evaluations, with a low annotation effort.
Place, publisher, year, edition, pages
Tartu: University of Tartu, 2023
Series
NEALT Proceedings Series, ISSN 1736-8197, E-ISSN 1736-6305 ; 52
Keywords
Dependency parsing, syntactic analysis, 19th century Swedish, evaluation
National Category
Natural Language Processing General Language Studies and Linguistics
Research subject
Computational Linguistics; Scandinavian Languages
Identifiers
urn:nbn:se:uu:diva-505805 (URN)978-99-1621-999-7 (ISBN)
Conference
The 24th Nordic Conference on Computational Linguistics (NoDaLiDa), 22-24 May, 2023, Tórshavn, Faroe Islands
Projects
Fictional prose and language change. The role of colloquialization in the history of Swedish 1830–1930
Funder
Swedish Research Council, 2020-02617
2023-06-212023-06-212025-02-01Bibliographically approved