Parsing the SynTagRus Treebank of Russian
2008 (English)In: Proceedings of the 22nd International Conference on Computational Linguistics (Coling 2008), 2008, 641-648 p.Conference paper (Refereed)
We present the first results on parsing the SYNTAGRUS treebank of Russian with a data-driven dependency parser, achieving a labeled attachment score of over 82% and an unlabeled attachment score of 89%. A feature analysis shows that high parsing accuracy is crucially dependent on the use of both lexical and morphological features. We conjecture that the latter result can be generalized to richly inflected languages in general, provided that sufficient amounts of training data are available.
Place, publisher, year, edition, pages
2008. 641-648 p.
, Proceedings of the International Conference on Computational Linguistics, ISSN 1525-2477
Language Technology (Computational Linguistics)
Research subject Computational Linguistics
IdentifiersURN: urn:nbn:se:uu:diva-87663ISBN: 978-1-905593-44-6OAI: oai:DiVA.org:uu-87663DiVA: diva2:132994
COLING 2008 : The 22nd International Conference on Computational Linguistics, Manchester, UK, Aug 18, 2008 - Aug 22, 2008