uu.seUppsala University Publications
Change search
ReferencesLink to record
Permanent link

Direct link
Comparing Data-Driven Learning Algorithms for PoS Tagging of Swedish
Uppsala University, Disciplinary Domain of Humanities and Social Sciences, Faculty of Languages, Department of Linguistics and Philology.ORCID iD: 0000-0002-4838-6518
2001 (English)In: Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP 2001), 2001Conference paper (Refereed)
Abstract [en]

The aim of this study is a systematic evaluation and comparison of four state-of-the-art data-driven learning algorithms applied to part of speech tagging of Swedish. The algorithms included in this study are Hidden Markov Model, Maximum Entropy, Memory-Based Learning, and Transformation-Based Learning. The systems are evaluated from several aspects. Both the effects of tag set and the effects of the size of training data are examined. The accuracy is calculated as well as the error rate for known and unknown tokens. The results show differences between the approaches due to the different linguistic information built into the systems.

Place, publisher, year, edition, pages
National Category
Language Technology (Computational Linguistics)
Research subject
Computational Linguistics
URN: urn:nbn:se:uu:diva-19650OAI: oai:DiVA.org:uu-19650DiVA: diva2:47422
Empirical Methods in Natural Language Processing
Available from: 2006-11-30 Created: 2006-11-30 Last updated: 2016-03-08

Open Access in DiVA

No full text

Search in DiVA

By author/editor
Megyesi, Beata
By organisation
Department of Linguistics and Philology
Language Technology (Computational Linguistics)

Search outside of DiVA

GoogleGoogle Scholar
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

Total: 166 hits
ReferencesLink to record
Permanent link

Direct link