uu.seUppsala University Publications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Inducing Baseform Models from a Swedish Vocabulary Pool
Uppsala University, Humanistisk-samhällsvetenskapliga vetenskapsområdet, Faculty of Languages, Department of Linguistics and Philology. Datorlingvistik.
2007 (English)In: Proceedings of the 16th Nordic Conference of Computational Linguistics NODALIDA-2007, 2007, 51-58 p.Conference paper, Published paper (Refereed)
Abstract [en]

In many language technology applications, we need to map wordforms to a citation form or baseform, or the other way around, e.g. for lexicon lookup or for representational purposes.

In this paper, we used a suffix trie mapper with suffix-change probabilities, and computed wordform-baseform and baseform-wordform models from eight subsets of a ranked Swedish vocabulary. All models were evaluated for both directions on a testset, and four of the models were also evaluated for wordform-baseform mapping on five unseen texts.

For wordform-baseform mapping, the best models performed on par with state-of-the-art systems. Most models were useful for some situation—given mapping direction, and time and space restrictions—but no model was best for all situations.

Place, publisher, year, edition, pages
2007. 51-58 p.
Keyword [en]
morfologi, statistisk modell, utvärdering
National Category
Language Technology (Computational Linguistics) Specific Languages
Identifiers
URN: urn:nbn:se:uu:diva-11108ISBN: 978-9985-4-0514-7 (print)OAI: oai:DiVA.org:uu-11108DiVA: diva2:38876
Available from: 2007-05-28 Created: 2007-05-28

Open Access in DiVA

No full text

Other links

http://hdl.handle.net/10062/2519
By organisation
Department of Linguistics and Philology
Language Technology (Computational Linguistics)Specific Languages

Search outside of DiVA

GoogleGoogle Scholar

isbn
urn-nbn

Altmetric score

isbn
urn-nbn
Total: 322 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf