uu.seUppsala University Publications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Ensemble of Classifiers for Noise Detection in PoS Tagged Corpora
2000 (English)In: Proceedings of the Third International Workshop on TEXT, SPEECH and DIALOGUE, 2000, 27-32 p.Conference paper, Published paper (Refereed)
Abstract [en]

In this paper we apply the ensemble approach to the identification of incorrectly annotated items (noise) in a training set. In a controlled experiment, memory-based, decision tree-based and transformation-based classifiers are used as a filter to detect and remove noise deliberately introduced into a manually tagged corpus. The results indicate that the method can be successfully applied to automatically detect errors in a corpus.

Place, publisher, year, edition, pages
2000. 27-32 p.
National Category
Language Technology (Computational Linguistics)
Research subject
Computational Linguistics
Identifiers
URN: urn:nbn:se:uu:diva-19665OAI: oai:DiVA.org:uu-19665DiVA: diva2:47437
Conference
TEXT, SPEECH and DIALOGUE
Available from: 2006-11-30 Created: 2006-11-30 Last updated: 2017-01-25

Open Access in DiVA

No full text

Authority records BETA

Megyesi, Beata

Search in DiVA

By author/editor
Megyesi, Beata
Language Technology (Computational Linguistics)

Search outside of DiVA

GoogleGoogle Scholar

urn-nbn

Altmetric score

urn-nbn
Total: 357 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf