uu.seUppsala University Publications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
From raw text to Universal Dependencies: look, no tags!
Uppsala University, Disciplinary Domain of Humanities and Social Sciences, Faculty of Languages, Department of Linguistics and Philology. (Computational Linguistics)ORCID iD: 0000-0001-8844-2126
Uppsala University, Disciplinary Domain of Humanities and Social Sciences, Faculty of Languages, Department of Linguistics and Philology. (Computational Linguistics)
Uppsala University, Disciplinary Domain of Humanities and Social Sciences, Faculty of Languages, Department of Linguistics and Philology. (Computational Linguistics)
Bar-Ilan University.
Show others and affiliations
2017 (English)In: Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies, Vancouver, Canada: Association for Computational Linguistics, 2017, 207-217 p.Conference paper, Published paper (Refereed)
Abstract [en]

We present the Uppsala submission to the CoNLL 2017 shared task on parsing from raw text to universal dependencies. Our system is a simple pipeline consisting of two components. The first performs joint word and sentence segmentation on raw text; the second predicts dependency trees from raw words. The parser bypasses the need for part-of-speech tagging, but uses word embeddings based on universal tag distributions. We achieved a macroaveraged LAS F1 of 65.11 in the official test run and obtained the 2nd best result for sentence segmentation with a score of 89.03. After fixing two bugs, we obtained an unofficial LAS F1 of 70.49.

Place, publisher, year, edition, pages
Vancouver, Canada: Association for Computational Linguistics, 2017. 207-217 p.
Keyword [en]
dependency, parsing, multilingual, segmentation
National Category
Language Technology (Computational Linguistics)
Research subject
Computational Linguistics
Identifiers
URN: urn:nbn:se:uu:diva-333439ISBN: 978-1-945626-70-8 (electronic)OAI: oai:DiVA.org:uu-333439DiVA: diva2:1156633
Conference
CoNLL 2017, August 3-4, 2017, Vancouver, Canada
Available from: 2017-11-13 Created: 2017-11-13 Last updated: 2017-11-15Bibliographically approved

Open Access in DiVA

fulltext(184 kB)1 downloads
File information
File name FULLTEXT01.pdfFile size 184 kBChecksum SHA-512
7c7756b475ff1cc8b86486b16a7dbf9e7c371038302f3d280033598c642d7fe674e1e0a26d4d1ca3e90510050e9afdb4f9570399c27b0ce189b71a2ad1086bf0
Type fulltextMimetype application/pdf

Other links

http://universaldependencies.org/conll17/proceedings/pdf/K17-3022.pdf

Search in DiVA

By author/editor
de Lhoneux, MiryamYan, ShaoBasirat, AliStymne, SaraNivre, Joakim
By organisation
Department of Linguistics and Philology
Language Technology (Computational Linguistics)

Search outside of DiVA

GoogleGoogle Scholar
Total: 1 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

Total: 13 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf