Parallel Data, Tools and Interfaces in OPUS
2012 (English)In: Workshop abstracts: eighth international conference on language resources and evaluation, 2012, 2214-2218 p.Conference paper (Refereed)
This paper presents the current status of OPUS, a growing language resource of parallel corpora and related tools. The focus in OPUS is to provide freely available data sets in various formats together with basic annotation to be useful for applications in computational linguistics, translation studies and cross-linguistic corpus studies. In this paper, we report about new data sets and their features, additional annotation tools and models provided from the website and essential interfaces and on-line services included in the project.
Place, publisher, year, edition, pages
2012. 2214-2218 p.
Language Technology (Computational Linguistics)
Research subject Computational Linguistics
IdentifiersURN: urn:nbn:se:uu:diva-189744ISI: 000323927702047ISBN: 978-2-9517408-7-7OAI: oai:DiVA.org:uu-189744DiVA: diva2:582220
Eight International Conference on Language Resources and Evaluation, MAY 21-27, 2012, Istanbul, Turkey