uu.seUppsala University Publications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Legacy Data Migration: A pilot study on the methodological feasibility of conversion and enhancement of electronic resources.
Uppsala University, Humanistisk-samhällsvetenskapliga vetenskapsområdet, Faculty of Social Sciences, Department of Education. Faculty of Educational Sciences, Department of Teacher Training. (Forskningsgruppen för utbildnings- och kultursociologi (SEC))
2004 (English)In: Joint International Conference of the Association for Literary and Linguistic Computing and the Association for Computers and the Humanities: Göteborg University, June 11 - 16, 2004., 2004Conference paper, Published paper (Refereed)
Abstract [en]

In this paper we will describe a pilot study based on two different subsets of electronic resources to be used in the Virtual Corpus system developed at the Oxford Text Archive (OTA).

The Virtual Corpus system is designed to make the OTA more useful for researchers by enabling the selection of texts for a corpus on basis of metadata categories in the TEI header resource description. Currently these categories include such fields as language, date, genre, author etc. (Berglund & Wynne, 2003). In order to make the Virtual Corpus system (VC) even more useful the texts would benefit from data enhancement.

This pilot study's aim is to evaluate the necessary procedures for enhancing the metadata available in the TEI header with further categories and also to explore the possibilities for migration of legacy data in a wide range of formats into a TEI-conformant XML format.

Place, publisher, year, edition, pages
2004.
National Category
Information Science Language Technology (Computational Linguistics)
Identifiers
URN: urn:nbn:se:uu:diva-78888OAI: oai:DiVA.org:uu-78888DiVA: diva2:106801
Available from: 2007-02-14 Created: 2007-02-14

Open Access in DiVA

No full text

Other links

http://www.allc.org/refdocs/conf04_en.htm

Authority records BETA

Langerth Zetterman, Monica

Search in DiVA

By author/editor
Langerth Zetterman, Monica
By organisation
Department of EducationDepartment of Teacher Training
Information ScienceLanguage Technology (Computational Linguistics)

Search outside of DiVA

GoogleGoogle Scholar

urn-nbn

Altmetric score

urn-nbn
Total: 543 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf