Legacy Data Migration: A pilot study on the methodological feasibility of conversion and enhancement of electronic resources.
Uppsala University, Disciplinary Domain of Humanities and Social Sciences, Faculty of Social Sciences, Department of Education; Faculty of Educational Sciences, Department of Teacher Training. (Research Group for the Sociology of Education and Culture (SEC))
2004 (English). In: Joint International Conference of the Association for Literary and Linguistic Computing and the Association for Computers and the Humanities: Göteborg University, June 11-16, 2004. Conference paper (Refereed)
Abstract [en]

In this paper we describe a pilot study based on two subsets of electronic resources intended for use in the Virtual Corpus system developed at the Oxford Text Archive (OTA).

The Virtual Corpus system (VC) is designed to make the OTA more useful for researchers by enabling them to select texts for a corpus on the basis of metadata categories in the TEI header resource description. Currently these categories include fields such as language, date, genre and author (Berglund & Wynne, 2003). To make the VC even more useful, the texts would benefit from data enhancement.

The aim of this pilot study is to evaluate the procedures needed to enhance the metadata available in the TEI header with further categories, and to explore the possibilities for migrating legacy data in a wide range of formats into a TEI-conformant XML format.

National Category
Information Science; Language Technology (Computational Linguistics)
URN: urn:nbn:se:uu:diva-78888 OAI: oai:DiVA.org:uu-78888 DiVA: diva2:106801
Available from: 2007-02-14 Created: 2007-02-14

Author: Langerth Zetterman, Monica