Legacy Data Migration: A pilot study on the methodological feasibility of conversion and enhancement of electronic resources.
2004 (English). In: Joint International Conference of the Association for Literary and Linguistic Computing and the Association for Computers and the Humanities: Göteborg University, June 11-16, 2004. Conference paper (Refereed)
This paper describes a pilot study based on two different subsets of electronic resources intended for use in the Virtual Corpus system developed at the Oxford Text Archive (OTA).
The Virtual Corpus system is designed to make the OTA more useful for researchers by enabling the selection of texts for a corpus on the basis of metadata categories in the TEI header resource description. Currently these categories include fields such as language, date, genre, and author (Berglund & Wynne, 2003). To make the Virtual Corpus (VC) system even more useful, the texts would benefit from data enhancement.
The pilot study aims to evaluate the procedures needed to enhance the metadata available in the TEI header with further categories, and to explore the possibilities for migrating legacy data from a wide range of formats into a TEI-conformant XML format.
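As an illustration of selecting texts by TEI header metadata, the following is a minimal sketch in Python and not the OTA implementation. The sample headers, the element paths, the `ident` attribute on `language`, and the names `header_metadata` and `select_texts` are all assumptions made for the example; real OTA headers are richer and may be structured differently.

```python
import xml.etree.ElementTree as ET

# Hypothetical, heavily simplified TEI headers (no namespace, invented values).
SAMPLE_HEADERS = {
    "text-a": """<TEI><teiHeader>
        <fileDesc><titleStmt><title>Sample A</title>
        <author>Author One</author></titleStmt></fileDesc>
        <profileDesc><creation><date>1850</date></creation>
        <langUsage><language ident="en"/></langUsage></profileDesc>
    </teiHeader></TEI>""",
    "text-b": """<TEI><teiHeader>
        <fileDesc><titleStmt><title>Sample B</title>
        <author>Author Two</author></titleStmt></fileDesc>
        <profileDesc><creation><date>1900</date></creation>
        <langUsage><language ident="sv"/></langUsage></profileDesc>
    </teiHeader></TEI>""",
    "text-c": """<TEI><teiHeader>
        <fileDesc><titleStmt><title>Sample C</title>
        <author>Author One</author></titleStmt></fileDesc>
        <profileDesc><creation><date>1900</date></creation>
        <langUsage><language ident="en"/></langUsage></profileDesc>
    </teiHeader></TEI>""",
}


def header_metadata(xml_text):
    """Extract a few metadata fields from one (assumed) TEI header layout."""
    header = ET.fromstring(xml_text).find("teiHeader")

    def text_of(path):
        # Return the stripped text of the first matching element, or None.
        node = header.find(path)
        return node.text.strip() if node is not None and node.text else None

    language = header.find("profileDesc/langUsage/language")
    return {
        "author": text_of("fileDesc/titleStmt/author"),
        "date": text_of("profileDesc/creation/date"),
        "language": language.get("ident") if language is not None else None,
    }


def select_texts(headers, **criteria):
    """Return the sorted ids of texts whose metadata matches every criterion."""
    return sorted(
        text_id
        for text_id, xml_text in headers.items()
        if all(header_metadata(xml_text).get(k) == v for k, v in criteria.items())
    )


english_texts = select_texts(SAMPLE_HEADERS, language="en")
english_1900 = select_texts(SAMPLE_HEADERS, language="en", date="1900")
```

Each keyword argument acts as one metadata category, so adding a further category to the header (the kind of enhancement the pilot study evaluates) would only require one more entry in the dictionary returned by `header_metadata`.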
Information Science; Language Technology (Computational Linguistics)
Identifiers: URN: urn:nbn:se:uu:diva-78888; OAI: oai:DiVA.org:uu-78888; DiVA: diva2:106801