Large scale style based dating of medieval manuscripts
2015 (English). In: Proc. 3rd International Workshop on Historical Document Imaging and Processing, ACM Digital Library, 2015. Conference paper (Refereed)
In this paper we propose a novel approach to manuscript dating based on shape statistics. Our goal was to develop a strategy well suited to a large-scale dating effort in which heterogeneous collections of thousands of manuscripts can be processed automatically. The proposed method takes a grayscale image as input, then uses the stroke width transform and a statistical model of the gradient image to find ink boundaries. Finally, a distribution over common shapes, quantified using shape context descriptors, is produced for each manuscript image. The proposed method is binarization-free, rotation invariant, and requires minimal segmentation. We evaluate our work on the collection “Svenskt diplomatariums huvudkartotek”, which contains more than 10,000 manuscripts and consists of charters from the medieval period of today's Sweden. The images, originally intended for web viewing, were of low quality and had compression artifacts. Thanks to unsupervised feature learning and regression, the collection could be dated with a median absolute error below 19 years, even though only 5% of the labels were used to train the estimator.
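To make two of the terms in the abstract concrete, the following is a minimal sketch, not the authors' implementation. It assumes a generic log-polar shape context histogram computed over a list of 2D boundary points, and it computes the median absolute error between true and predicted years. The function names, the bin counts, and the sample values are hypothetical, chosen only for illustration. Unlike the method described above, this simple descriptor uses absolute angles and is therefore not rotation invariant.

```python
import math
from statistics import median


def shape_context(points, index, n_radial=3, n_angular=4):
    """Normalised log-polar histogram of all other points, seen from points[index].

    Illustrative only: the bin layout is an assumption, not taken from the paper.
    """
    px, py = points[index]
    offsets = [(x - px, y - py) for i, (x, y) in enumerate(points) if i != index]
    # Drop points that coincide with the reference point (distance zero).
    offsets = [(dx, dy) for dx, dy in offsets if dx != 0 or dy != 0]
    if not offsets:
        raise ValueError("need at least one other distinct point")

    dists = [math.hypot(dx, dy) for dx, dy in offsets]
    mean_dist = sum(dists) / len(dists)  # dividing by this normalises for scale

    hist = [0] * (n_radial * n_angular)
    for (dx, dy), d in zip(offsets, dists):
        # Radial bin on a log2 scale; values outside the range are clamped
        # into the innermost or outermost bin.
        r_bin = int(math.floor(math.log2(d / mean_dist))) + 2
        r_bin = min(n_radial - 1, max(0, r_bin))
        # Angular bin from the absolute angle, mapped to [0, 2*pi).
        theta = math.atan2(dy, dx) % (2 * math.pi)
        a_bin = int(theta / (2 * math.pi) * n_angular) % n_angular
        hist[r_bin * n_angular + a_bin] += 1

    total = sum(hist)
    return [count / total for count in hist]


def median_absolute_error(true_years, predicted_years):
    """Median of the absolute differences between true and predicted years."""
    return median(abs(t - p) for t, p in zip(true_years, predicted_years))


# Hypothetical usage with made-up points and dates.
corners = [(0, 0), (1, 1), (-1, 1), (-1, -1), (1, -1)]
descriptor = shape_context(corners, 0)
print(len(descriptor), round(sum(descriptor), 6))
print(median_absolute_error([1300, 1350, 1400], [1310, 1345, 1430]))
```

The median is less sensitive than the mean to a small number of badly misdated charters, which is one reason it is a natural summary statistic for a dating task of this kind.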
Place, publisher, year, edition, pages
ACM Digital Library, 2015.
Computer Vision and Robotics (Autonomous Systems)
Research subject Computerized Image Processing
Identifiers
URN: urn:nbn:se:uu:diva-261747
DOI: 10.1145/2809544.2809560
OAI: oai:DiVA.org:uu-261747
DiVA: diva2:851125
HIP 2015, August 22, Nancy, France