Uppsala University Publications (uu.se)
Feature Weight Optimization and Pruning in Historical Text Recognition
Uppsala University, Disciplinary Domain of Science and Technology, Mathematics and Computer Science, Department of Information Technology, Division of Visual Information and Interaction. Uppsala University, Disciplinary Domain of Science and Technology, Mathematics and Computer Science, Department of Information Technology, Computerized Image Analysis and Human-Computer Interaction.
Uppsala University, Disciplinary Domain of Science and Technology, Mathematics and Computer Science, Department of Information Technology, Division of Visual Information and Interaction. Uppsala University, Disciplinary Domain of Science and Technology, Mathematics and Computer Science, Department of Information Technology, Computerized Image Analysis and Human-Computer Interaction. ORCID iD: 0000-0002-4405-6888
2013 (English)
In: Advances in Visual Computing: 9th International Symposium, ISVC 2013, Rethymnon, Crete, Greece, July 29-31, 2013. Proceedings, Part II / [ed] George Bebis, Springer Berlin/Heidelberg, 2013, 98-107 p.
Conference paper, Published paper (Refereed)
Abstract [en]

In handwritten text recognition, "sliding window" feature extraction represents the visual information contained in written text as sequences of feature vectors. In this paper, we explore the parameter space of feature weights, searching for optimal weights and performing feature selection with the coordinate descent method. We report a gain of about 5% in AUC performance. We evaluate on a public dataset and also discuss the effects and limitations of "word pruning," a technique commonly used in word spotting to boost performance and save computational time.
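As a rough illustration of the coordinate descent idea mentioned in the abstract, the sketch below adjusts one feature weight at a time while holding the others fixed, and keeps a change only if it improves a score. Everything here is an assumption made for illustration: the function names, the candidate weight grid, and the toy score are hypothetical, and in the paper's setting the score would presumably be the AUC of a word-spotting run rather than the stand-in used here. A weight of 0.0 in the grid corresponds to dropping a feature, which is one simple way to fold feature selection into the same search.

```python
from typing import Callable, List, Sequence


def coordinate_descent(
    score: Callable[[Sequence[float]], float],
    weights: Sequence[float],
    candidates: Sequence[float] = (0.0, 0.25, 0.5, 1.0, 2.0, 4.0),
    max_sweeps: int = 10,
) -> List[float]:
    """Maximise score(weights) by changing one weight at a time.

    `candidates` is a hypothetical grid of values tried for each weight;
    0.0 effectively removes the corresponding feature.
    """
    w = list(weights)
    best = score(w)
    for _ in range(max_sweeps):
        improved = False
        for i in range(len(w)):
            kept = w[i]  # best value found so far for coordinate i
            for c in candidates:
                w[i] = c
                s = score(w)
                if s > best:
                    best, kept, improved = s, c, True
            w[i] = kept  # restore the best value before moving on
        if not improved:
            break  # a full sweep changed nothing: local optimum on the grid
    return w


def toy_score(w: Sequence[float]) -> float:
    """Stand-in objective (NOT an AUC): peaks at the weights [1.0, 0.0, 2.0]."""
    target = (1.0, 0.0, 2.0)
    return -sum((a - b) ** 2 for a, b in zip(w, target))


result = coordinate_descent(toy_score, [1.0, 1.0, 1.0])
```

On this toy objective the search ends with the second weight at 0.0, i.e. that feature is pruned. Coordinate descent of this kind only guarantees a local optimum with respect to the chosen grid.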

Place, publisher, year, edition, pages
Springer Berlin/Heidelberg, 2013. 98-107 p.
Series
Lecture Notes in Computer Science, ISSN 0302-9743 ; 8034
Keyword [en]
handwritten text recognition
National Category
Computer Vision and Robotics (Autonomous Systems)
Research subject
Computerized Image Analysis; Computerized Image Processing
Identifiers
URN: urn:nbn:se:uu:diva-212536
DOI: 10.1007/978-3-642-41939-3_10
ISI: 000335169000010
ISBN: 978-3-642-41939-3 (print)
ISBN: 978-3-642-41938-6 (print)
OAI: oai:DiVA.org:uu-212536
DiVA: diva2:678242
Conference
9th International Symposium, ISVC 2013, July 29-31, 2013, Rethymnon, Crete, Greece
Projects
From Quill to Bytes; q2b; q2b_vr2012
Funder
Swedish Research Council, 2012-5743
Available from: 2013-12-11 Created: 2013-12-11 Last updated: 2017-02-05 Bibliographically approved
In thesis
1. Interpreting the Script: Image Analysis and Machine Learning for Quantitative Studies of Pre-modern Manuscripts
2017 (English). Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

The humanities have long been a collection of fields that have not gained from the advances in computational power predicted by Moore's law. Fields like medicine, biology, physics, chemistry, geology and economics have all developed quantitative tools that take advantage of the exponential increase of processing power over time. Recent advances in computerized pattern recognition, in combination with the rapid digitization of historical document collections around the world, are about to change this.

The first part of this dissertation focuses on constructing a full system for finding handwritten words in historical manuscripts. A novel segmentation algorithm is presented, capable of finding and separating text lines in pre-modern manuscripts. Text recognition is performed by translating the image data of the text lines into sequences of numbers, called features. Commonly used features are analysed and evaluated on manuscript sources from the Uppsala University Library (Carolina Rediviva) and the US Library of Congress. Decoding the text in the vast number of photographed manuscripts from our libraries makes computational linguistics and social network analysis directly applicable to historical sources. Hence, text recognition is considered a key technology for the future of computerized research methods in the humanities.
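To make the idea of turning a text-line image into a sequence of feature vectors concrete, here is a minimal sketch under stated assumptions. It slides a one-pixel-wide window across a binarised image and records three simple per-column measurements (ink pixel count, topmost ink row, bottommost ink row). These particular features, the function name, and the -1 marker for empty columns are illustrative choices, not necessarily the features analysed in the thesis.

```python
from typing import List


def sliding_window_features(image: List[List[int]]) -> List[List[int]]:
    """Return one feature vector per pixel column of a binary text-line image.

    image[r][c] is 1 for ink and 0 for background; row 0 is the top.
    Each vector is [ink_count, upper_row, lower_row]; columns without
    ink get -1 for both row positions (an arbitrary marker).
    """
    height = len(image)
    width = len(image[0]) if height else 0
    features = []
    for c in range(width):
        # Rows in this column that contain ink, in top-to-bottom order.
        ink_rows = [r for r in range(height) if image[r][c]]
        if ink_rows:
            upper, lower = ink_rows[0], ink_rows[-1]
        else:
            upper, lower = -1, -1
        features.append([len(ink_rows), upper, lower])
    return features


# A tiny 3x3 "image": column 0 is empty, columns 1 and 2 contain ink.
demo = [
    [0, 1, 0],
    [0, 1, 1],
    [0, 0, 1],
]
demo_features = sliding_window_features(demo)
```

The output has one vector per column, so a word image of width W becomes a sequence of W feature vectors that can then be compared with sequence-matching methods.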

The second part of this thesis addresses digital palaeography, using a computer's superior capacity for endlessly performing measurements on ink stroke shapes. Objective criteria for character shapes only partly capture what a palaeographer uses to assess similarity. The palaeographer often gets a feel for the scribe's style, which is, however, hard to quantify. A method for identifying the scribal hands of a pre-modern copy of the Revelations of Saint Bridget of Sweden, using semi-supervised learning, is presented. Methods for production year estimation are presented and evaluated on a collection of close to 11,000 medieval charters. The production dates are estimated using a Gaussian process, where the uncertainty is inferred together with the most likely production year.

In summary, this dissertation presents several novel methods related to image analysis and machine learning. In combination with recent advances in the field, they enable efficient computational analysis of very large collections of historical documents.

Place, publisher, year, edition, pages
Uppsala: Acta Universitatis Upsaliensis, 2017. 95 p.
Series
Digital Comprehensive Summaries of Uppsala Dissertations from the Faculty of Science and Technology, ISSN 1651-6214 ; 1475
Keyword
document analysis, machine learning, image analysis, digital humanities, document dating, writer identification, text recognition
National Category
Computer Vision and Robotics (Autonomous Systems)
Research subject
Computer Science
Identifiers
URN: urn:nbn:se:uu:diva-314211
ISBN: 978-91-554-9814-6
Public defence
2017-03-24, Tidskriftläsesalen, Carolina rediviva, Dag Hammarskjölds väg 1, Uppsala, 10:15 (English)
Opponent
Supervisors
Projects
q2b
Available from: 2017-03-02 Created: 2017-01-31 Last updated: 2017-03-06

Open Access in DiVA

fulltext (342 kB), 46 downloads
File information
File name: FULLTEXT01.pdf
File size: 342 kB
Checksum (SHA-512):
0bcdd4b0bea0fb70fabe43b275dce98722428cc6fde3ec419d911f3791caca7b5a39a42ea8c4c423b012a01697fd2af01a94799f57f947c565441d317347d293
Type: fulltext
Mimetype: application/pdf

Authority records

Wahlberg, Fredrik; Brun, Anders