Logo: to the web site of Uppsala University

uu.sePublications from Uppsala University
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Paired Image to Image Translation for Strikethrough Removal from Handwritten Words
Uppsala University, Disciplinary Domain of Science and Technology, Mathematics and Computer Science, Department of Information Technology, Division of Visual Information and Interaction.ORCID iD: 0000-0002-5010-9149
Uppsala University, Disciplinary Domain of Humanities and Social Sciences, Faculty of Arts, Department of ALM. Centre for Digital Humanities Uppsala.ORCID iD: 0000-0003-4480-3158
Uppsala University, Disciplinary Domain of Science and Technology, Mathematics and Computer Science, Department of Information Technology, Division of Visual Information and Interaction.ORCID iD: 0000-0003-1054-2754
2022 (English)In: DOCUMENT ANALYSIS SYSTEMS, DAS 2022 / [ed] Uchida, S Barney, E Eglin, V, Springer Nature, 2022, Vol. 13237, p. 309-322Conference paper, Published paper (Refereed)
Abstract [en]

Transcribing struck-through, handwritten words, for example for the purpose of genetic criticism, can pose a challenge to both humans and machines, due to the obstructive properties of the superimposed strokes. This paper investigates the use of paired image to image translation approaches to remove strikethrough strokes from handwritten words. Four different neural network architectures are examined, ranging from a few simple convolutional layers to deeper ones, employing Dense blocks. Experimental results, obtained from one synthetic and one genuine paired strikethrough dataset, confirm that the proposed paired models outperform the CycleGAN-based state of the art, while using less than a sixth of the trainable parameters.

Place, publisher, year, edition, pages
Springer Nature, 2022. Vol. 13237, p. 309-322
Series
Lecture Notes in Computer Science, ISSN 0302-9743, E-ISSN 1611-3349
Keywords [en]
Strikethrough removal, Paired image to image translation, Handwritten words, Document image processing
National Category
Computer graphics and computer vision
Identifiers
URN: urn:nbn:se:uu:diva-488232DOI: 10.1007/978-3-031-06555-2_21ISI: 000870314500021ISBN: 978-3-031-06555-2 (electronic)ISBN: 978-3-031-06554-5 (print)OAI: oai:DiVA.org:uu-488232DiVA, id: diva2:1710684
Conference
15th IAPR International Workshop on Document Analysis Systems (DAS), MAY 22-25, 2022, La Rochelle Univ, La Rochelle, FRANCE
Funder
Swedish Research Council, 2018-05973Riksbankens Jubileumsfond, P19-0103:1Available from: 2022-11-14 Created: 2022-11-14 Last updated: 2025-02-07Bibliographically approved
In thesis
1. Document Image Processing for Handwritten Text Recognition: Deep Learning-based Transliteration of Astrid Lindgren’s Stenographic Manuscripts
Open this publication in new window or tab >>Document Image Processing for Handwritten Text Recognition: Deep Learning-based Transliteration of Astrid Lindgren’s Stenographic Manuscripts
2023 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

Document image processing and handwritten text recognition have been applied to a variety of materials, scripts, and languages, both modern and historic. They are crucial building blocks in the on-going digitisation efforts of archives, where they aid in preserving archival materials and foster knowledge sharing. The latter is especially facilitated by making document contents available to interested readers who may have little to no practice in, for example, reading a specific script type, and might therefore face challenges in accessing the material.  

The first part of this dissertation focuses on reducing editorial artefacts, specifically in the form of struck-through words, in manuscripts. The main goal of this process is to identify struck-through words and remove as much of the strikethrough artefacts as possible in order to regain access to the original word. This step can serve both as preprocessing, to aid human annotators and readers, as well as in computerised pipelines, such as handwritten text recognition. Two deep learning-based approaches, exploring paired and unpaired data settings, are examined and compared. Furthermore, an approach for generating synthetic strikethrough data, for example, for training and testing purposes, and three novel datasets are presented. 

The second part of this dissertation is centred around applying handwritten text recognition to the stenographic manuscripts of Swedish children's book author Astrid Lindgren (1907 - 2002). Manually transliterating stenography, also known as shorthand, requires special domain knowledge of the script itself. Therefore, the main focus of this part is to reduce the required manual work, aiming to increase the accessibility of the material. In this regard, a baseline for handwritten text recognition of Swedish stenography is established. Two approaches for improving upon this baseline are examined. Firstly, a variety of data augmentation techniques, commonly-used in handwritten text recognition, are studied. Secondly, different target sequence encoding methods, which aim to approximate diplomatic transcriptions, are investigated. The latter, in combination with a pre-training approach, significantly improves the recognition performance. In addition to the two presented studies, the novel LION dataset is published, consisting of excerpts from Astrid Lindgren's stenographic manuscripts. 

Place, publisher, year, edition, pages
Uppsala: Acta Universitatis Upsaliensis, 2023. p. 87
Series
Digital Comprehensive Summaries of Uppsala Dissertations from the Faculty of Science and Technology, ISSN 1651-6214 ; 2294
Series
Skrifter utgivna av Svenska barnboksinstitutet, ISSN 0347-5387 ; 166
Keywords
document image processing, handwritten text recognition, stenography, strikethrough
National Category
Computer graphics and computer vision
Research subject
Computerized Image Processing
Identifiers
urn:nbn:se:uu:diva-509138 (URN)978-91-513-1873-8 (ISBN)
Public defence
2023-10-04, Room 101121, Ångströmlaboratoriet, Lägerhyddsvägen 1, Uppsala, 09:15 (English)
Opponent
Supervisors
Available from: 2023-09-11 Created: 2023-08-16 Last updated: 2025-02-07

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textarXiv

Authority records

Heil, RaphaelaVats, EktaHast, Anders

Search in DiVA

By author/editor
Heil, RaphaelaVats, EktaHast, Anders
By organisation
Division of Visual Information and InteractionDepartment of ALM
Computer graphics and computer vision

Search outside of DiVA

GoogleGoogle Scholar

doi
isbn
urn-nbn

Altmetric score

doi
isbn
urn-nbn
Total: 147 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf