Logo: to the web site of Uppsala University

uu.sePublications from Uppsala University
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Word Spotting in Historical Handwritten Manuscripts using Capsule Networks
Uppsala University, Disciplinary Domain of Science and Technology, Mathematics and Computer Science, Department of Information Technology, Computerized Image Analysis and Human-Computer Interaction.
Uppsala University, Disciplinary Domain of Science and Technology, Mathematics and Computer Science, Department of Information Technology, Division of Visual Information and Interaction. Uppsala University, Disciplinary Domain of Science and Technology, Mathematics and Computer Science, Department of Information Technology, Computerized Image Analysis and Human-Computer Interaction.ORCID iD: 0000-0003-4480-3158
Uppsala University, Disciplinary Domain of Science and Technology, Mathematics and Computer Science, Department of Information Technology, Division of Visual Information and Interaction. Uppsala University, Disciplinary Domain of Science and Technology, Mathematics and Computer Science, Department of Information Technology, Computerized Image Analysis and Human-Computer Interaction.ORCID iD: 0000-0003-1054-2754
2018 (English)Conference paper, Poster (with or without abstract) (Other academic)
Abstract [en]

Word spotting is popularly used for digitisation and transcription of historical handwritten documents. Recently, deep learning based methods have dominated the current state-of-the-art in learning-based word spotting. However, deep learning architectures such as Convolutional Neural Networks (CNNs) require a large amount of training data, and suffer from translation invariance. Capsule Networks (CapsNet) have been recently introduced as a data-efficient alternative to CNNs. This work explores the applicability of CapsNets for segmentation-based word spotting, and is the first such effort in the Handwritten Text Recognition (HTR) community to the best of authors' knowledge. The effectiveness of CapsNets will be empirically evaluated on well-known historical handwritten datasets using standard evaluation measures. The impact of varying amounts of training data on the recognition performance will be investigated, along with a comparison with the state-of-the-art methods.

Place, publisher, year, edition, pages
2018.
National Category
Computer Vision and Robotics (Autonomous Systems)
Research subject
Computerized Image Processing
Identifiers
URN: urn:nbn:se:uu:diva-373514OAI: oai:DiVA.org:uu-373514DiVA, id: diva2:1278820
Conference
Bibliotheca Baltica Symposium
Available from: 2019-01-15 Created: 2019-01-15 Last updated: 2019-01-17

Open Access in DiVA

No full text in DiVA

Authority records

Vats, EktaHast, Anders

Search in DiVA

By author/editor
Heil, RaphaelaVats, EktaHast, Anders
By organisation
Computerized Image Analysis and Human-Computer InteractionDivision of Visual Information and Interaction
Computer Vision and Robotics (Autonomous Systems)

Search outside of DiVA

GoogleGoogle Scholar

urn-nbn

Altmetric score

urn-nbn
Total: 347 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf