Where are the Search Engines for Handwritten Documents?

Authors: van der Zant, Tijn1; Schomaker, Lambert1; Zinger, Svitlana2; van Schie, Henny3

Source: Interdisciplinary Science Reviews, Volume 34, Numbers 2-3, September 2009 , pp. 224-235(12)

Publisher: Maney Publishing

Buy & download fulltext article:

OR

Price: $39.00 plus tax (Refund Policy)

Abstract:

Although the problems of optical character recognition for contemporary printed text have been resolved, for historical printed and handwritten connected cursive text (i.e. western style writing), they have not. This does not mean that scanning historical documents is not useful. This article describes our research on retrieving digitized handwritten documents containing the information that the user is looking for. This task is essential for optimizing the archive's work. We investigated how to process historical documents and their transcriptions, so that a super computer could learn how to read. We applied artificial intelligence techniques to a large amount of image data and created a search engine. Our methods often require that the computer learns in interaction with a human. We have studied the requests of archive users in order to bring our research as close as possible to the current information needs. Our system learns continuously, allowing the constant improvement of search results. User requests stimulated us to delve into an unsolved topic: to search for the most elusive knowledge in text, namely the names of people and places. The solutions are described in this article.

Keywords: CULTURAL HERITAGE; HANDWRITTEN TEXT; PATTERN RECOGNITION; E-SCIENCE; SEARCH ENGINE

Document Type: Research Article

DOI: http://dx.doi.org/10.1179/174327909X441126

Affiliations: 1: AI Dept., University of Groningen, The Netherlands 2: VCA-SPS Dept., Technical University of Eindhoven, The Netherlands 3: National Archive, The Hague, The Netherlands

Publication date: 2009-09-01

More about this publication?
Related content

Tools

Key

Free Content
Free content
New Content
New content
Open Access Content
Open access content
Subscribed Content
Subscribed content
Free Trial Content
Free trial content

Text size:

A | A | A | A
Share this item with others: These icons link to social bookmarking sites where readers can share and discover new web pages. print icon Print this page