Skip to main content

Content-based Document Enhancement and Resizing

Buy Article:

$12.00 plus tax (Refund Policy)

Recent advances in information and communications technologies have increased the need for automated reading and processing of documents. Most of today's documents contain not only text and background, but also graphics, tables, and images. Common image enhancement and interpolation methods apply an interpolation or enhancement function indiscriminately to the whole image. The resulting document image suffers from objectionable moiré patterns, edge blurring and aliasing. Therefore, scanned documents must often be segmented before other document processing techniques, such as filtering, resizing, and compression can be applied. In this paper, we present a new system to segment and label document images into text, halftone images, and background using feature extraction and unsupervised clustering. Once the segmentation is performed, a specific enhancement or interpolation kernel can be applied to each document component. Each pixel is assigned a feature pattern consisting of a scaled family of differential geometrical invariant features and texture features extracted from the co-occurrence matrix. The invariant feature pattern is then assigned to a specific region using a two-stage neural network system. The first stage is a self-organizing principal components analysis (SOPCA) network that is used to project the feature vector onto its leading principal axes found by using principal components analysis (PCA). The next step is to cluster the output of the SOPCA network into different regions. This is accomplished using a self-organizing feature-map (SOFM) network. In this paper, we demonstrate the power of the SOPCA-SOFM approach to segment document images into text, halftone, and background. The proposed filtering and interpolation method results in a noticeable improvement in the enhanced image.
No Reference information available - sign in for access.
No Citation information available - sign in for access.
No Supplementary Data.
No Article Media
No Metrics

Document Type: Research Article

Publication date: 01 January 2000

More about this publication?
  • For more than 30 years, IS&T's series of digital printing conferences have been the leading forum for discussion of advances and new directions in 2D and 3D printing technologies. A comprehensive, industry-wide conference that brings together industry and academia, this meeting includes all aspects of the hardware, materials, software, images, and applications associated with digital printing systems?particularly those involved with additive manufacturing and fabrication?including bio-printing, printed electronics, page-wide, drop-on-demand, desktop and continuous ink jet, toner-based systems, and production digital printing, as well as the engineering capability, optimization, and science involved in these fields. In 2016, the conference changed its name formally to Printing for Fabrication to better reflect the content of the meeting and the evolving technology of printing.

    Please note: For purposes of its Digital Library content, IS&T defines Open Access as papers that will be downloadable in their entirety for free in perpetuity. Copyright restrictions on papers vary; see individual paper for details.

  • Information for Authors
  • Submit a Paper
  • Subscribe to this Title
  • Membership Information
  • Terms & Conditions
  • Ingenta Connect is not responsible for the content or availability of external websites
  • Access Key
  • Free content
  • Partial Free content
  • New content
  • Open access content
  • Partial Open access content
  • Subscribed content
  • Partial Subscribed content
  • Free trial content
Cookie Policy
X
Cookie Policy
Ingenta Connect website makes use of cookies so as to keep track of data that you have filled in. I am Happy with this Find out more