Skip to main content
padlock icon - secure page this page is secure

A Neural Network Based Color Document Segmentation

Buy Article:

$17.00 + tax (Refund Policy)

Document segmentation is defined as distinguishing different parts of the document image based on contents. In this paper, the document image is segmented into texts, pictures, and background. The algorithm we proposed includes background removal, block segmentation, feature extraction, and recognition. In background removal, we use local thresholds to extract foreground of the image. In block segmentation, run-length smoothing algorithm and connected component analysis are applied to divide the document image into a set of regions. And then, the features including image features and geometry features from the regions are extracted. Finally, these features are fed into the classifier which is a three-layer back-propagation neural network. The output of the neural network is the result of the recognition: texts or pictures. Through the experiments, we know that most document images with simple backgrounds can be segmented well by the method we proposed. Therefore, there are several advantages in our document segmentation system. 1. Localized thresholds to distinguish foreground from background based on color concepts. 2. Able to discriminate texts from pictures by extraction of good features. 3. Use a trainable neural network as the classifier where the structure can be adjusted flexibly. 4. Precise segmentation since the classifier is trained by mass of document images.
No Reference information available - sign in for access.
No Citation information available - sign in for access.
No Supplementary Data.
No Article Media
No Metrics

Document Type: Research Article

Publication date: January 1, 2003

More about this publication?
  • For more than 30 years, IS&T's series of digital printing conferences have been the leading forum for discussion of advances and new directions in 2D and 3D printing technologies. A comprehensive, industry-wide conference that brings together industry and academia, this meeting includes all aspects of the hardware, materials, software, images, and applications associated with digital printing systems?particularly those involved with additive manufacturing and fabrication?including bio-printing, printed electronics, page-wide, drop-on-demand, desktop and continuous ink jet, toner-based systems, and production digital printing, as well as the engineering capability, optimization, and science involved in these fields. In 2016, the conference changed its name formally to Printing for Fabrication to better reflect the content of the meeting and the evolving technology of printing.

    Please note: For purposes of its Digital Library content, IS&T defines Open Access as papers that will be downloadable in their entirety for free in perpetuity. Copyright restrictions on papers vary; see individual paper for details.

  • Information for Authors
  • Submit a Paper
  • Subscribe to this Title
  • Membership Information
  • Terms & Conditions
  • Ingenta Connect is not responsible for the content or availability of external websites
  • Access Key
  • Free content
  • Partial Free content
  • New content
  • Open access content
  • Partial Open access content
  • Subscribed content
  • Partial Subscribed content
  • Free trial content
Cookie Policy
X
Cookie Policy
Ingenta Connect website makes use of cookies so as to keep track of data that you have filled in. I am Happy with this Find out more