Skip to main content

Utilizing the Relation Sets of Entity Pairs to Recognize the Organization Names in Chinese Short Text

Buy Article:

$107.14 + tax (Refund Policy)

This paper proposes a method of recognizing the organization names in Chinese short text by utilizing the relation set of entity pairs according to entity relation theory. A three-step strategy is involved in the main idea of this paper: (i) Extracting the relation sets of entity pairs by using the inherent structural characteristics of Wikipedia. (ii) Vectorizing the context of relation sets of entity pairs and constructing the vector space model. (iii) Finally recognizing the organization name by similarity comparison combining with the method of using the HMM to recognize named entity and the method of using the search engine to reconstruct corpus. Through the experiment, the recall in this method reaches 58.72% and F 1 reaches 67.48%, which increase by 13.83% and 9.28% respectively compared with the results got by using Hidden Markov Model (HMM). Proved by the experiment this method is effective.

Keywords: CHINESE SHORT TEXT; ENTITY; ENTITY PAIRS; NAMED ENTITY RECOGNITION; ORGANIZATION NAME; RELATION SETS

Document Type: Research Article

Publication date: 01 March 2012

More about this publication?
  • ADVANCED SCIENCE LETTERS is an international peer-reviewed journal with a very wide-ranging coverage, consolidates research activities in all areas of (1) Physical Sciences, (2) Biological Sciences, (3) Mathematical Sciences, (4) Engineering, (5) Computer and Information Sciences, and (6) Geosciences to publish original short communications, full research papers and timely brief (mini) reviews with authors photo and biography encompassing the basic and applied research and current developments in educational aspects of these scientific areas.
  • Editorial Board
  • Information for Authors
  • Subscribe to this Title
  • Ingenta Connect is not responsible for the content or availability of external websites
  • Access Key
  • Free content
  • Partial Free content
  • New content
  • Open access content
  • Partial Open access content
  • Subscribed content
  • Partial Subscribed content
  • Free trial content