Hybrid Approaches for Automatic Segmentation and Annotation of a Chinese Text Corpus

Author: Zhiwei F.1

Source: International Journal of Corpus Linguistics, Volume 6, Special Issue, 2001 , pp. 35-42(8)

Publisher: John Benjamins Publishing Company

Key:
Free Content - Free Content
New Content - New Content
Subscribed Content - Subscribed Content
Free Trial Content - Free Trial Content

Abstract:

This paper describes the hybrid approaches for automatic segmentation and annotation of a Chinese text corpus. Some experiment results are given. Hybrid approaches combine the rule-based method, the statistic-based method, and the automatic learning method. It is a good approach, and it can obviously improve the precision of segmentation and annotation of a Chinese text corpus.

Keywords: segmentation; tagging; hybrid approach; rule-based approach; HMM (Hidden Markov Model); CLAWS (Constituent-Likelihood Automatic Word-tagging System) algorithm; TBED (Transform Based Error Driven); Brill method

Language: English

Document Type: Regular paper

DOI: 10.1075/ijcl.6.3.04fen

Affiliations: 1:

The full text electronic article is available for purchase. You will be able to download the full text electronic article after payment.

$37.87 plus tax      Refund Policy

 

OR

Back to top

Key:
Free Content - Free Content
New Content - New Content
Subscribed Content - Subscribed Content
Free Trial Content - Free Trial Content
Share this item with others: These icons link to social bookmarking sites where readers can share and discover new web pages.
Page Help Click here for Page Help
Shopping cart
Tools
Sign in






Need to register?
Sign up here
Text size: A | A | A | A