Improving term extraction by combining different techniques

Authors: Vivaldi, J.1; Rodríguez, H.2

Source: Terminology, Volume 7, Number 1, 2001 , pp. 31-48(18)

Publisher: John Benjamins Publishing Company

Buy & download fulltext article:

OR

Price: $37.41 plus tax (Refund Policy)

Abstract:

Two different reasons suggest that combining the performance of several term extractors could lead to an improvement in overall system accuracy. On the one hand, there is no clear agreement on whether to follow statistical, linguistic or hybrid approaches for (semi-) automatic term extraction. On the other hand, combining different knowledge sources (e.g. classifiers) has proved successful in improving the performance of individual sources on several NLP tasks (some of them closely related to or involved in term extraction), such as context-sensitive spelling correction, part-of-speech tagging, word sense disambiguation, parsing, text classification and filtering, etc.

In this paper, we present a proposal for combining a number of different term extraction techniques in order to improve the accuracy of the resulting system. The approach has been applied to the domain of medicine for the Spanish language. A number of tests have been carried out with encouraging results.

Keywords: term extraction; semantic data; statistics; medicine.

Document Type: Research article

DOI: http://dx.doi.org/10.1075/term.7.1.04viv

Affiliations: 1: Universitat Pompeu Fabra 2: Universitat Politècnica de Catalunya

Publication date: 2001-12-01

More about this publication?
  • International Journal of Theoretical and Applied Issues in Specialized Communication
Related content

Tools

Key

Free Content
Free content
New Content
New content
Open Access Content
Open access content
Subscribed Content
Subscribed content
Free Trial Content
Free trial content

Text size:

A | A | A | A
Share this item with others: These icons link to social bookmarking sites where readers can share and discover new web pages. print icon Print this page