An Analysis of Verb Subcategorization Frames in Three Special Language Corpora with a View towards Automatic Term Recognition

Authors: Eumeridou E.1; Nkwenti-Azeh B.2; McNaught J.3

Source: Computers and the Humanities, Volume 38, Number 1, February 2004 , pp. 37-60(24)

Publisher: Springer

Buy & download fulltext article:

OR

Price: $47.00 plus tax (Refund Policy)

Abstract:

Current term recognition algorithms have centred mostly on the notion of term based on the assumption that terms are monoreferential and as such independent of context. The characteristics and behaviour of terms in real texts are however far removed from this ideal because factors such as text type or communicative situation greatly influence the linguistic realisation of a concept. Context, therefore, is important for the correct identification of terms (Dubuc and Lauriston, 1997). Based on this assumption, we have shifted our emphasis from terms towards surrounding linguistic context, namely verbs, as verbs are considered the central elements in the sentence. More specifically, we have set out to examine whether verbs and verbal syntax in particular, could help us towards the task of automatic term recognition. Our findings suggest that term occurrence varies significantly in different argument structures and different syntactic positions. Additionally, deviant grammatical structures have proved rich environments for terms. The analysis was carried out in three different specialised subcorpora in order to explore how the effectiveness of verbal syntax as a potential indicator of term occurrence can be constrained by factors such as subject matter and text type.

Keywords: automatic term recognition; special languages; special language subcorpora; terms; term extraction; verb subcategorisation patterns

Document Type: Research article

DOI: http://dx.doi.org/10.1023/B:CHUM.0000009278.73498.f4

Affiliations: 1: Department of Information and Communication Systems, University of the Aegean, Karlovassi, Samos, Greece, Email: evmoir@icsd.aegean.gr 2: Centre for Computational Linguistics, UMIST, P.O. Box 88, Sackville Street, Manchester M60 1QD, UK, Email: blaise@ccl.umist.ac.uk 3: Department of Computation, UMIST, P.O. Box 88, Sackville Street, Manchester M60 1QD, UK, Email: Jock@co.umist.ac.uk

Publication date: 2004-02-01

Related content

Key

Free Content
Free content
New Content
New content
Open Access Content
Open access content
Subscribed Content
Subscribed content
Free Trial Content
Free trial content

Text size:

A | A | A | A
Share this item with others: These icons link to social bookmarking sites where readers can share and discover new web pages. print icon Print this page