Automatic term recognition based on statistics of compound nouns and their components
Abstract:In this paper, we propose a new approach to enhance automatic recognition systems for domain-specific terms. The approach is based on the statistics about the relation between a compound noun and its constituents that are simple nouns. More precisely, we focus on how many nouns adjoin the noun in question to form compound nouns. We propose several scoring methods based on this approach and experimentally evaluate them on the NTCIR1 TMREC test collection. The results are very promising, especially in low and high recall.
Document Type: Research Article
Publication date: 2003-01-01
More about this publication?
- International Journal of Theoretical and Applied Issues in Specialized Communication