Automatic term recognition based on statistics of compound nouns and their components
In this paper, we propose a new approach to enhance automatic recognition systems for domain-specific terms. The approach is based on the statistics about the relation between a compound noun and its constituents that are simple nouns. More precisely, we focus on how many nouns adjoin the noun in question to form compound nouns. We propose several scoring methods based on this approach and experimentally evaluate them on the NTCIR1 TMREC test collection. The results are very promising, especially in low and high recall.
No Reference information available - sign in for access.
No Citation information available - sign in for access.
No Supplementary Data.