GlossaNet: Parsing a web site as a corpus
Author: Fairon, C.
Source: Lingvisticae Investigationes, Volume 22, Number 2, 2000, pp. 327-340(14)
Publisher: John Benjamins Publishing Company
Abstract: GlossaNet is an automated system that monitors Web sites. On dates and at intervals selected by the user, GlossaNet downloads the Web site, converts it to an electronic corpus, and parses it using the INTEX programs (M. Silberztein 1993) and the linguistic resources of the LADL (electronic dictionaries and libraries of local grammars). Once the software has been set up, it automatically repeats the task at regular intervals (as the Web site is updated). Results, if any, are e-mailed to the user.
Document Type: Research article
Publication date: 2000-10-01
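Illustrative sketch: the workflow the abstract describes (convert a downloaded page to a plain-text corpus, search it, and report only when there are results) might look roughly like the Python below. This is not GlossaNet's implementation. The function names (`html_to_corpus`, `find_matches`, `build_report`, `run_once`) are hypothetical, and a plain regular expression stands in for the INTEX programs and LADL local grammars, which are far richer. Downloading, scheduling, and e-mailing are left out so the example stays self-contained.

```python
import re
from html.parser import HTMLParser


class _TextExtractor(HTMLParser):
    """Collects text nodes, skipping the content of script and style tags."""

    def __init__(self):
        super().__init__()
        self.parts = []
        self._skip = 0

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip:
            self._skip -= 1

    def handle_data(self, data):
        if not self._skip:
            self.parts.append(data)


def html_to_corpus(html):
    """Convert an HTML page to whitespace-normalized plain text."""
    parser = _TextExtractor()
    parser.feed(html)
    parser.close()
    return " ".join(" ".join(parser.parts).split())


def find_matches(corpus, pattern):
    """Assumption: a regex stands in for the INTEX/LADL parsing step."""
    return [m.group(0) for m in re.finditer(pattern, corpus)]


def build_report(url, matches):
    """Return the report text, or None when there is nothing to send."""
    if not matches:
        return None
    return "\n".join([f"Matches for {url}:"] + [f"- {m}" for m in matches])


def run_once(url, html, pattern):
    """One monitoring pass over an already-downloaded page."""
    return build_report(url, find_matches(html_to_corpus(html), pattern))
```

A scheduler would call `run_once` at the user's chosen interval and mail the returned text only when it is not `None`, mirroring the abstract's "Results, if any".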