GlossaNet: Parsing a web site as a corpus

Author: Fairon, C.

Source: Lingvisticae Investigationes, Volume 22, Number 2, 2000 , pp. 327-340(14)

Publisher: John Benjamins Publishing Company

Buy & download fulltext article:

OR

Price: $36.33 plus tax (Refund Policy)

Abstract:

GlossaNet is an automated system that monitors Web sites. On dates and at intervals selected by the user, GlossaNet downloads the Web site, converts it to an electronic corpus and uses the intex programs (M. Silberztein 1993) and the linguistic resources of the ladl (electronic dictionaries and libraries of local grammars) to parse it. Once the software has been set up, it automatically repeats the task at regular periods of time (as the Web site is updated). Results, if any, are e-mailed to the user.

Document Type: Research article

DOI: http://dx.doi.org/10.1075/li.22.20fai

Affiliations: 1: Laboratoire d'Automatique Documentaire et Linguistique UMR N°7546 du CNRS, Université Paris 7

Publication date: 2000-10-01

Related content

Tools

Key

Free Content
Free content
New Content
New content
Open Access Content
Open access content
Subscribed Content
Subscribed content
Free Trial Content
Free trial content

Text size:

A | A | A | A
Share this item with others: These icons link to social bookmarking sites where readers can share and discover new web pages. print icon Print this page