The accidental corpus: some issues in extracting linguistic information from the Web
Authors: Renouf, Antoinette; Kehoe, Andrew; Mezquiriz, David
Source: Language and Computers, Advances in Corpus Linguistics. Papers from the 23rd International Conference on English Language Research on Computerized Corpora (ICAME 23) Göteborg 22-26 May 2002. Edited by Karin Aijmer and Bengt Altenberg , pp. 403-419(17)
Publisher: Rodopi
Abstract:
The Web is a text store which can potentially supplement traditional corpora as a source of up-to-date linguistic data. The WebCorp project investigates this potential, and in its second year tackles some residual problems inherent in the nature of Web text, thereby refining its retrieval and analysis tool for the facilitation of corpus linguistic study.Document Type: Research article
Publication date: 2004-04-01
- Subscribe to this Title
- ingentaconnect is not responsible for the content or availability of external websites
- In this: publication
- By this: publisher
- In this Subject: Computer Science , Language & Linguistics
- By this author: Renouf, Antoinette ; Kehoe, Andrew ; Mezquiriz, David

Shopping cart
Receive new issue alert
Get Permissions