Activate Activate Activate
contact  
Hello. Sign in to personalize your visit. New user? Register now.  

In
By author

Quarterly (March, June, September, December)
160 pp. per issue
6 3/4 x 10
Founded: 1974
ISSN 0891-2017
E-ISSN 1530-9312
2008 ISI Impact Factor: 2.656

Computational Linguistics

September 2003, Vol. 29, No. 3, Pages 333-347
Posted Online March 13, 2006.
(doi:10.1162/089120103322711569)
© 2003 Association for Computational Linguistics
Introduction to the Special Issue on the Web as Corpus

Adam Kilgarriff

Lexicography MasterClass Ltd. and ITRI University of Brighton, Lewes Rd, Brighton, BN2 4JG, UK.

Gregory Grefenstette

Clairvoyance Corporation, Suite 700, 5001 Baum Blvd, Pittsburgh, PA 15213-1854.

PDF (94.653 KB) PDF Plus (96.929 KB)

The Web, teeming as it is with language data, of all manner of varieties and languages, in vast quantity and freely available, is a fabulous linguists' playground. This special issue of Computational Linguistics explores ways in which this dream is being explored.

Cited by

D. Yu. Turdakov, S. D. Kuznetsov. (2010) Automatic word sense disambiguation based on document networks. Programming and Computer Software 36:1, 11-18
Online publication date: 1-Jan-2010.
CrossRef
Shaoqun Wu, Ian H. Witten, Margaret Franken. (2010) Utilizing lexical data from a Web-derived corpus to expand productive collocation knowledge. ReCALL 22:01, 83
Online publication date: 1-Jan-2010.
CrossRef
David Sánchez, David Isern. (2009) Automatic extraction of acronym definitions from the Web. Applied Intelligence
Online publication date: 30-Sep-2009.
CrossRef
Marco Baroni, Silvia Bernardini, Adriano Ferraresi, Eros Zanchetta. (2009) The WaCky wide web: a collection of very large linguistically processed web-crawled corpora. Language Resources and Evaluation 43:3, 209-226
Online publication date: 1-Sep-2009.
CrossRef
Rafael Guzmán-Cabrera, Manuel Montes-y-Gómez, Paolo Rosso, Luis Villaseñor-Pineda. (2009) Using the Web as corpus for self-training text categorization. Information Retrieval 12:3, 400-415
Online publication date: 1-Jun-2009.
CrossRef
Takenobu Tokunaga, Chu-Ren Huang, Sophia Yat Mei Lee. (2008) Asian language resources: the state-of-the-art. Language Resources and Evaluation 42:2, 109-116
Online publication date: 1-May-2008.
CrossRef
Asif Ekbal, Sivaji Bandyopadhyay. (2008) A web-based Bengali news corpus for named entity recognition. Language Resources and Evaluation 42:2, 173-182
Online publication date: 1-May-2008.
CrossRef
JOHN R. TAYLOR, KAM-YIU S. PANG. (2008) Seeing as though. English Language and Linguistics 12:01,
Online publication date: 1-Mar-2008.
CrossRef
Jae-Hoon Kim, Sung-Il Yang. (2007) Automatically Extracting Unknown Translations Using Phrase Alignment. The KIPS Transactions:PartB 14B:3, 231-240
Online publication date: 30-Jun-2007.
CrossRef
Adam Kilgarriff, Michael Rundell, Elaine Uí Dhonnchadha. (2007) Efficient corpus development for lexicography: building the New Corpus for Ireland. Language Resources and Evaluation 40:2, 127-152
Online publication date: 20-Feb-2007.
CrossRef
Vernor Vinge. (2006) 2020 Computing: The creativity machine. Nature 440:7083, 411-411
Online publication date: 20-Mar-2006.
CrossRef
Adam Kilgarriff. (2005) Language is never, ever, ever, random. Corpus Linguistics and Lingustic Theory 1:2, 263-276
Online publication date: 1-Nov-2005.
CrossRef
Technology Partner - Atypon Systems, Inc.
  CrossRef member COUNTER member