March 2007, Vol. 33, No. 1, Pages 147-151
Posted Online March 7, 2007.
(doi:10.1162/coli.2007.33.1.147)
© 2007 Massachusetts Institute of Technology
Googleology is Bad Science
Adam KilgarriffLexical Computing Ltd. and University of Sussex
In lieu of an abstract, this is the article's first page. See the link to the full PDF above.
Cited by
Alex Boulton. 2015. Applying data-driven learning to the web. Multiple Affordances of Language Corpora for Data-driven Learning267-296.
CrossRef Adam Kilgarriff,
Vít Baisa,
Jan Bušta,
Miloš Jakubíček,
Vojtěch Kovář,
Jan Michelfeit,
Pavel Rychlý,
Vít Suchomel. (2014) The Sketch Engine: ten years on.
Lexicography 17-36.
Online publication date: 1-Jul-2014.
CrossRef Claudia Leacock,
Martin Chodorow,
Michael Gamon,
Joel Tetreault. (2014) Automated Grammatical Error Detection for Language Learners, Second Edition.
Synthesis Lectures on Human Language Technologies 71-170.
Online publication date: 28-Feb-2014.
CrossRef Susan Nacey. (2014) MacArthur, F., Oncins-Martínez, J. L., Sánchez-García, M., & Piquer-Píriz, A. M. (Eds.). (2012).
Metaphor in Use: Context, Culture, and Communication.
Metaphor and the Social World 4:10.1075/msw.4.2299-306.
Online publication date: 1-Jan-2014.
CrossRef ZIQI ZHANG,
ANNA LISA GENTILE,
FABIO CIRAVEGNA. (2013) Recent advances in methods of lexical semantic relatedness – a survey.
Natural Language Engineering 19411-479.
Online publication date: 1-Oct-2013.
CrossRef Vera Sheinman,
Christiane Fellbaum,
Isaac Julien,
Peter Schulam,
Takenobu Tokunaga. (2013) Large, huge or gigantic? Identifying and encoding intensity relations among adjectives in WordNet.
Language Resources and Evaluation 47797-816.
Online publication date: 1-Sep-2013.
CrossRef Roland Schäfer,
Felix Bildhauer. (2013) Web Corpus Construction.
Synthesis Lectures on Human Language Technologies 61-145.
Online publication date: 19-Jul-2013.
CrossRef Shanmugapriya,
K. Latha. (2013) Measuring semantic similarity using web search engine.
International Conference on Advanced Nanomaterials & Emerging Engineering Technologies639-644.
CrossRef M. R. Sumalatha,
E. Pugazhendi,
D. J. Archana. (2013) Concept based document management in cloud storage.
2013 International Conference on Recent Trends in Information Technology (ICRTIT)90-96.
CrossRef Kate Wild,
Andrew Church,
Diana McCarthy,
Jacquelin Burgess. (2013) Quantifying lexical usage: vocabulary pertaining to ecosystems and the environment.
Corpora 853-79.
Online publication date: 1-May-2013.
CrossRef Gard B. Jenset,
Christer Johansson. (2013) Lexical Fillers Influence the Dative Alternation: Estimating Constructional Saliency Using Web Document Frequencies.
Journal of Quantitative Linguistics 2013-44.
Online publication date: 1-Feb-2013.
CrossRef M. Karthiga,
P. C. D. Kalaivaani,
S. Sankarananth. (2013) A semantic similarity approach based on web resources.
2013 International Conference on Information Communication and Embedded Systems (ICICES)226-231.
CrossRef Eduard Hovy,
Roberto Navigli,
Simone Paolo Ponzetto. (2013) Collaboratively built semi-structured content and Artificial Intelligence: The story so far.
Artificial Intelligence 1942-27.
Online publication date: 1-Jan-2013.
CrossRef Robert Albert Felty,
Adam Buchwald,
Thomas M. Gruenenfelder,
David B. Pisoni. (2013) Misperceptions of spoken words: Data from a random sample of American English words.
The Journal of the Acoustical Society of America 134572.
Online publication date: 1-Jan-2013.
CrossRef David Sánchez,
Jordi Castellà-Roca,
Alexandre Viejo. (2013) Knowledge-based scheme to create privacy-preserving but semantically-related queries for web search engines.
Information Sciences 21817-30.
Online publication date: 1-Jan-2013.
CrossRef Roberto Navigli,
Simone Paolo Ponzetto. (2012) BabelNet: The automatic construction, evaluation and application of a wide-coverage multilingual semantic network.
Artificial Intelligence 193217-250.
Online publication date: 1-Dec-2012.
CrossRef Aleksander Wawer,
Dominika Rogozinska. (2012) How Much Supervision? Corpus-Based Lexeme Sentiment Estimation.
2012 IEEE 12th International Conference on Data Mining Workshops724-730.
CrossRef U. N. Gopal. (2012) A methodology for E-Content preparation using semantic similarity between words.
2012 International Conference on Radar, Communication and Computing (ICRCC)235-238.
CrossRef William H. Fletcher. 2012. Corpus Analysis of the World Wide Web. The Encyclopedia of Applied Linguistics.
CrossRef Martijn Goudbeek,
Emiel Krahmer. (2012) Alignment in Interactive Reference Production: Content Planning, Modifier Ordering, and Referential Overspecification.
Topics in Cognitive Scienceno-no.
Online publication date: 1-Mar-2012.
CrossRef Stephen McDonald,
Colin Wren. (2012) Informative Brand Advertising and Pricing Strategies in Internet Markets with Heterogeneous Consumer Search.
International Journal of the Economics of Business 19103-117.
Online publication date: 1-Feb-2012.
CrossRef David Sánchez,
Antonio Moreno,
Luis Del Vasto-Terrientes. (2011) Learning relation axioms from text: An automatic Web-based approach.
Expert Systems with Applications.
Online publication date: 1-Nov-2011.
CrossRef Hélène Margerie. (2011) Grammaticalising constructions:
to death
as a peripheral degree modifier.
Folia Linguistica Historica 32115-147.
Online publication date: 1-Oct-2011.
CrossRef M. R. Scott,
Xiaohua Liu,
Ming Zhou. (2011) Towards a Specialized Search Engine for Language Learners [Point of View].
Proceedings of the IEEE 991462-1465.
Online publication date: 1-Sep-2011.
CrossRef Danushka Bollegala,
Yutaka Matsuo,
Mitsuru Ishizuka. (2011) A Web Search Engine-Based Approach to Measure Semantic Similarity between Words.
IEEE Transactions on Knowledge and Data Engineering 23977-990.
Online publication date: 1-Jul-2011.
CrossRef Maite Taboada,
Julian Brooke,
Milan Tofiloski,
Kimberly Voll,
Manfred Stede. (2011) Lexicon-Based Methods for Sentiment Analysis.
Computational Linguistics 37:2267-307.
Online publication date: 1-Jun-201126-May-2011.
Abstract | PDF (621 KB) | PDF Plus (557 KB) 
Paul Cook. (2011)
A Way with Words: Recent Advances in Lexical Theory and Analysis: A Festschrift for Patrick Hanks Gilles-Maurice de Schryver (editor) (Ghent University and University of the Western Cape)Kampala: Menha Publishers, 2010, vii+375 pp; ISBN 978-9970-10-101-6, €59.95.
Computational Linguistics 37:2403-406.
Online publication date: 1-Jun-201126-May-2011.
Citation | PDF (44 KB) | PDF Plus (45 KB) 
Duncan Hull,
Steve Pettifer,
Douglas Kell. 2011. Defrosting the Digital Library. Library and Information Science13-51.
CrossRef Wilson Wong,
Wei Liu,
Mohammed Bennamoun. (2011) Constructing specialised corpora through analysing domain representativeness of websites.
Language Resources and Evaluation.
Online publication date: 2-Mar-2011.
CrossRef Guoquan Sha. (2010) Using Google as a super corpus to drive written language learning: a comparison with the British National Corpus.
Computer Assisted Language Learning 23377-393.
Online publication date: 1-Dec-2010.
CrossRef Tian Tian,
James Geller,
Soon Ae Chun. (2010) Predicting Web Search Hit Counts.
2010 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology162-166.
CrossRef David Sánchez. (2010) A methodology to learn ontological attributes from the Web.
Data & Knowledge Engineering 69573-597.
Online publication date: 1-Jun-2010.
CrossRef Phil Maguire,
Edward J. Wisniewski,
Gert Storms. (2010) A corpus study of semantic patterns in compounding.
Corpus Linguistics and Linguistic Theory 649-73.
Online publication date: 1-May-2010.
CrossRef Zhi Quan Zhou,
ShuJia Zhang,
Markus Hagenbuchner,
T. H. Tse,
Fei-Ching Kuo,
T. Y. Chen. (2010) Automated functional testing of online search services.
Software Testing, Verification and Reliabilityn/a-n/a.
Online publication date: 1-Jan-2010.
CrossRef Phil Maguire,
Edward J. Wisniewski,
Gert Storms. (2010) A corpus study of semantic patterns in compounding.
Corpus Linguistics and Linguistic Theory 6.
Online publication date: 1-Jan-2010.
CrossRef Grazyna Chamielec,
Dawid Weiss. (2008) Modeling the frequency of phrasal verbs with search engines.
2008 International Multiconference on Computer Science and Information Technology381-388.
CrossRef Ted Pedersen. (2008) Empiricism Is Not a Matter of Faith.
Computational Linguistics 34:3465-470.
Online publication date: 1-Sep-200821-Aug-2008.
Citation | PDF (56 KB) | PDF Plus (94 KB) 
Takenobu Tokunaga,
Chu-Ren Huang,
Sophia Yat Mei Lee. (2008) Asian language resources: the state-of-the-art.
Language Resources and Evaluation 42109-116.
Online publication date: 1-May-2008.
CrossRef (2007) Rezensionen.
Zeitschrift für Sprachwissenschaft 26:10.1515/zfsw.2007.26.issue-2371-385.
Online publication date: 1-Nov-2007.
CrossRef Karen Spärck Jones. (2007) Computational Linguistics: What About the Linguistics?.
Computational Linguistics 33:3437-441.
Online publication date: 1-Sep-200717-Aug-2007.
Citation | PDF (46 KB) | PDF Plus (50 KB) 