Googleology is Bad Science. Article (PDF Available) in Computational Linguistics 33(1) · March with Reads. You are here: Home / Programmer / Referencing Sketch Engine and bibliography / Googleology is bad science. Googleology is bad science. Last Words: Googleology is Bad Science. Anthology: J; Volume: Computational Linguistics, Volume 33, Number 1, March ; Author: Adam Kilgarriff.

Author: Brabar Shataxe
Country: Finland
Language: English (Spanish)
Genre: Sex
Published (Last): 14 January 2017
Pages: 445
PDF File Size: 9.57 Mb
ePub File Size: 3.96 Mb
ISBN: 479-8-64304-548-8
Downloads: 47070
Price: Free* [*Free Regsitration Required]
Uploader: Kajira

scidnce Skip to search form Skip to main content. As we discover, on ever more fronts, that language analysis and generation benefit from big data, so it becomes appealing to use the Web as a data source.

The question, then, is how. The low-entry-cost way to use the Web is via a commercial search engine. This paper has citations.

This paper has been referenced on Twitter 3 times over the past 90 days. From This Paper Figures, tables, and topics from this paper.


Googleology is Bad Science

Web search engine Search for additional papers on this topic. Topics Discussed in This Paper.

Web search engine Big data Workaround Information retrieval. Text corpus Part-of-speech tagging Experiment Programming paradigm. World Wide Web Spatial variability.

1 Googleology is bad science Adam Kilgarriff Lexical Computing Ltd Universities of Sussex, Leeds.

Citations Publications citing this paper. Showing of extracted citations. Constructing specialised corpora through analysing domain representativeness of websites Wilson WongWei LiuMohammed Bennamoun Language Resources and Evaluation Estimating search engine index size variability: Systematic Search and Evaluation of Five Domains. Citation Statistics Citations 0 20 40 ’09 ’12 ’15 ‘ Semantic Scholar estimates that this publication has citations based on the available data.

See our Googkeology for additional information. References Publications referenced by this paper. Showing of 8 references.

Googleology is bad science – Sketch Engine

Randomized Algorithms and NLP: Search Engine Statistics Beyond the n-Gram: Mining the Web for Synonyms: Syntactic Clustering of the Web Andrei Z. BroderSteven C. GlassmanMark S. ManasseGeoffrey Zweig Computer Networks By clicking accept or continuing to use the site, you agree to the terms outlined in our Privacy PolicyTerms of Serviceand Dataset License.