GoogleSearch is a library of simple R functions and scripts (Mazanec, 2008) to launch
automated Google queries for capturing occurrence and co-occurrence frequencies of keywords. Two
examples taken from a city tourism application are provided. In these examples city names denote
the rows and attributes define the columns of the co-occurrence matrix. A routine for non-disjunctive
hierarchical clustering (Peay, 1975) is also provided. See the Run... scripts for detailed working
steps. A second demo script tailored for the www.tripadvisor.com
site is also available (JM, 10/2008). It captures the frequencies reported under the 'What Our Users
Are Saying Results ...' heading. For small destinations and very specific attributes the limited search
domain of TripAdvisor may deliver zero occurrences or data too sparse for hierarchical clustering and/or
multidimensional scaling. Note that keywords must be in English and composite words are set between quotation marks
("New York", "peace and quiet").
One of the functions is based on Cilibrasi, R. L. and M. B. Vitányi (2007) The Google Similarity Distance, IEEE Transactions on Knowledge and Data Engineering, 19 (3), 370-383. For CLIP clustering see Peay, E. R. (1975) Nonmetric Grouping: Clusters and Cliques, Psychometrika, 40, 297-313, Mazanec, J. A. (1978) Strukturmodelle des Konsumverhaltens, Vienna: Orac, pp. 447-453, and Mazanec. J. A. (1997) International City Tourism, Analysis and Strategy, London & Washington: Pinter, pp. 240-242. The list of positive connotations in example #2 below is based on the System for Connotative Analysis of Discourse (US Patent of December 18, 2001) by Wayne O. Chase. Chapter 10 of Mazanec, J. A. and K. W. Woeber (2009), Analysing International City Tourism, Vienna: Springer, pp. 191-210 presents a fully elaborated case study for tourist cities. A study on receiving countries is forthcoming in the Journal of Travel Research (Mazanec, J. A., Tourism Receiving Countries in Connotative Google Space).
Requirements
You need the open source computing environment R which you may freely
download from http://cran.r-project.org/. Please read and accept our download conditions before! Download and unzip one of the zip files to a separate directory of your choice and make it
your R working directory. Then start R, edit the RunGoogleS or RunTripAdvS script and execute steps 1-5
sequentially. Submitting 58 connotative search items for 23 cities may take a while.
File
Version
OS
Location
Size
demo_6_4.zip contains the R scripts, CLIP50.exe and demo data for 6 cities and 4 arbitrary search items 1.2 (06/2010) Windows XP, Vista; R 2.7.0 or higher 64 KB demo_23_58.zip contains the R scripts, CLIP50.exe and data for 23 cities and 58 connotative search items 1.2 (06/2010) Windows XP, Vista; R 2.7.0 or higher 71 KB tripadv.zip contains the R scripts, Clip50.exe and demo data for 6 cities and 4 arbitrary search items for searching at www.tripadvisor.com 1.2 (08/2009) Windows XP, Vista; R 2.7.0 or higher 64 KB
GoogleSearch