Software for Business Studies

GoogleSearch

Description

GoogleSearch is a library of simple R functions and scripts (Mazanec, 2008) to launch automated Google queries for capturing occurrence and co-occurrence frequencies of keywords. Two examples taken from a city tourism application are provided. In these examples city names denote the rows and attributes define the columns of the co-occurrence matrix. A routine for non-disjunctive hierarchical clustering (Peay, 1975) is also provided. See the Run... scripts for detailed working steps.

A second demo script tailored for the www.tripadvisor.com site is also available (JM, 10/2008). It captures the frequencies reported under the 'What Our Users Are Saying Results ...'  heading. For small destinations and very specific attributes the limited search domain of TripAdvisor may deliver zero occurrences or data too sparse for hierarchical clustering and/or multidimensional scaling.

Note that keywords must be in English and composite words are set between quotation marks ("New York", "peace and quiet").

Literature

One of the functions is based on Cilibrasi, R. L. and M. B. Vitányi (2007) The Google Similarity Distance, IEEE Transactions on Knowledge and Data Engineering, 19 (3), 370-383.

For CLIP clustering see Peay, E. R. (1975) Nonmetric Grouping: Clusters and Cliques, Psychometrika, 40, 297-313, Mazanec, J. A. (1978) Strukturmodelle des Konsumverhaltens, Vienna: Orac, pp. 447-453, and Mazanec. J. A. (1997) International City Tourism, Analysis and Strategy, London & Washington: Pinter, pp. 240-242.

The list of positive connotations in example #2 below is based on the System for Connotative Analysis of Discourse (US Patent of December 18, 2001) by Wayne O. Chase.

Chapter 10 of Mazanec, J. A. and K. W. Woeber (2009), Analysing International City Tourism, Vienna: Springer, pp. 191-210 presents a fully elaborated case study for tourist cities.

A study on receiving countries is forthcoming in the Journal of Travel Research (Mazanec, J. A., Tourism Receiving Countries in Connotative Google Space).

Download

Requirements

You need the open source computing environment R which you may freely download from http://cran.r-project.org/.

Please read and accept our download conditions before!

Download and unzip one of the zip files to a separate directory of your choice and make it your R working directory. Then start R, edit the RunGoogleS or RunTripAdvS script and execute steps 1-5 sequentially. Submitting 58 connotative search items for 23 cities may take a while.

File

Version

OS

Location

Size

demo_6_4.zip contains the R scripts, CLIP50.exe and demo data for 6 cities and 4 arbitrary search items

1.2 (06/2010)

Windows XP, Vista;

R 2.7.0 or higher

HTTP

64 KB

demo_23_58.zip contains the R scripts, CLIP50.exe and data for 23 cities and 58 connotative search items

1.2 (06/2010)

Windows XP, Vista; 

R 2.7.0 or higher

HTTP

71 KB

tripadv.zip contains the R scripts, Clip50.exe and demo data for 6 cities and 4 arbitrary search items for searching at www.tripadvisor.com

1.2 (08/2009)

Windows XP, Vista;

R 2.7.0 or higher

HTTP

64 KB


Copyright 2010
| J. Mazanec and Institute for Tourism and Leisure Studies | 14.04.2010