The Google Search Appliance was set up with a static IP address on the production network at the University of Hawaii's Advanced Network Computing Lab. After initial setup, the testing was done remotely from the InfoWorld Test Center in San Francisco.
To test the Google Search Appliance, I chose two common enterprise environments: a public Web site consisting primarily of HTML documents and an intranet site with a mixture of file types, including Word, Excel, PDF, and HTML documents.
For the Internet site, I created a collection (Google's term for a selection of indexed documents) whose crawl started at the InfoWorld.com home page. I then created a subcollection consisting only of article pages, specified by a regular expression that limited searches to those pages.
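As a rough illustration of how a regular expression can carve a subcollection out of a larger crawl, here is a minimal Python sketch. The URL pattern, the sample URLs, and the function name `in_subcollection` are all hypothetical; the actual expression used in the test is not given here, and the appliance applies its patterns through its own admin interface rather than through code like this.

```python
import re

# Hypothetical pattern: assumes article pages live under /article/ on
# www.infoworld.com. The real expression used in the test is not known.
ARTICLE_PATTERN = re.compile(r"^https?://www\.infoworld\.com/article/.+")


def in_subcollection(url: str) -> bool:
    """Return True if the URL matches the assumed article-page pattern."""
    return ARTICLE_PATTERN.match(url) is not None


# Made-up sample URLs standing in for pages found by the crawler.
urls = [
    "http://www.infoworld.com/",
    "http://www.infoworld.com/article/03/01/10/example_1.html",
    "http://www.infoworld.com/about/contact.html",
]

# Only URLs matching the pattern would be searchable in the subcollection.
article_urls = [u for u in urls if in_subcollection(u)]
```

A search restricted to the subcollection would then see only the URLs in `article_urls`, while the parent collection would still cover everything the crawler reached.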
For the intranet site, I made a copy of the production InfoWorld intranet and used it as a collection. The Google Search Appliance crawler was scheduled to index the content at 10 p.m. nightly during the tests.