Wikipedia founder Jimmy Wales is targeting the fourth quarter of this year for the unveiling of an open-source search engine that he hopes could challenge the dominance of market leaders Google Inc. and Yahoo Inc.
The project is being run through Wikia Inc., a for-profit company founded by Wales that seeks to apply a model similar to that of Wikipedia, the community-written and community-edited encyclopedia. He hopes to provide the tools and technology to allow programmers across the Internet to collaborate on the development and testing of a search engine and make the results freely available.
"The essential core principles are that I think search is now a fundamental part of the infrastructure of the Internet and it's really fundamental to society as a whole and therefore as citizens of the world we should be concerned about it being a secretive black box," he said.
Efforts to create open search engines aren't new, but one stumbling block they face is the difficulty of running large-scale tests of the search algorithm, said Wales. The algorithm is the code that sits at the heart of the search engine and is responsible for its accuracy or lack thereof.
"To create a full-scale crawling spider of the Web actually requires a great deal of investment in hardware," he said. Wikia is planning to provide resources to enable full-scale crawling of the World Wide Web so the software can be fully tested and tuned.
The project is still in the planning stages, and Wales expects that the first test version, due this year, will help programmers spot bugs that occur with real-world usage and speed up the development process.
"Probably what we'll do is launch something in the fourth quarter of this year with a really big warning 'It sucks, we know it sucks, it's experimental, don't panic. This is just an experiment to show what could be and now we're going to start working to see how we could make it better'," he said.
The project is already attracting attention, not just from engineers who want to lend a hand but also from companies that offer search engines of their own.
"We are getting a lot of interest from second-tier search players who are really interested in some of the alternatives that might be available. If you're not one of the top three or four you've got to really wonder how could you ever catch up with Google and their billions of dollars. This provides kind of a level playing field where lots of people can contribute," said Wales.
Wales cites a story he heard about research done on search results. In the study, users were presented with results from Google and Yahoo but with the brand names switched, so the Google name was above the Yahoo results and vice versa. In most cases, users picked the Google-branded results as better, he said.
"To me this shows a great vulnerability if Google is only competing on brand image," said Wales. "If good quality search results are becoming a commodity, if the problem of search is in some sense solved then if I can make that free it really changes the structure of competition on the Internet."
