Free Newsletters
Technology & Business Daily

InfoWorld
Log-in | Register

Update: AOL reportedly released search data

AOL's apparent release of details of users' Internet searches raises privacy concerns

By Jeremy Kirk, IDG News Service
August 07, 2006
 

AOL has apparently released details of Internet searches performed over a period of three months by hundreds of thousands of its subscribers, raising privacy concerns.

Free IT resource

Hear how top CIOs turn change into a competitive advantage.

Sponsored by HP

Free IT resource

TechNet: More ways to know it, share it, and keep it running.

Sponsored by Microsoft

The data, apparently made available for research purposes, is no longer available at the Web site http://research.aol.com, but details of the data were cited by technology blog site Techcrunch, and the page linking to it was cached by Google's search engine.

The cached copy of the page said the data comprised about 19 million Web searches performed by 658,000 users from March through May. The page warned of sexually explicit language in some of the queries, and said of the data, "This collection is distributed for noncommercial research use only." The page contained a link to a compressed copy of the data archive.

The page asked researchers using the data to cite a research paper entitled "A Picture of Search" based on the data, which names two AOL employees as co-authors. That paper is still available for download here.

AOL officials in London are aware of the issue, they said Monday morning. They had no further comment, and referred queries to the company's U.S. headquarters. Reached in the U.S., company officials did not have an immediate comment.

The release of such information poses serious privacy concerns. Major search engine companies fought a request for similar data on user searches last year by the U.S. Department of Justice.

The U.S. government wanted to use the data to check the effectiveness of a federal law aimed at minors' access to harmful material. In January it filed a motion with the court to compel Google to comply with its subpoena and turn over a "random sample" of 1 million Web site addresses found in its search engine index.

It also asked the company the text of all queries filed on the search engine during a specific week. America Online, Yahoo, and Microsoft's MSN were also subpoenaed, and complied to varying degrees.

The alleged release of AOL's data has sparked concern over how it might be used after its widespread release. While the original page is gone, the data has since been made available on several other Web sites.

The data is valuable from a market research perspective, said David Bradshaw, principal analyst at Ovum. Normally, similar kinds of data sets are only released to trusted researchers, not the general public, he said.

Even then, the resulting research is released as a batch of aggregated statistics, masking signs of individual users' behavior, he said.

"I do think this was foolhardy at best and a complete disaster or worse for AOL," Bradshaw said. "If I were an AOL user, I'd be up in arms."

The researchers who used the data wrote in an introduction that user IDs were replaced with an anonymous number. However, observers are expressing concern about whether users could be tracked based on their queries.

The data also contains the time when a particular query was executed. If a user clicked on a result, the rank of the item was recorded, along with the domain portion of the URL (uniform resource locator).

The release of the AOL data prompted numerous comments on blog entries dedicated to the issue.

Ben Noble of Aberystwyth, Wales, wrote in a blog posting that the data is anonymous enough that "there's still an amount of deniability, but it's appalling that anyone should be put in the position of having to deny anything."

Noble wrote that AOL could possess a file linking anonymous users with their real ID and their searches.

The data's public release may violate AOL's privacy policy, said Sean McManus, contacted after posting a comment on the issue.

McManus, who said he does not use AOL as an ISP (Internet service provider), examined AOL's privacy policy after finding it through a Google search.

"I think the big issue is whether the data should be available at all," said McManus. "Users have a reasonable expectation of privacy when they use the Internet, particularly since they use the Internet on the condition of AOL's own privacy policy."





 

TOP NEWS:


»  Troubleshooting tool for Java offered
Sun's Java VisualVM open-source technology views apps while they run on a JVM and is billed as all-in-one solution

»  Python backing eyed for NetBeans
Scripting language capabilities of the open-source IDE continue to expand

»  Microsoft sets Windows XP SP3 automatic download for Thursday
The latest service pack for Windows XP will be pushed to Automatic Update at 7a.m. EDT on July 10

»  Real Software, Veryant bolster dev tools
RealBasic, Cobol apps platforms get improvements

»  Microsoft sets hosted-services pricing, irks partners
By offering 38 percent discount to customers who buy entire hosted business productivity suite, Microsoft undercuts partners selling similar services

»  Adobe readying new mashup tool for business users
Mashup interface code-named 'Genesis' will open up desktop 'workspace' combining business application data, documents, analytics, and instant messaging




Solutions to the Toughest IT Challenges in Remote Offices
Though small in size, remote offices face many of the same IT challenges as larger central offices. This Webcast zeroes in on the top line challenges to deliver information that can provide immediate benefits to your business. Sponsor: AMD and Dell

»  Click here to view this Webcast
  Zombie PCs Are Attacking Your LAN
A recent study showed that malware-infected zombie PCs are now a bigger threat to ISPs and Web infrastructure than DoS attacks. As this brand new IT Strategy Guide explains, an increased use of peer-to-peer techniques by the attackers has made it harder to fight back. Download now, compliments of Verio:

»  Click here to download now

- Special Advertising Partners -
WHITE PAPERS
 

» Technology White Papers Library

Technology White Papers by Topic

Technology White Papers E-mail Alert

Find out when the latest white paper is available:
 
 
INFOWORLD MARKETPLACE
 
» BUY A LINK NOW
 

FIND PRODUCTS AND COMPANIES
» COMPLETE PRODUCT GUIDE



TECHNOLOGY INDEX
• Applications
• Application Development
• Security
• Networking
• Wireless
• Platforms
• Hardware
• Data Management
• Storage
• Web Services
• Business
• Telecom
• Professional Services
• Standards

TECH WATCH 


What's the 411 on GOOG-411?
Just as Google has become synonymous with "performing a Web search," 411 is understood to mean "information" -- as in "what's the 411?" I was thus surprised to discover, from a billboard, no less, that the king of search is taking on the ...

Apple HTML source reveals 'iPhone Extreme'
"This one's a stretch..." reports AppleInsider. Um, yeah. Reporting on HTML code sightings of product names could be called a stretch, but iPhone Extreme has a ring to it. Now, that sounds like the product Apple should have released first, rather ...

COLUMNISTS

Unified under law
Ephraim Schwartz's Column and Blog (InfoWorld) - In the litigious world we live in, deploying a unified communications platform in your enterprise could...
» MORE COLUMNISTS

MORE INFOWORLD BLOGS


Open Sources 
Product Management
When I joined MySQL four years ago, there was quite a lot of debate about product management. We didn't actually have ...

Zero Day 
Botnet herders tending smaller flocks
New research backs up the theory that botnet operators are keeping their networks smaller in a continued effort to keep ...



• Advice Line
• Database Underground
• The Deep End
• Enterprise Mac
• Geeks in Paradise
• Grid Meter
• The Gripe Line
• InfoWorld Daily
• Inside IT
• IT Troubleshooter
• ITXtreme
• Open Sources
• ProdBlog
• Real World SOA
• Reality Check
• Security Adviser
• SMB IT
• The Storage Network
• Tech Watch
• Virtualization Report
• Zero Day

ADVERTISEMENT


RESOURCE CENTERadvertisement 

GOVERNMENT IT & POLICY
'If you don't go after the network, you're never going to stop these guys. Never.'
From the State Department, All the News for Inquiring Minds
TechPresident, the Internet Citizenry's New Consensus Taker



Sponsored Technology Links

 
 
 HOME  NEWS  BLOGS  PODCASTS  VIDEOS  TECHNOLOGIES  TEST CENTER  EVENTS  CAREERS   About | Advertise | Awards | RSS | Contact Us 

Copyright © 2008, Reprints, Permissions, Licensing, IDG Network, Privacy Policy, Terms of Service.
All Rights reserved. InfoWorld is a leading publisher of technology information and product reviews on topics including viruses,
phishing, worms, firewalls, security, servers, storage, networking, wireless, databases, and web services.

CIO :: ComputerWorld :: CSO :: Demo :: GamePro :: Games.net :: IDG Connect :: IDG World Expo
Industry Standard :: IT World :: JavaWorld :: LinuxWorld :: MacUser :: Macworld :: Network World :: PC World :: Playlist