InfoWorld
Study documents data boom

Data storage has doubled during last three years

By Grant Gross, IDG News Service
October 28, 2003
 

WASHINGTON - If you're feeling overwhelmed by information overload, you may not be alone. The amount of new information stored on various media such as hard drives has doubled in the past three years, to five exabytes of new information produced in 2002, according to a study released Tuesday by the University of California, Berkeley.

That's exabytes, as in a 1 followed by 18 zeros' worth of bytes, six zeros more than a terabyte. The five exabytes put into storage in 2002 equal the contents of half a million new libraries, each containing a digitized version of the print collection of the entire U.S. Library of Congress, according to the study by professors Peter Lyman and Hal Varian of the UC Berkeley School of Information Management and Systems. The professors estimated that between two and three exabytes of information was generated in 1999.
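To make the scale concrete, here is a minimal Python sketch of the unit arithmetic, assuming decimal (SI) units as the article uses them. The variable names are illustrative, and the per-library size is only what the article's two figures imply, not a number the article states.

```python
# Decimal (SI) byte units, matching the article's "18 zeros" description.
TERABYTE = 10**12
EXABYTE = 10**18

stored_2002 = 5 * EXABYTE   # new information stored in 2002 (from the article)
libraries = 500_000         # "half a million new libraries" (from the article)

# Size of one digitized Library of Congress print collection,
# as implied by dividing the two figures above.
per_library = stored_2002 // libraries

print(EXABYTE // TERABYTE)       # terabytes in one exabyte
print(per_library // TERABYTE)   # implied terabytes per library
```

Under these assumptions, an exabyte is a million terabytes, and each library-sized collection works out to about 10 terabytes.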

Most of that data -- 92 percent of it -- was stored on magnetic media, primarily hard drives, the study estimates.

The study, a follow-up to a 2000 study by UC Berkeley, doesn't dwell on how people and companies process these massive amounts of information coming at them, Lyman said, but his next goal is to produce a study examining that very issue. "I'm going to spend the next year on the consumption of information," he said. "How do people make sense of this? How do they cope?"

The current study doesn't address the quality of information and how people choose good information sources, he added. Significant differences exist in the "accessibility and usability and trustworthiness" of information between various sources, Lyman noted. "We treated it all the same, simply to understand how much there was ... but when you get into consumption, the discrimination over the quality of information, and how you make that decision, really becomes important," he added.

With the amount of stored information growing at a rate of about 30 percent a year, a "real change in our human ecology" is taking place, said Lyman, who presented the study at a conference in Florida Tuesday. "Everything is public," he said. "Everything is on the record."
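As a rough check on that growth rate, the sketch below compares the article's figures (two to three exabytes in 1999, five exabytes in 2002) with a 30 percent annual rate. It assumes steady compound growth, which the study may not; the function names are illustrative.

```python
import math

def annual_rate(start, end, years):
    """Constant yearly growth rate that takes `start` to `end` in `years`."""
    return (end / start) ** (1 / years) - 1

def doubling_time(rate):
    """Years needed to double at a constant yearly growth rate."""
    return math.log(2) / math.log(1 + rate)

# Figures from the article, in exabytes: 2 to 3 EB in 1999, 5 EB in 2002.
low = annual_rate(3, 5, 3)    # if 1999 output was 3 EB
high = annual_rate(2, 5, 3)   # if 1999 output was 2 EB

print(f"implied annual growth: {low:.1%} to {high:.1%}")
print(f"doubling time at 30% a year: {doubling_time(0.30):.1f} years")
```

Under that assumption, the implied rate falls roughly between 19 and 36 percent a year, so a figure of about 30 percent is consistent with the amount doubling in a little under three years.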

One problem with all this information being stored is that it's not always accurate, he added. As information passes through multiple hands, it can be condensed or mischaracterized. So commentaries or reports on a speech or a paper Lyman gave 20 years ago sometimes contain distortions, he said.

"There are multiple renditions, only one of which I remember," he added.

The study underscores the need for companies to manage their information intelligently, said Gil Press, director of corporate information at EMC Corp., an information storage vendor and a sponsor of the study. But IT solutions aren't the only answer, because humans still need to look at information with a critical eye, he added.

"We are getting swamped, and we need better ways to organize and manage information," Press said. "Hopefully, information technology will never replace smart thinking and the human analytical thinking."

Stored information is only part of what is being produced. Electronic channels -- including TV, radio, the telephone and the Internet -- produced three and a half times as much information as was stored in 2002. Most of that information was exchanged through voice telephone calls and not recorded or stored, Lyman said. The telephone accounts for the largest share of information flow -- 17.3 exabytes if stored in digital form -- followed by e-mail, which generates about 400,000 terabytes of new information each year, the study's authors said.
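The flow figures mix exabytes and terabytes, so a short Python sketch may help put them on one scale. It takes the article's rounded numbers at face value (including "three and a half times"), so the derived totals are approximations, not figures from the study; the variable names are illustrative.

```python
from fractions import Fraction

TERABYTE = 10**12
EXABYTE = 10**18

stored = 5 * EXABYTE                      # stored in 2002 (from the article)
flow_total = Fraction(7, 2) * stored      # "three and a half times" as much
telephone = Fraction(173, 10) * EXABYTE   # 17.3 exabytes (from the article)
email = 400_000 * TERABYTE                # 400,000 terabytes (from the article)

print(float(flow_total / EXABYTE))   # approximate total flow, in exabytes
print(email / EXABYTE)               # e-mail, in exabytes
print(f"{float(telephone / flow_total):.1%}")  # telephone share of that total
```

On these rounded inputs the total flow comes to about 17.5 exabytes, e-mail to 0.4 exabytes, and the telephone to nearly all of the flow, which matches Lyman's remark that most of it was voice calls.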

The researchers estimated that the World Wide Web contains 172 terabytes of information on public pages.

The UC Berkeley researchers used various methods to estimate the amount of information generated and stored, including statistics such as hard drive and paper sales, publication statistics and a sampling of the Web. The research team's methods are described in more detail at http://www.sims.berkeley.edu/research/projects/how-much-info-2003/.

One surprise for Lyman was that while digital storage continues to grow, the use of paper to transmit information is not shrinking. His team estimated that the number of terabytes of information put on paper each year increased by 36 percent from 1999 to 2001, while the amount of data stored magnetically each year increased by 80 percent between 1999 and 2002.

Each North American consumes 11,916 sheets of paper a year, while each resident of the European Union consumes 7,280 sheets, the team estimated. Most of that paper information comes from office documents and mail, not formally published titles such as books or newspapers.





 
