November 04, 2005

Microsoft moves to digitize British Library

MSN Book Search site to offer 25 million pages from the British Library's collection next year

The scanning race has started: Microsoft announced an agreement Friday to scan 25 million pages from the British Library's collection that will eventually be made available on its MSN Book Search site next year.

Around 100,000 books from the British Library's 13 million book collection will be digitized, according to a joint press release. MSN Book Search, launched earlier this month, is scheduled for a beta release next year.

The agreement comes as Microsoft's competitors, such as Google and Yahoo, are aggressively moving toward compiling online libraries of books amid copyright concerns. The titles to be scanned at the British Library are no longer under copyright restrictions.

Microsoft is contracting with the Internet Archive, a nonprofit group based in San Francisco that works on digital preservation projects, to do the scanning, said Richard Boulderstone, director of e-Strategy for the British Library. Microsoft is not paying the library for access; however, the library will benefit, as it has been working for the last 10 years on digitization.

Despite a decade of work, only 0.2 percent of the library's vast collection has been digitized, Boulderstone said.

"Actually, for us to have some of these commercial players come along and want to work with us on digitizing these collections, it's fantastic," Boulderstone said.

Microsoft's announcement comes as Google said Thursday it had made a significant addition of scans from public-domain books to its Google Print site. The company is working from the collections of the University of Michigan, Harvard University, Stanford University, the New York Public Library and Oxford University.

Google is also facing two copyright infringement lawsuits over the scanning of copyright works in those collections, a practice that the company has halted but vows to resume, citing laws that allow certain liberties with the use of protected material. Google said it will focus on out-of-print and older selections.

Yahoo and Microsoft have thrown their support behind the Open Content Alliance (OCA), a group based in San Francisco working to digitize public domain text and films. The Internet Archive is one member of the OCA. Yahoo has offered to index content while also funding the digitization of a collection of American literature selected by the University of California.

MSN said last week it is talking with libraries and publishers about offering copyrighted material in its index. Microsoft eventually plans to build a business model around the search service for copyright works, but so far has said it doesn't intend to charge for searches of noncopyrighted material.

Close

On Twitter now

Platforms

Powered by Twitter

On Twitter now

White Paper

D2D Virtual Tape Library Replication Primer

This whitepaper explains the terminology and concepts behind Data Replication technologies and establishes some sizing rules through worked examples. Learn the new paradigm in disaster tolerance—protect data anywhere.

Download now »

White Paper

An Alternative to Virtualization for Datacenter Cost Savings

Server virtualization is a popular option for dealing with mounting datacenter costs. Another equally promising approach is the use of an Application Delivery Controller. Citrix NetScaler provides a low-cost way for organizations to reduce their server count and accrue cost savings from a reduction in space, cooling, power and personnel.

Download now »

White Paper

Why Your Firewall, VPN, and IEEE 802.11i Aren't Enough to Protect Your Network

The emergence of WLANs has created a new breed of security threats to enterprise networks.

Included in HP ProCurve WLAN solutions is security technology that alleviates threats from WLANs through:
* Monitoring wireless activity inside and out of the enterprise
* Classifying WLAN transmissions into harmful and harmless
* Preventing transmissions that pose a security threat to the enterprise network
* Locating participating devices for physical remediation

Download now »

White Paper

Bringing the Edge to the Data Center

Effectively address data protection challenges, implementing solutions that help store and protect business–critical data while cutting costs and improving efficiency and reliability.

Download now »

Sign up to receive Platforms Resource Alerts

Subscribe to the Today's Headlines: First Look Newsletter

Find out what will be news for the day, with our first-thing-in-the-morning briefing.

©1994-2009 Infoworld, Inc.