Free Newsletters
Technology & Business Daily

InfoWorld
Log-in | Register

IBM develops speech recognition in Indian language

IBM's speech recognition software for Hindi has both commercial and social applications


IBM’s India Research Laboratory (IRL) has developed a speech recognition software for Hindi, one of the key languages in India.

The software has both commercial applications and social applications such as bridging the digital divide, Daniel Dias, director of the lab, said in a telephone interview on Thursday.

The Indian government and other local agencies have been promoting the use of local languages in computing, but the development of an input device for Indian languages has proven to be quite difficult. The Devnagri script used in Hindi has over 40 basic characters, and some 12 modifiers to the characters that are represented above or below the basic characters.

There are a number of keyboards available for the Devnagri script, but to input one character of the script, the user has to punch a combination of keys, said Ashish Verma, a senior researcher at IRL, and the lead on this project.

Hewlett-Packard’s lab in India has designed a touch-pad, which it calls the "gesture keyboard", which uses a combination of tapping and gestures to handle Hindi. The touch-pad has the basic characters and numbers of the Devnagri script on it. The character with the required modifier can be input into the computer by specific user gestures when tapping the basic character with a pen-based input device.

IBM, by contrast, has opted for a technology based on speech recognition, both because it is simpler, and also because it can be used by the large number of Indians who are semi-literate or not familiar with a computer keyboard. The dictionary, developed by IRL for the speech recognition system, has over 75,000 words in Hindi, with a provision to add new words, Verma said.

One of the challenges in developing a speech recognition system for Hindi was that words in the language are often pronounced quite differently in various parts of the country. "We had to come up with multiple pronunciations for a given word in Hindi, and include them in the dictionary, and get them recognized (by the system) in the testing phase," Verma said.

The core technology, developed by IRL, can be used in PC applications such as data entry, letter-writing, sending emails, as well as to speech-enable ATMs (automated teller machines), kiosks and other devices, Dias said. The software can  also be used for issuing commands to the computer, and for IVR (interactive voice response) applications in telephony, he added. As the software supports Unicode it can be integrated with a number of word processing and email applications including from Microsoft, Verma said.

The software produces word recognition rates in the range of 90-95 percent with speaker adaptation and 80-90 percent without speaker adaptation. The accuracy is close to 100 percent for ‘command and control’ kind of applications, such as operating ATMs and kiosks, since the vocabulary is limited in these applications, IBM said.

The technology is being integrated by the Center for Development of Advanced Computing (C-DAC), an Indian government run research organization, into its programs for facilitating computing in Indian languages. IBM may also make this technology available to other markets, including for commercial applications. The company has not yet decided on the business model, Dias said.


Talkback:

commentPost a Comment

 

MOST COMMENTS

 
 





Application Grid: Oracle's Vision for Next-Generation Application Servers and Infrastructure
View this live Webcast to hear senior Oracle executives Hasan Rizvi and Steve Harris discuss the application grid. Learn how Oracle is combining cutting-edge technologies from its recent acquisition of BEA with the Fusion Middleware portfolio. Discover a new level of reliability, performance, and "scale-agility" in your data center, with emphasis on efficiency for today's challenging economic environment. Sponsored by Oracle

»  Click here to view this Webcast
  Virtualization Solutions Guide
This comprehensive IT Strategy Guide covers Virtualization and puts you at the forefront of the discussion. You'll learn all you need to know from the cost of virtualization, how to implement it for your business, how to back it up safely and which products are best. Sponsored by Riverbed

»  Click here to download now

- Special Advertising Partners -
WHITE PAPERS
 

» Technology White Papers Library

Technology White Papers by Topic

Technology White Papers E-mail Alert

Find out when the latest white paper is available:
 
 
INFOWORLD MARKETPLACE
 
» BUY A LINK NOW
 
 

Video

 
 
 

Podcasts

 
IFW Daily 09/05/2008

Sun to craft software stack into NAS appliances, former CA CEO Sanjay...

 
 

 

Columnists

 
 
 

Resource Center


Ads by techwords beta  [See your link here]
 




Sponsored Technology Links

 
 
 HOME  NEWS  BLOGS  PODCASTS  VIDEOS  TECHNOLOGIES  TEST CENTER  EVENTS  CAREERS   About | Advertise | Awards | RSS | Contact Us 

Copyright © 2008, Reprints, Permissions, Licensing, IDG Network, Privacy Policy, Terms of Service.
All Rights reserved. InfoWorld is a leading publisher of technology information and product reviews on topics including viruses,
phishing, worms, firewalls, security, servers, storage, networking, wireless, databases, and web services.

CIO :: ComputerWorld :: CSO :: Demo :: GamePro :: Games.net :: IDG Connect :: IDG World Expo
Industry Standard :: IT World :: JavaWorld :: LinuxWorld :: MacUser :: Macworld :: Network World :: PC World :: Playlist