Free Newsletters
Technology & Business Daily

InfoWorld
Log-in | Register

NTT DoCoMo develops speech recognition without speech

System can be used in noisy environments where other systems have problems

By Martyn Williams, IDG News Service
March 03, 2004
 

YOKOSUKA, JAPAN -- NTT DoCoMo Inc. lifted the lid Tuesday on its five-year-old research and development (R&D) center in Japan and demonstrated a couple of the technologies the operator is working on, including a speech recognition system that doesn't require speech.

Free IT resource

Hear how top CIOs turn change into a competitive advantage.

Sponsored by HP

Free IT resource

Attend the SOA Executive Forum: Breaking SOA Bottlenecks SOAExecForum.com/may2007

Sponsored by InfoWorld

The company, Japan's largest cellular operator, last year spent around ¥150 billion ($1.4 billion) on R&D. It employs more than 1,100 people to look into new technologies and communications methods. Many of those people are based here in Yokosuka, west of Tokyo. The carrier, which also has laboratories in the U.S. and Germany, is in the process of establishing a center in China.

Research covers areas of importance to the company's business. For example, HSDPA (high-speed downlink packet access) data communications technology, which allows downlink speeds of up to 14Mbps (bits per second), is being developed in Yokosuka, as well as multimedia and human-machine interaction technologies, such as the speech recognition system.

The system, which is still a prototype, works by measuring the electrical activity in muscles that are used when a person speaks using a system called electromyography (EMG). This means the user still has to mouth the words as if they are being spoken but audible speech itself isn't necessary.

Three electrodes need to be touching various areas of the face to measure the electrical activity. In the demonstration, a user had sensors mounted on his thumb and first two fingers. The thumb was placed under his chin, the forefinger was held vertically touching his cheek bone and the second finger held just above his top lip. While the actual positioning may look strange, it is not particularly difficult and doesn't impede speech or the mouthing of words a great deal.

At present the system, which is the result of three years of work by a small number of researchers, has been programmed to recognize the five vowels of Japanese.

In the demonstration, the system worked well, accurately recognizing the vowels mouthed by a researcher.

The next stage in development is recognition of consonants, and developers are also working on other languages, said Tomoyuki Ohya, director of NTT DoCoMo's Multimedia Signal Processing Laboratory. Because EMG doesn't require sound to recognize what is being said, the system can be used in noisy environments where existing speech recognition systems have problems, according to NTT DoCoMo.

The company also demonstrated a three-dimensional audio communication system for cell phones. Similar to surround-sound, the system adds depth to the basic left and right distinction offered by stereo and then adapts the sound being heard depending on a listerner's position in relation to that of the person speaking.

As an example, NTT DoCoMo offered the example of two people trying to meet each other in a crowded place. Usually, a cellular telephone conversation between the two would include relating details of close-by objects -- for example: "I'm next to the coffee shop near the clock." But the new technology would make the other party's voice appear to be coming from that person's actual location. So, in this case, if the coffee shop were 100 meters behind and to the right of the listener, the voice would appear to come from that direction. As the listener turns around, the voice would continue to appear from the same physical location, which after turning would be ahead and left.

In its current configuration, the system requires a pair of headphones because it needs to deliver sound to both ears to give the illusion of depth. It also requires some type of positioning technology that is able to accurately measure the location of users and also the direction they are facing.

Research on EMG technology is ongoing. Nobuhiko Naka, a research engineer at the Multimedia Signal Processing Laboratory, wasn't willing, however, to estimate when it could be complete.

The two prototypes aren't the first shown by NTT DoCoMo to challenge current cellular telephone technology. In 2000, the operator demonstrated at a Japanese trade show a telephone that sends sound to the ear by creating vibrations that travel along bones.

Although the company hasn't yet sold a product based on the technology, rival carrier Tu-Ka Cellular Tokyo Inc. earlier this year launched a handset produced by Sanyo Electric Co. Ltd. that uses a similar system. The Sanyo TS-41 handset looks and functions as a conventional clamshell handset but can be pressed against the user's face and the vibration function turned on when in a noisy environment.

 





 

TOP NEWS:


»  Four quick tips for choosing an IM security product
71 percent of businesses will invest in real-time messaging this year. If you're one of them, be sure to protect your enterprise

»  Forrester analysts ID hot IT jobs
Research group finds 16 IT roles with a promising future

»  Nvidia claims 10 hours of HD video on Tegra chip
The Tegra 600 and 650 can be used with hard disk drives and are designed partly for mobile Internet devices

»  Database vendors add Google's MapReduce
Greenplum and Aster Data Systems will support Google's programming technique, developed for parallel processing of large data sets across commodity hardware

»  Network management: Tips for managing costs
New technologies, changing requirements, and ongoing equipment maintenance and upgrades cost money, but there are ways to manage expenses

»  EMC targets SMBs, branch offices with new low-end storage
Celerra NX4 highlights include thin provisioning, snapshot technology for data recovery and backups, and Web-based console for management of storage volumes




Solutions to the Toughest IT Challenges in Remote Offices
Though small in size, remote offices face many of the same IT challenges as larger central offices. This Webcast zeroes in on the top line challenges to deliver information that can provide immediate benefits to your business. Sponsor: AMD and Dell

»  Click here to view this Webcast
  The Path to Enterprise Security
This is your comprehensive guide to Enterprise Security. In it you'll find solutions to the most pressing security threats facing you and your company. Learn the latest on insider threats and how to effectively minimize risk within your organization. Sponsored by Nokia

»  Click here to download now

- Special Advertising Partners -
WHITE PAPERS
 

» Technology White Papers Library

Technology White Papers by Topic

Technology White Papers E-mail Alert

Find out when the latest white paper is available:
 
 
INFOWORLD MARKETPLACE
 
» BUY A LINK NOW
 

FIND PRODUCTS AND COMPANIES
» COMPLETE PRODUCT GUIDE



TECHNOLOGY INDEX
• Applications
• Application Development
• Security
• Networking
• Wireless
• Platforms
• Hardware
• Data Management
• Storage
• Web Services
• Business
• Telecom
• Professional Services
• Standards

TECH WATCH 


What's the 411 on GOOG-411?
Just as Google has become synonymous with "performing a Web search," 411 is understood to mean "information" -- as in "what's the 411?" I was thus surprised to discover, from a billboard, no less, that the king of search is taking on the ...

Apple HTML source reveals 'iPhone Extreme'
"This one's a stretch..." reports AppleInsider. Um, yeah. Reporting on HTML code sightings of product names could be called a stretch, but iPhone Extreme has a ring to it. Now, that sounds like the product Apple should have released first, rather ...

COLUMNISTS

Unified under law
Ephraim Schwartz's Column and Blog (InfoWorld) - In the litigious world we live in, deploying a unified communications platform in your enterprise could...
» MORE COLUMNISTS

MORE INFOWORLD BLOGS


Open Sources 
Product Management
When I joined MySQL four years ago, there was quite a lot of debate about product management. We didn't actually have ...

Zero Day 
Botnet herders tending smaller flocks
New research backs up the theory that botnet operators are keeping their networks smaller in a continued effort to keep ...



• Advice Line
• Database Underground
• The Deep End
• Enterprise Mac
• Geeks in Paradise
• Grid Meter
• The Gripe Line
• InfoWorld Daily
• Inside IT
• IT Troubleshooter
• ITXtreme
• Open Sources
• ProdBlog
• Real World SOA
• Reality Check
• Security Adviser
• SMB IT
• The Storage Network
• Tech Watch
• Virtualization Report
• Zero Day

ADVERTISEMENT


RESOURCE CENTERadvertisement 

GOVERNMENT IT & POLICY
'If you don't go after the network, you're never going to stop these guys. Never.'
From the State Department, All the News for Inquiring Minds
TechPresident, the Internet Citizenry's New Consensus Taker



Sponsored Technology Links

 
 
 HOME  NEWS  BLOGS  PODCASTS  VIDEOS  TECHNOLOGIES  TEST CENTER  EVENTS  CAREERS   About | Advertise | Awards | RSS | Contact Us 

Copyright © 2008, Reprints, Permissions, Licensing, IDG Network, Privacy Policy, Terms of Service.
All Rights reserved. InfoWorld is a leading publisher of technology information and product reviews on topics including viruses,
phishing, worms, firewalls, security, servers, storage, networking, wireless, databases, and web services.

CIO :: ComputerWorld :: CSO :: Demo :: GamePro :: Games.net :: IDG Connect :: IDG World Expo
Industry Standard :: IT World :: JavaWorld :: LinuxWorld :: MacUser :: Macworld :: Network World :: PC World :: Playlist