Free Newsletters
InfoWorld Daily

InfoWorld
Log-in | Register

Speech vendors target developers with multimodal tools

SALT Forum and IBM offer two different solutions to one problem

By Ephraim Schwartz
July 15, 2002
 

DUELING SPEECH TECHNOLOGY announcements will come to the fore on Monday when the SALT (Speech Application Language Tags) Forum ships Version 1.0 of its multimodal specification and IBM reveals its X+V technology for multimodal applications.

Free IT resource

Hear how top CIOs turn change into a competitive advantage.

Sponsored by HP

Free IT resource

Try Sun servers, workstations and storage products free for 60-days.

Sponsored by Sun Microsystems

While the W3C (World Wide Web Consortium) standards body reviews IBM's X+V as a solution to multimodal capabilities on handheld devices, sources close to the W3C say that it will soon acknowledge receipt of a SALT Forum submission for the same technology.

Multimodal refers to a developer's ability to mix both a voice and visual interface into a single application.

The IBM technology, X+V, an acronym for xHTML plus Voice, will ship in October and will be included as an extension to Big Blue's WebSphere Everyplace Access in the first half of next year.

"A brokerage customer requesting trading positions uses his voice, but he doesn't want to hear the results because they can be fairly long. With multimodal, the request is verbal and the response is visual," said Sunil Soares, director of product management for IBM's Pervasive Computing initiative, in Somers, N.Y.

IBM submitted its specification late last year for X+V to the W3C and it has been acknowledged -- terminology for the standards body's willingness to consider a specification as a standard. The W3C has, in fact, set up a working group to focus on multimodal, Soares said.

"The spec lets you operate across three modes. It lets you operate in a multimodal environment, but if you want to use just visual it ignores the VXML tags and vice versa," Soares said.

In October IBM also will ship a set of tools for developers to write multimodal applications using VXML.

Despite appearances of dueling standards, the W3C will most likely come up with a single solution, according to Bill Meisel, president of TMA Associates in Tarzana, Calif.

"VXML will be broken into pieces and some of the same ideas and concepts in SALT will be applied to those pieces," Meisel said.

However, there is one non-technical difference between the two proposed standards that could lead to problems down the road. While the SALT specification is royalty-free the VXML specification is not, Meisel said.

"The technical people on the VXML Forum wanted it royalty-free, but they were over-ridden by their companies. W3C could never get the VXML members to drop all of their IP requirements," Meisel said.

However, it is highly unlikely that any company will try and sabotage the VXML initiative by claiming someone is violating its patent, Meisel added, although the possibility does cloud the issue.

The release on Monday by the SALT Forum is the first fully functioning specification suitable for developers to take and build applications around, said Peter Gavalakis, marketing manager, at Intel in Parsippany, N.J.

Gavalakis also sees the possibility that the standards organizations might take a look at the different paths toward multimodal and decide to converge them. However, there is another option.

"They might provide multiple options, one more high-level using XML -- and that would be VXML -- and a more low-level solution using SALT," Gavalakis said.

As to the long-standing disagreement between SALT and VXML proponents, it appears that it still continues. Soares called SALT "verbose."

"From a developer perspective, SALT requires you to write a boatload of Java script to manage voice dialogues," said Soares, adding that VXML, on the other hand, has an FIA (forms interpretation algorithm) that automatically manages voice dialogues.

Salt's representative, Gavalakis, didn't disagree but had his own counterpunch.

"He's [Soares] right. The strength of VXML was engineered to enable telephony applications and having the FIA embedded provides a faster development time for certain types of applications that are system directed, that use prompts. But Web applications and multimodal applications are more user-directed and FIA limits your flexibility," Gavalakis said.

TMA Associates' Meisel had the final word: "The W3C will sort it out."





 


 
InfoWorld Editor at Large Ephraim Schwartz is based in San Francisco.
 

TOP NEWS:


»  Four quick tips for choosing an IM security product
71 percent of businesses will invest in real-time messaging this year. If you're one of them, be sure to protect your enterprise

»  Forrester analysts ID hot IT jobs
Research group finds 16 IT roles with a promising future

»  Nvidia claims 10 hours of HD video on Tegra chip
The Tegra 600 and 650 can be used with hard disk drives and are designed partly for mobile Internet devices

»  Database vendors add Google's MapReduce
Greenplum and Aster Data Systems will support Google's programming technique, developed for parallel processing of large data sets across commodity hardware

»  Network management: Tips for managing costs
New technologies, changing requirements, and ongoing equipment maintenance and upgrades cost money, but there are ways to manage expenses

»  EMC targets SMBs, branch offices with new low-end storage
Celerra NX4 highlights include thin provisioning, snapshot technology for data recovery and backups, and Web-based console for management of storage volumes




REMOTE ACCESS: MAINTAIN SECURITY AND DECREASE THE BURDEN ON IT
Join this interactive webcast to discover how IT Managers can control access rights, end-user security settings and end-point authorization. Sponsor: Citrix(R) GoToMyPC(R) Corporate

»  Click here to view this Webcast
  WAN Emulation Sponsored Solutions Guide
WAN emulation technology enables IT organizations to predict reliably how applications will perform in a networked environment, before application rollout, mitigating development risk and costs.This Sponsores Solutions Guide has everything you need to now about WAN emulation and WAN and how to best implement it in your organization. Sponsored by Shunra

»  Click here to download now

- Special Advertising Partners -
WHITE PAPERS
 

» Technology White Papers Library

Technology White Papers by Topic

Technology White Papers E-mail Alert

Find out when the latest white paper is available:
 
 
INFOWORLD MARKETPLACE
 
» BUY A LINK NOW
 

FIND PRODUCTS AND COMPANIES
» COMPLETE PRODUCT GUIDE



TECHNOLOGY INDEX
• Applications
• Application Development
• Security
• Networking
• Wireless
• Platforms
• Hardware
• Data Management
• Storage
• Web Services
• Business
• Telecom
• Professional Services
• Standards

TECH WATCH 


What's the 411 on GOOG-411?
Just as Google has become synonymous with "performing a Web search," 411 is understood to mean "information" -- as in "what's the 411?" I was thus surprised to discover, from a billboard, no less, that the king of search is taking on the ...

Apple HTML source reveals 'iPhone Extreme'
"This one's a stretch..." reports AppleInsider. Um, yeah. Reporting on HTML code sightings of product names could be called a stretch, but iPhone Extreme has a ring to it. Now, that sounds like the product Apple should have released first, rather ...

COLUMNISTS

Unified under law
Ephraim Schwartz's Column and Blog (InfoWorld) - In the litigious world we live in, deploying a unified communications platform in your enterprise could...
» MORE COLUMNISTS

MORE INFOWORLD BLOGS


Open Sources 
Product Management
When I joined MySQL four years ago, there was quite a lot of debate about product management. We didn't actually have ...

Zero Day 
Botnet herders tending smaller flocks
New research backs up the theory that botnet operators are keeping their networks smaller in a continued effort to keep ...



• Advice Line
• Database Underground
• The Deep End
• Enterprise Mac
• Geeks in Paradise
• Grid Meter
• The Gripe Line
• InfoWorld Daily
• Inside IT
• IT Troubleshooter
• ITXtreme
• Open Sources
• ProdBlog
• Real World SOA
• Reality Check
• Security Adviser
• SMB IT
• The Storage Network
• Tech Watch
• Virtualization Report
• Zero Day

ADVERTISEMENT


RESOURCE CENTERadvertisement 

GOVERNMENT IT & POLICY
'If you don't go after the network, you're never going to stop these guys. Never.'
From the State Department, All the News for Inquiring Minds
TechPresident, the Internet Citizenry's New Consensus Taker



Sponsored Technology Links

 
 
 HOME  NEWS  BLOGS  PODCASTS  VIDEOS  TECHNOLOGIES  TEST CENTER  EVENTS  CAREERS   About | Advertise | Awards | RSS | Contact Us 

Copyright © 2008, Reprints, Permissions, Licensing, IDG Network, Privacy Policy, Terms of Service.
All Rights reserved. InfoWorld is a leading publisher of technology information and product reviews on topics including viruses,
phishing, worms, firewalls, security, servers, storage, networking, wireless, databases, and web services.

CIO :: ComputerWorld :: CSO :: Demo :: GamePro :: Games.net :: IDG Connect :: IDG World Expo
Industry Standard :: IT World :: JavaWorld :: LinuxWorld :: MacUser :: Macworld :: Network World :: PC World :: Playlist