Free Newsletters
Technology & Business Daily

InfoWorld
Log-in | Register

Microsoft pitches voice spec

SALT support trumps Voice XML as Speech Server sounds return of enterprise voice

By Ephraim Schwartz
July 11, 2003
 

Microsoft has welcomed a new addition to its server family: the Speech Server.

Free IT resource

Open Source Business Conference (OSBC) May 22-23, 2007

Sponsored by OSBC

Free IT resource

TechNet: More ways to know it, share it, and keep it running.

Sponsored by Microsoft

Running on Windows Server 2003, the first public beta of Speech Server will ship with Beta 3 of Microsoft’s Speech Application SDK (Software Development Kit) in what signals speech technology’s return to the corporate agenda.

Due for manufacturing release before mid-2004, the product will include a text-to-speech engine from SpeechWorks — Microsoft’s own speech-recognition engine — and a telephony interface manager. The offering will also include middleware that is being designed in partnership with Santa Clara, Calif.-based Intel and Dallas-based Intervoice to connect the Microsoft product to an enterprise telephony infrastructure.

But it is the server’s SALT (Speech Application Language Tags) voice browser that sets Microsoft apart from the standards crowd.

Rather than adhering to VXML (Voice XML) — the current W3C standard for developing speech-based telephony applications — Speech Server is compatible only with applications that use the specifications developed by the SALT Forum, of which Microsoft is a founding member.

The SALT Forum has submitted its specifications to a W3C working group, but they are far from becoming a standard.

“The process could take years,” admitted James Mastan, director of marketing for the speech technologies group at Redmond, Wash.-based Microsoft.

The SALT specification was originally targeted at the multimodal market for browsing the Web on handheld devices. The theory was that users required multiple ways to interface with smaller devices and that voice would be chief among them, but the market for multimodal handhelds has not materialized.

Microsoft executives believe the SALT-based Speech Server is ideally suited to call centers where the cost of using live operators is becoming prohibitive.

An InStat/MDR research report stated that live agents cost $1 to $5 per call as opposed to 20 cents for a speech-recognition system.

“This is not a desktop solution but an enterprise application,” said Elizabeth Herrell, an analyst at Forrester Research in Santa Clara, Calif.

Bill Meisel, a principal at TMA Associates, a leading speech technology research company based in Tarzana, Calif., said enterprise voice adoption will increase due to Microsoft’s market influence. Yet, because Speech Server will compete directly with established VXML applications, Microsoft’s actions will make speech technology adoption a more complex exercise for the enterprise, according to Meisel.

Competing speech technology vendor IBM is a case in point. Big Blue supports VXML and the W3C standard, according to Gene Cox, director of mobile solutions at Armonk, N.Y.-based IBM.

Cox said significant VXML applications already exist in the enterprise at companies such as AT&T, General Motors’ OnStar division, and Sprint PCS.

“VXML conforms to all W3C royalty-free polices. But SALT is like Internet Explorer; it is free as long as you buy Windows,” Cox said.

The debate over which technology to use will not be fought out at the customer level, said Forrester’s Herrell, but rather by developers.

“Customers just want a solution that works. Developers will decide which platform to use based on its quality, and for that, it is too early to tell,” Herrell said.

VXML is a separate language that developers must learn, TMA’s Meisel said. For VXML to support Web-based applications, such as those residing in call centers, VXML must connect with the back-end servers.

“People are using J2EE to drive VXML applications and to provide a more standard interface to Web services,” Meisel said.

Irvine, Calif.-based NewportWorks, an information service provider for the real estate industry, is one example of an IBM customer that will be hard to shift away from Voice XML. According to CEO Ken Stockman, the company could not exist without Voice XML. NewportWorks aggregates the data from the MLS (Multiple Listing Service), uses IBM’s WebSphere Speech Server to convert the listings for voice access, and sells the service to real estate agencies. The MLS data is either sent through a real estate industry XML gateway called RETS (Real Estate Transaction Standard), dumped into a flat file and sent via FTP, or MLS creates a SQL view.

“Our service couldn’t exist without speech technology. The economics of a call center don’t work,” said Stockman, who added that the company has not investigated SALT-based technologies. Regardless, NewportWorks has thrown its support behind Java, which provides most of the heavy lifting for its solution.

A Java application layer manages the interaction between client and data with Java dynamically generating the VXML, using IBM WebSphere to handle the calls. 

Stockman said the learning curve on VXML for developers was negligible. Microsoft, on the other hand, argues that Web developers don’t want to learn a new language. Instead, they want SALT tag plug-ins for existing Web-based applications.

According to Intervoice, the argument may be resolved through tools such as its Invision, which allows a developer to automatically generate VXML and to possibly generate SALT code in the future.





 


 
Ephraim Schwartz is an editor at large at InfoWorld.

  More of Ephraim Schwartz's column

Newsletter Check out all of our free newsletters!
Enter e-mail address:




 

TOP NEWS:


»  You don't know tech: The InfoWorld news quiz
Match your weekly tech news wits against our snarky quiz master

»  Antitrust review of Google-Yahoo deal no surprise
While serious antitrust problems are unlikely, both Google and Yahoo expected their partnership to be subjected to instense DOJ scrutiny

»  Top 10: Coreflood, more Microsoft-Yahoo, iPhone plans
This week's wrapup of the top tech news stories includes more Microsoft-Yahoo rumors, iPhone updates, Flash searches, Oracle's BEA roadmap, and more

»  Four 'important' Microsoft patches due Tuesday
Not rated "critical," fixes apply to "Elevation of Privileges" and "spoofing" bugs for Windows, Exchange, and SQL

»  Judge grants RIM a stay in Visto patent trial
Trial delayed from beginning next week while patent office studies validity of certain parts of e-mail provider Visto's patents as requested by RIM

»  Developers satisfied with Apple's enterprise work
Mac developers feel that Apple shouldn't try to make a broad attempt to win over enterprises and should instead focus on certain areas within the enterprise




Are you ready for event-driven business?
"Faster than a speeding bullet" doesn't just refer to superheroes anymore, it's the velocity your business needs to compete. In this webcast you will learn strategies you can implement today that will keep your systems ahead of the increased business velocity. Sponsor: Progress Sonic

»  Click here to view this Webcast
  The Silver Lining: Cloud Computing
This IT Strategy Guide digs deep into cloud computing helping put you ahead of the curve on this hot topic. It explores the differences between cloud computing, grid computing and utility computing and then helps you see where and how each applies to your business. Sponsored by Box.net

»  Click here to download now

- Special Advertising Partners -
WHITE PAPERS
 

» Technology White Papers Library

Technology White Papers by Topic

Technology White Papers E-mail Alert

Find out when the latest white paper is available:
 
 
INFOWORLD MARKETPLACE
 
» BUY A LINK NOW
 
SEE ALSO
• Speech vendors target developers with multimodal tools
• Speech vendors shout for standards


FIND PRODUCTS AND COMPANIES
» COMPLETE PRODUCT GUIDE



TECHNOLOGY INDEX
• Applications
• Application Development
• Security
• Networking
• Wireless
• Platforms
• Hardware
• Data Management
• Storage
• Web Services
• Business
• Telecom
• Professional Services
• Standards

TECH WATCH 


What's the 411 on GOOG-411?
Just as Google has become synonymous with "performing a Web search," 411 is understood to mean "information" -- as in "what's the 411?" I was thus surprised to discover, from a billboard, no less, that the king of search is taking on the ...

Apple HTML source reveals 'iPhone Extreme'
"This one's a stretch..." reports AppleInsider. Um, yeah. Reporting on HTML code sightings of product names could be called a stretch, but iPhone Extreme has a ring to it. Now, that sounds like the product Apple should have released first, rather ...

COLUMNISTS

Unified under law
Ephraim Schwartz's Column and Blog (InfoWorld) - In the litigious world we live in, deploying a unified communications platform in your enterprise could...
» MORE COLUMNISTS

MORE INFOWORLD BLOGS


Open Sources 
Product Management
When I joined MySQL four years ago, there was quite a lot of debate about product management. We didn't actually have ...

Zero Day 
Botnet herders tending smaller flocks
New research backs up the theory that botnet operators are keeping their networks smaller in a continued effort to keep ...



• Advice Line
• Database Underground
• The Deep End
• Enterprise Mac
• Geeks in Paradise
• Grid Meter
• The Gripe Line
• InfoWorld Daily
• Inside IT
• IT Troubleshooter
• ITXtreme
• Open Sources
• ProdBlog
• Real World SOA
• Reality Check
• Security Adviser
• SMB IT
• The Storage Network
• Tech Watch
• Virtualization Report
• Zero Day

ADVERTISEMENT


RESOURCE CENTERadvertisement 

GOVERNMENT IT & POLICY
'If you don't go after the network, you're never going to stop these guys. Never.'
From the State Department, All the News for Inquiring Minds
TechPresident, the Internet Citizenry's New Consensus Taker



Sponsored Technology Links

 
 
 HOME  NEWS  BLOGS  PODCASTS  VIDEOS  TECHNOLOGIES  TEST CENTER  EVENTS  CAREERS   About | Advertise | Awards | RSS | Contact Us 

Copyright © 2008, Reprints, Permissions, Licensing, IDG Network, Privacy Policy, Terms of Service.
All Rights reserved. InfoWorld is a leading publisher of technology information and product reviews on topics including viruses,
phishing, worms, firewalls, security, servers, storage, networking, wireless, databases, and web services.

CIO :: ComputerWorld :: CSO :: Demo :: GamePro :: Games.net :: IDG Connect :: IDG World Expo
Industry Standard :: IT World :: JavaWorld :: LinuxWorld :: MacUser :: Macworld :: Network World :: PC World :: Playlist