July 11, 2003

Microsoft pitches voice spec

SALT support trumps Voice XML as Speech Server sounds return of enterprise voice

Microsoft has welcomed a new addition to its server family: the Speech Server.

Running on Windows Server 2003, the first public beta of Speech Server will ship with Beta 3 of Microsoft’s Speech Application SDK (Software Development Kit) in what signals speech technology’s return to the corporate agenda.

Due for manufacturing release before mid-2004, the product will include a text-to-speech engine from SpeechWorks — Microsoft’s own speech-recognition engine — and a telephony interface manager. The offering will also include middleware that is being designed in partnership with Santa Clara, Calif.-based Intel and Dallas-based Intervoice to connect the Microsoft product to an enterprise telephony infrastructure.

But it is the server’s SALT (Speech Application Language Tags) voice browser that sets Microsoft apart from the standards crowd.

Rather than adhering to VXML (Voice XML) — the current W3C standard for developing speech-based telephony applications — Speech Server is compatible only with applications that use the specifications developed by the SALT Forum, of which Microsoft is a founding member.

The SALT Forum has submitted its specifications to a W3C working group, but they are far from becoming a standard.

“The process could take years,” admitted James Mastan, director of marketing for the speech technologies group at Redmond, Wash.-based Microsoft.

The SALT specification was originally targeted at the multimodal market for browsing the Web on handheld devices. The theory was that users required multiple ways to interface with smaller devices and that voice would be chief among them, but the market for multimodal handhelds has not materialized.

Microsoft executives believe the SALT-based Speech Server is ideally suited to call centers where the cost of using live operators is becoming prohibitive.

An InStat/MDR research report stated that live agents cost $1 to $5 per call as opposed to 20 cents for a speech-recognition system.

“This is not a desktop solution but an enterprise application,” said Elizabeth Herrell, an analyst at Forrester Research in Santa Clara, Calif.

Bill Meisel, a principal at TMA Associates, a leading speech technology research company based in Tarzana, Calif., said enterprise voice adoption will increase due to Microsoft’s market influence. Yet, because Speech Server will compete directly with established VXML applications, Microsoft’s actions will make speech technology adoption a more complex exercise for the enterprise, according to Meisel.

Competing speech technology vendor IBM is a case in point. Big Blue supports VXML and the W3C standard, according to Gene Cox, director of mobile solutions at Armonk, N.Y.-based IBM.

Cox said significant VXML applications already exist in the enterprise at companies such as AT&T, General Motors’ OnStar division, and Sprint PCS.

“VXML conforms to all W3C royalty-free polices. But SALT is like Internet Explorer; it is free as long as you buy Windows,” Cox said.

The debate over which technology to use will not be fought out at the customer level, said Forrester’s Herrell, but rather by developers.

Close

On Twitter now

Platforms

Powered by Twitter

On Twitter now

White Paper

D2D Virtual Tape Library Replication Primer

This whitepaper explains the terminology and concepts behind Data Replication technologies and establishes some sizing rules through worked examples. Learn the new paradigm in disaster tolerance—protect data anywhere.

Download now »

White Paper

An Alternative to Virtualization for Datacenter Cost Savings

Server virtualization is a popular option for dealing with mounting datacenter costs. Another equally promising approach is the use of an Application Delivery Controller. Citrix NetScaler provides a low-cost way for organizations to reduce their server count and accrue cost savings from a reduction in space, cooling, power and personnel.

Download now »

White Paper

Why Your Firewall, VPN, and IEEE 802.11i Aren't Enough to Protect Your Network

The emergence of WLANs has created a new breed of security threats to enterprise networks.

Included in HP ProCurve WLAN solutions is security technology that alleviates threats from WLANs through:
* Monitoring wireless activity inside and out of the enterprise
* Classifying WLAN transmissions into harmful and harmless
* Preventing transmissions that pose a security threat to the enterprise network
* Locating participating devices for physical remediation

Download now »

White Paper

Bringing the Edge to the Data Center

Effectively address data protection challenges, implementing solutions that help store and protect business–critical data while cutting costs and improving efficiency and reliability.

Download now »

Sign up to receive Platforms Resource Alerts

Subscribe to the Today's Headlines: First Look Newsletter

Find out what will be news for the day, with our first-thing-in-the-morning briefing.

©1994-2009 Infoworld, Inc.