Welcome | Sign In
CRMBuyer.com
Applications

IBM Makes Speech Code Open Source

Print Version
E-Mail Article
Reprints
IBM Makes Speech Code Open Source

IBM's motive is not altruistic. By making the software open-source, IBM hopes that other programmers will make improvements, thus speeding the development of speech applications and allowing IBM to compete with Microsoft in this growing market. In March Microsoft introduced Microsoft Speech Server 2004 for speech-enabled applications.


Considering CRM solutions?
You first need to understand CRM best practices. Before committing to a CRM purchase and implementation, it's good to know the experience of those who have already "been there, done that." It can save time and prevent costly missteps. Download Free Research.

IBM (NYSE: IBM) today announced it is contributing proprietary voice-recognition software to two different open-source software groups.

The Apache Software Foundation will receive Reusable Dialog Components (RDCs), used for handling simple words for dates, times and locations. In addition, IBM is proposing a project at the Eclipse Foundation to donate markup editors for speech standards established by the W3C, the company said in a statement.

Voice-recognition technology is used in call centers, for hand-free mobile phone use in cars, and elsewhere.

Competing with Microsoft

According to the New York Times, the software IBM is donating cost the company about $10 million to develop.

IBM's motive is not altruistic. By making the software open source, IBM hopes that other programmers will make improvements, thus speeding the development of speech applications and allowing IBM to compete with Microsoft (Nasdaq: MSFT) in this growing market.

In March Microsoft introduced Microsoft Speech Server 2004 for speech-enabled applications. According to Microsoft, more than 100,000 programmers have downloaded Microsoft's free developers' kit for building speech applications on its Windows .Net technology.

Driving Innovation

"This is the latest step in IBM's contribution to open source and to speech technology," said Gary Cohen of IBM Pervasive Computing in a statement.

"By giving more standards-based speech resources to the development community, IBM hopes to accelerate development and drive innovation in all areas of the speech ecosystem -- from speech vendors, to ISVs, to platform providers," Cohen added.

IBM says the move is aimed at ending the battles over competing, proprietary specifications. It will spur the availability of speech-enabled applications by making it easier for developers to build and add speech recognition capability in a standardized way, the company said in a statement.

Supporters of the IBM open-source initiative include Apptera, AT&T (NYSE: T), Audium, Avaya, Cisco (Nasdaq: CSCO), Fluency, Genesys, Kirusa, Loquendo, Motorola (NYSE: MOT), Nortel (NYSE: NT), Nuance, Openstream, ScanSoft (Nasdaq: SSFT), Siebel, Syntellect, Telisma, TuVox, V-Enable, Viecore, Vocomo, VoiceGenie, Voice Partners and VoxGeneration.

Apache and RDCs

RDCs are pre-built speech software components that handle basic functions such as date, time, currency and locations (major cities, states, zip codes). For example, they allow a caller to book a flight over the phone.

Developed by IBM Research, RDCs are Java Server Page tags that enable dynamic development of voice applications and multimodal user interfaces, according to an IBM statement.

IBM says that it hopes to ensure that speech components built using it will work together, regardless of the vendor that created them. Both the framework and a set of example tags are to be contributed to the Apache Software Foundation.

Eclipse and Speech Markup Editors

IBM's contribution of speech markup editors to Eclipse is aimed at making it easier for developers to write standards-based speech applications as well as create and utilize RDCs within those applications, the company said in a statement.

Still in proposal stage, this contribution gives speech developers a standard way of writing VoiceXML applications. It will also give Web developers tools to more easily add speech access to their Web applications.

This comprises the initial formation of a project at Eclipse for open source tools for voice application development, which will be further developed by several companies in the VoiceXML community.


Print Version E-Mail Article Reprints More by Mark Essig


More by Mark Essig

Point and Snoot: Introducing the Nose Mouse
September 16, 2004
Using the Nouse in three-dimensional environments, such as design and game software, requires the use of two broadly spaced Web cams. Both cameras locate the nose tip, and the software uses the two images to calculate the user's head movements in three dimensions. To demonstrate the software's potential, the Canadian lab has worked out a few applications, including multiple-user NousePong.
Can Microsoft Slay Apple's Music Goliath?
September 02, 2004
Content owners may be gravitating toward the Microsoft format because of Apple's refusal to license its software. "As it was with Windows, if iTunes is eclipsed it won't be because of Microsoft's player or service," according to analyst Rob Enderle. "It will be because of their partners and the leverage that these partners bring to the table."
Fed Bullish on Economy, Investors Have Doubts
August 11, 2004
The new report was consistent with Alan Greenspan's forecast last month that the current "soft patch" in the economy would be prelude to quicker growth. The accuracy of that prediction will depend heavily on oil prices, which have increased more than 30 percent since the start of the year and now stand above US$44. The Fed's outlook relies on expectations that oil prices will moderate.
Don't miss a story -- sign up for our FREE e-mail newsletters and view the latest headlines at a glance.
Tech News Flash [ View Sample ]
E-Commerce Minute [ View Sample ]
ECT News Network Weekly Newsletter [ View Sample ]
Shortcuts
ECT News Network Information
Reader Services
Corporate
ECT News Network