|
Article Excerpt One thing weve learned from Web-based application development is that tools are useful only if they can reuse components and third-party libraries and make it easy to assemble applications. This article reviews how we can build modular speech applications using VoiceXML. The focus will be on the language constructs that VoiceXML provides for modularization and reusability and on vendor-specific approaches toward creation of a library of reusable dialogs for speech applications.
As a language, VoiceXML is designed for reusability and modularity. Similar to the Web paradigm, VoiceXML supports modularization of the application components through document-to-document navigation (through menus, form submits, subdialogs etc.). The "src" attribute associated with a number of VoiceXML elements allows URL-based loosely coupled integration.
Since we're used to a hyperlinked world today, we take this for granted, but if we compare it with speech applications developed earlier, it's really like comparing a huge compiled application with a loosely coupled yet integrated application. We're used to separating Web application functionality into multiple sets of scripts (Perl scripts, PHP pages, ASP.NET pages, JavaServer Pages), images, and other media. This capability has been leveraged by VoiceXML as the language supports division of applications into VoiceXML documents (which can be dynamically generated through server-side programming), grammars, prompts, and subdialogs.
However, we'd be kidding ourselves if we didn't acknowledge that development of rich, high-quality speech applications is complex. VoiceXML does make it really simple to assemble a speech application, but we still need to create complex grammars and dialogs that can handle the complex conversations that the application needs to support.
This is where we can utilize the skills available within the speech recognition industry in the form of dialog and grammar experts. And this is where reusable components come to the rescue. Reusable components represent best practices and frequently used dialogs/ grammars that can be used by an application developer (or, rather, an assembler) to create a high-quality speech application.
For instance, if we were to develop a stock-trading application, we'd need the ability to recognize the various companies traded on the stock exchange. If application developers were to include this capability as part of...
|
|

More articles from XML Journal
jAllora by Hit Software.(Evaluation), May 01, 2002 Ipedo releases Database 3.0. (XML News).(Brief Article)(Product Announ..., May 01, 2002 New software module for real-time business visibility. (XML News).(Per..., May 01, 2002 OASIS Committee to define XML standard for biometrics. (XML News).(OAS..., May 01, 2002 IBM offers another first: production printer support for XML. (XML New..., May 01, 2002
Looking for additional articles?
Search our database of over 3 million articles.
Looking for more in-depth information on this industry?
Search our complete database of Industry & Market reports by text, subject, publication
name or publication date.
About Goliath
Whether you're looking for sales prospects, competitive information, company
analysis or best practices in managing your organization,
Goliath can help you meet your business needs.
Our extensive business information databases empower business
professionals with both the breadth and depth of credible,
authoritative information they need to support their business
goals. Whether it be strategic planning, sales prospecting,
company research or defining management best practices -
Goliath is your leading source for accurate information.
|
|