IBM WebSphere Voice Toolkit
V6.0
Readme
June 2005
(C) Copyright IBM Corporation 2001, 2005. All Rights Reserved.
Note to U.S. Government Users Restricted Rights - Use,
duplication or disclosure restricted by GSA ADP Schedule
Contract with IBM Corp.
Note: Before using this information and the product it supports, read the
general information under Notices
in this document.
IBM(R) WebSphere(R) Voice Toolkit is the new name of the previously released software entitled Voice Toolkit for WebSphere Studio. Also, for users familiar with WebSphere Voice Toolkit for WebSphere Studio software and documentation, the term call flow is now referred to as communication flow.
Contents
This document contains the following topics for the IBM WebSphere Voice Toolkit:
Introduction
This readme provides information on known problems and software limitations for the WebSphere Voice Toolkit.
For installation information, see the IBM WebSphere Voice Toolkit V6.0 Installation Readme, located in the <IBM_Rational_dir>\VoiceToolkit\Readme
directory, where <IBM_Rational_dir> is the full path to the IBM Rational(R)
Software Development Platform. By default, the path to readme files is as follows:
Program Files\IBM\Rational\SDP\6.0\VoiceToolkit\Readme
Getting Started
To launch the WebSphere Voice Toolkit, click Start > Programs > IBM Rational and select your base Rational product.
For example, if you have Rational Application Developer (RAD) installed, click IBM Rational Application Developer V6.0 > Rational Application Developer.
For information on using the toolkit, refer to the online Help.
From the Help menu, click Help Contents.
When the Help window appears, select Developing Voice applications from the list.
For additional information about the samples included with the toolkit, see
the sample readme files that are located in each of the sample directories
in <IBM_Rational_dir>\VoiceToolkit\Samples.
- Developing Voice applications
In addition to the toolkit help topics, this package contains related
publications in PDF format. These documents are available from the Help > Help Contents pane. For example, click Developing Voice applications and expand the Related documents folder to open the Developing Java Applications publication.
To view or print PDF files, use Adobe Reader, Version 4.0.5 or later. For
optimum viewing, use Adobe Reader, Version 6.0, available at http://www.adobe.com.
Click Get Adobe Reader and follow directions.
What's new
The following features and functions are new or updated in this release:
- IBM Rational Software Development Platform base support
- IBM WebSphere Voice Toolkit is now based on the IBM Rational Software Development Platform, which is installed with IBM Rational Application Developer (RAD) 6.0, IBM Rational Web Developer (RWD) 6.0, or IBM Rational Software Architect (RSA) 6.0.
The Rational product you choose is dependent on the application being developed. For example, if you develop applications for WebSphere Voice Server, the minimum product requirement is RWD. However, if you develop Voice Portlets using the WebSphere Voice Application Access Toolkit, the minimum requirement is RAD and its Portal Tools feature.
A free trial of Rational Application Developer V6.0 is available for download at:
http://www14.software.ibm.com/webapp/download/searchquery.jsp
- VoiceXML 2.1 support
- This allows you to create and test VXML 2.1-based applications.
- Multi-paged SRGS-XML Grammar Editor
- Includes a new graphical environment for creating command-and-control grammars and their associated rules.
- Communication Flow Builder
- Includes the following enhancements:
- A new, improved user interface.
- An import feature, which supports the import of VoiceXML files created by Call Flow Builder V5.0 or higher.
This allows you to recreate the graphic for a communication flow from a previously-generated XML file.
- Navigation of xml, vxml, or cfb files directly from the canvas.
- Added support for the use of Reusable Dialog Components (RDCs) in the application flow, as well as support for RDC development.
- A Prompt Configuration window, which allows you to create prompts based on a list of shorter, more manageable components.
- Test Grammar on MRCP
- Allows you to compare audio files (what is said) versus what is understood and provides a confidence score of what was recognized. This tool supports WebSphere Voice Server V5.1.
- Voice Trace Analyzer
- Displays information and statistics on the grammar recognition events that have been logged on a WebSphere Voice Server V5.1 server for analysis of recognition accuracy and tuning of the grammars and server settings.
- Prompt Manager
- Organizes the audio files in your project for easier generation of prompts, as well as exporting and importing of professionally recorded audio.
- SIP-based VoiceXML Simulator
- A VoiceXML interpreter and debugger that has SIP connectivity from a virtual "phone" in the toolkit, and uses MRCP to interface with the speech engines just like a real IVR.
Known problems and software limitations
General
- If after installation of the toolkit some functions do not appear or work as expected, please close your IBM Rational Software Development Platform and restart it using the
–clean
command line parameter. You can add this parameter by viewing the properties of the startup file and just adding –clean
to the executable (example: rationalsdp.exe –clean
). This increases the startup time slightly and needs to be done only once. After you have restarted you can remove the –clean
parameter so that it is not executed again.
NOTE: If you use this parameter more than once, the only thing that you notice is a longer start-up time for your Rational Software Development Platform.
- If you cannot start the IBM Voice Server, it might be related to a lengthy installation path. One way to verify this is to open a Command Prompt and go to the WAS test environment directory, such as <IBM_Rational_dir>\runtimes\base_v51\bin and issue this command:
startIBMVoiceServer.bat
If you get an error message saying the "Input line is too long" then it might be due to the
lengthy
path.
Workaround: It is recommended that the Rational Software Development Platform product be installed to a directory such as C:/rad6 or similar.
- If your program stops responding, close the Rational Software Development Platform and any javaw.exe processes. If this fails to resolve the problem, restart your computer.
- In some configurations, one or more javaw.exe processes might continue to run after you exit the toolkit, particularly after an abnormal termination of the program.
Workaround: Use the Windows Task Manager to locate and end these processes.
- The projects and files that you create using the
WebSphere Voice Toolkit are saved in the workspace used by the Rational Software Development Platform. If you uninstall the toolkit, the workspace remains on
your system so that the files are still available.
- Your microphone can pick up output from your computer speakers and
interpret it as words in a grammar. To avoid this, turn down the volume
on your speakers or use a headset.
- If you deploy your grammars and/or lexicon files to a web server for testing, you might find that updates to those files are not being loaded by the Voice Server test environment in the VoiceXML Simulator or the Test Grammar on MRCP tool. This occurs because the server honors the caching policies of the web server to avoid having to retrieve the same document repeatedly. To circumvent this, you should set the caching policies for the web server such that the files are not cached, or assign an immediate expiration in the caching properties of your application documents when deploying them to your web server.
- If you experience JavaScript related errors, it might be due to the incorrect version of the Rhino JavaScript Interpreter. The Rational Software Development Platform ships with the Rhino JavaScript Interpreter v1.5 release 2, however this toolkit package has been optimized to work with Rhino JavaScript Interpreter v1.5 release 3. It is recommended that this version, Rhino JavaScript Interpreter v1.5 release 3, be downloaded from http://www.mozilla.org/rhino/download.html using the download link rhino15R3.zip.
Important: Do not install any other version than v1.5 release 3, as this will cause error conditions with this product.
To install the Rhino JavaScript Intrepeter download the rhino15R3.zip, extract the file js.jar, and copy to the
following directories:
<IBM_Rational_dir>\eclipse\plugins\com.ibm.voicetools.es_6.0.0
<IBM_Rational_dir>\runtimes\base_v51\InstalledApps\localhost\WVX5.1-localhost.ear
- Files saved in the following directories are deleted when uninstalling the WebSphere Voice Toolkit.
If you create files in these directories, copy them to another folder before uninstalling the toolkit:
- <IBM_Rational_dir>\eclipse\features\com.ibm.wpstools_6.0.0
- <IBM_Rational_dir>\eclipse\logs
- <IBM_Rational_dir>\VoiceToolkit
- When you change the Workbench or Workbench > Appearance
preferences, the Restore Defaults button might not restore
the original defaults. Close and
restart the Rational Software Development Platform for changes to take effect.
- If you receive an "Internal Error - Widget Disposed" error
message, close and restart the Rational Software Development Platform to recover from the
error.
- Use only alphanumeric characters and spaces for
directory, folder, and file names. Names should not include
non-alphanumeric characters. For example:
~!@#$%^&*)
- To import or use files from a network resource, map a drive letter to that network
resource and import from the network drive.
- You might experience a problem running VoiceXML applications when
using personal firewall software on your computer. If you encounter an error
message, such as "Host manager did not initialize properly," disable your
firewall and run your VoiceXML applications again.
Language-specific considerations
- WebSphere Voice Toolkit 6.0 does not support Lexicons for AP Languages. This
is because a required feature, part-of-speech, is not supported by the
WebSphere Voice Server 5.1 MRCP server.
- If you choose to install non-IBM Text-to-Speech (TTS) languages for
use on your voice application server, such as WebSphere Voice Server,
you should consult the vendor's documentation for details on the vendor's
implementation of VoiceXML 2.1 and Speech Synthesis Markup Language
(SSML). In addition, if you use another vendor's languages, you should
also use that vendor's tools for generating pronunciations. If you use
IBM tools to produce SSML for TTS output, the resulting code might not
run as expected in a deployment environment for that language.
- The VoiceXML Programmer's Guide (pgmguide.pdf), packaged with
the toolkit contains incorrect information about the alphabets supported
in the phoneme tag for SSML. To view the guide, from the Help menu,
click Help Contents and select Developing Voice applications
. In the
Contents list, select Related documents. For updated information,
see the IBM Text-to-Speech SSML Programming Guide (tts_ssml.pdf),
also located under Related documents.
- Do not create a project name using DBCS characters. If you do, you
receive validation errors and XML files do not validate.
- When using the Unknown Pronunciation view, Cantonese words are flagged
as "unknown." For correct Cantonese functionality of the tools, your
Windows regional setting should be "Chinese-Taiwan", rather than "Chinese-HongKong."
If you change the voice language in Voice Tools preferences, close the Rational Software Development Platform for the changes to take effect.
Editors
- In the Navigator view, right click on the project, and on the pop-up
menu, click Properties.
- On the Properties dialog, select Validation. Also select the
Override validation preferences check box, and clear the SRGS-XML
Form Validator check box.
- Click Apply, and then click OK.
Note: This turns off the SRGS-XML editor validation, but the compiler 's
XML validation still occurs.
Multi-paged SRGS-XML Grammar Editor
Grammar Conversions
Communication Flow Builder
- Communication Flow Builder files created with previous versions of this application do not display correctly
because the appearance of the CFB has been updated. To view older communication flow builder files, you must import the
files into a new communication flow file.
To do so, follow these steps:
- Ensure that you have a copy of the .xml file for the old .cfb file.
- Create a new .cfb file and remove the objects in the canvas.
- Right mouse click and select Import....
- Import either the .cfb or .xml file.
- After the file is imported, reposition the communication flow in the canvas.
-
When using CTTS to generate a multi-sentence audio file, the audio file is created, but only the first sentence is used.Workaround: If you require a multi-sentence audio file, divide the prompt into multiple audio files—one for each sentence.
Reusable Dialog Component (RDC) release notes:
- If you receive a message stating that Windows cannot open the JSP file, click Window > Preferences > Workbench > Capabilities and clear the Web Developer (typical) check box. Next, generate the JSP file again.
- If you use the group.rdg file, do not change the strategy and submit default values.
- This Preview does not support a group within a group.
- By default, the order of the RDCs in a group is determined by the ID of the RDC objects. To order objects differently, you must drag in a Start object and then use Connect objects to achieve the desired order.
- In a Group Communication Flow Model, you can drag in only RDC files, not VoiceXML palette objects.
- When you launch the Tomcat server from the Workbench, an HTTP Status 500 error is displayed in the Web browser.
Workaround:
- Install J2SDK1.4.2, which is specified in the RUNNING.txt file in the Tomcat installation directory.
- Copy the tools.jar file from <J2SDK_install_dir>/lib to the <jakarta_tomcat_installation_dir>\common\lib.
- Restart the Tomcat server in the Workbench.
Grammar Test Tool
VoiceXML testing and debugging
- If the VoiceXML browser loads a document that specifies an
HTTP URI for its DTD, and the DTD cannot be fetched, (for example,
if you are disconnected from the network
or Internet), the VoiceXML application you are running or
debugging ends.
- When using the Run menu, if the Run As submenu
is missing use the Run toolbar icon to open the drop-down list.
- When debugging a VoiceXML application, double-clicking in the ruler
bar does not add a breakpoint. To add a breakpoint, right-click
in the ruler bar and select Add Breakpoint.
- SSML tags in your VoiceXML prompts should be separated from the surrounding text by white space, or the text may not be read correctly by the Text-to-Speech engine. For example, "this is<break/>a test" should be written as "this is <break/> a test."
WebSphere Voice Toolkit documentation
- If you have Adobe Acrobat(R) 6.0 installed on your system
and you open a PDF file in the toolkit's Help window, the Help
window might stop responding when you close the PDF file or
change to another topic. If this occurs, use the Task Manager
to end the Help application, and then reopen the Help window.
Notices
The following paragraph does not apply to the
United Kingdom or any country where such
provisions are inconsistent with local law:
INTERNATIONAL BUSINESS MACHINES CORPORATION
PROVIDES THIS PUBLICATION "AS IS" WITHOUT WARRANTY
OF ANY KIND, EITHER EXPRESS OR IMPLIED, INCLUDING,
BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF
NON-INFRINGEMENT, MERCHANTABILITY OR FITNESS FOR
A PARTICULAR PURPOSE. Some states do not allow
disclaimer of express or implied warranties in
certain transactions, therefore, this statement
may not apply to you.
This information could include technical
inaccuracies or typographical errors. Changes are
periodically made to the information herein; these
changes will be incorporated in new editions of
the publication. IBM may make improvements and\or
changes in the product(s) and\or the program(s)
described in this publication at any time without
notice.
Any references in this information to non-IBM Web
sites are provided for convenience only and do not
in any manner serve as an endorsement of those Web
sites. The materials at those Web sites are not
part of the materials for this IBM product and
use of those Web sites is at your own risk.
Information concerning non-IBM products was
obtained from the suppliers of those products,
their published announcements or other publicly
available sources. IBM has not tested those
products and cannot confirm the accuracy of
performance, compatibility or any other claims
related to non-IBM products. Questions on the
capabilities of non-IBM products should be
addressed to the suppliers of those products.
Trademarks
The following terms are trademarks or registered
trademarks of the International Business Machines
Corporation in the United States, other countries,
or both:
IBM
Rational
WebSphere
Intel and Pentium are trademarks or registered
trademarks of Intel Corporation in the United
States, other countries, or both.
Microsoft and Windows are trademarks of Microsoft
Corporation in the United States, other countries,
or both.
Java and all Java-based trademarks and logos are
trademarks or registered trademarks of Sun
Microsystems, Inc. in the United States, other
countries, or both.
Other company, product, and service names may be
trademarks or service marks of others.