NAGOYA, Japan, Sept. 25 /PRNewswire-Asia/ -- The Speech Processing Laboratory of the Nagoya Institute of Technology has produced a prototype of "MMDAgent," a software toolkit for building voice interaction systems that run on PCs. A voice interaction system built with "MMDAgent" allows a PC user to have a lively chat with a digital 3D character shown on the display screen as if the character were a real person. "MMDAgent" combines several component technologies, including voice recognition, speech synthesis, 3D character display and voice interaction control. Its features include the following:
- "MMDAgent" represents the high-level integration of self-developed, state-of-the-art speech synthesis and voice recognition technologies. It combines the speech synthesis toolkit HTS (HMM-based Speech Synthesis System), which the laboratory has developed over a long period of time and made public, with the voice recognition engine Julius to deliver high-speed, accurate and expressive conversation capabilities.
- The toolkit has advanced 3D character-rendering capabilities based on OpenGL. It renders characters in true 3D using toon rendering and shadow mapping, and achieves realistic expressions with the use of a physics engine.
- The toolkit's voice interaction control component allows a user without expert knowledge to describe detailed and rich voice dialogue scenarios that respond to various internal states and external events, including voice input.
- "MMDAgent" is planned for release as open-source software. Because the formats of its models and other data adhere to open-source specifications, users can customize not only 3D character models, motions and voices but also entire dialogue scenarios, or use existing models and data.
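As a rough illustration only, a dialogue scenario of the kind handled by the voice interaction control part could be pictured as a table of state transitions, where each entry maps a current state and an incoming event (such as a recognized phrase or a timeout) to a next state and a spoken reply. The Python sketch below is an assumption made for explanation: the state names, event strings, replies and the functions `step` and `run` are hypothetical and are not MMDAgent's actual scenario format or API.

```python
from typing import Dict, List, Tuple

# Hypothetical scenario table: (current state, event) -> (next state, reply).
# All state names, event strings and replies are invented for illustration;
# this is not MMDAgent's real scenario format.
Scenario = Dict[Tuple[str, str], Tuple[str, str]]

SCENARIO: Scenario = {
    ("idle", "heard:hello"): ("greeted", "Hello! How can I help you?"),
    ("greeted", "heard:map"): ("idle", "Here is the campus map."),
    ("greeted", "timeout"): ("idle", "Please call me again if you need anything."),
}


def step(state: str, event: str, scenario: Scenario = SCENARIO) -> Tuple[str, str]:
    """Return (next_state, reply). Unknown events keep the state and give an empty reply."""
    return scenario.get((state, event), (state, ""))


def run(events: List[str], state: str = "idle") -> List[str]:
    """Feed events through the scenario in order and collect the non-empty replies."""
    replies: List[str] = []
    for event in events:
        state, reply = step(state, event)
        if reply:
            replies.append(reply)
    return replies
```

In this sketch, customizing a dialogue means editing the entries of the table rather than the code that interprets it, which is one way a user without expert knowledge could adjust a scenario.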
Examples of application:
- Digital signage
- Entertainment
- Reception desk guidance system
- Promotion
Reference images:
- Outer appearance of the system (the digital character is "Mei-chan," appearing on the screen of the Nagoya Institute of Technology's campus information guide system).
- "Miku Hatsune," a digital character created by Crypton Future Media, Inc., is shown on the execution screen.
(Logo: http://photos.prnewswire.com/prnh/20100924/NY69879LOGO )
(Logo: http://www.newscom.com/cgi-bin/prnh/20100924/NY69879LOGO )
For more information, please contact:
Speech Processing Laboratory
Nagoya Institute of Technology
Gokiso-chou, Shouwa-ku, Nagoya city, Japan
Email: uratec@sp.nitech.ac.jp
Phone: +81-052-735-7549