Speech Recognition

From RobotinoWiki
Revision as of 17:52, 8 February 2011 by Verbeek (talk | contribs) (Setting up Speech Recognition)

Speech Recognition on Robotino

Robotino can now comprehend basic human speech commands. Robotino uses the open source speech recognition engine called Julius.

Setting up Speech Recognition

Setting up speech recognition is done in two parts. The first part involves training of the speech data and creation of the acoustic model and the second involves the actual execution of the speech recognition engine with the created acoustic model.

Creating an Acoustic Model

The HTK toolkit (version 3.4) shall be used to create an acoustic model. Please follow the following steps.

Step 1

You will need to register with HTK before you can download it. Please do so.

Step 2

Download the sources for HTK toolkit 3.4 from here.

Step 3

  • Move to your home directory
 cd ~ 
  • Create a directory called bin
 mkdir bin 
  • Unpack the downloaded HTK toolkit sources in a folder called htk-3.4 in the bin directory. The bin directory should look as follows
 
$ ls bin 
htk-3.4 
  • If you have a newer version of the gcc compiler (version 4 or above), you will need to install gcc version 3.4 so that HTK will compile properly. Use the following gcc's version command to see which version is installed on your system
gcc -v
  • If your gcc version is 4 and above, follow the listed commands to install gcc 3.4
 
sudo apt-get install gcc-3.4 
sudo rm /usr/bin/gcc
sudo ln -s /usr/bin/gcc-3.4 /usr/bin/gcc