Ubuntu 18.04 and VOSK Speech Recognition API
Just some quick notes on how to install and use VOSK on Ubuntu 18.04 LTS.
- Install using:
pip3 install vosk
- Get samples from https://github.com/alphacep/vosk-api/tree/master/python/example
- Using ffmpeg, create input audio files:
ffmpeg -i video.mkv -c:a pcm_s16le -ac 1 output.wav
It will create a file of the following type:
output.wav: RIFF (little-endian) data, WAVE audio, Microsoft PCM, 16 bit, mono 48000 Hz
- Export the recognized text to JSON:
python3 ./test_simple.py output.wav
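For reference, here is a rough sketch of what a script along the lines of test_simple.py might do. It is not the upstream example itself. The helper names (check_wav, extract_text, transcribe) and the "model" directory are assumptions made for illustration, and the sketch assumes a VOSK model has already been downloaded and unpacked into that directory. It first checks that the file is mono 16-bit PCM (what the ffmpeg command above produces), then feeds the audio to the recognizer in chunks and collects the "text" field from each JSON result.

```python
import json
import wave


def check_wav(path):
    """Return the sample rate if the file is mono 16-bit PCM, else raise ValueError."""
    with wave.open(path, "rb") as wf:
        if (wf.getnchannels() != 1
                or wf.getsampwidth() != 2
                or wf.getcomptype() != "NONE"):
            raise ValueError("audio file must be WAV format, mono, 16-bit PCM")
        return wf.getframerate()


def extract_text(results):
    """Join the "text" field of each JSON result string, skipping empty ones."""
    parts = [json.loads(r).get("text", "") for r in results]
    return " ".join(p for p in parts if p)


def transcribe(path, model_dir="model"):
    """Transcribe a WAV file with VOSK.

    model_dir is a hypothetical location; point it at an unpacked VOSK model.
    """
    # Imported here so the helpers above work without vosk installed.
    from vosk import Model, KaldiRecognizer

    rate = check_wav(path)
    rec = KaldiRecognizer(Model(model_dir), rate)

    results = []
    with wave.open(path, "rb") as wf:
        while True:
            data = wf.readframes(4000)
            if not data:
                break
            # AcceptWaveform returns True when an utterance is complete.
            if rec.AcceptWaveform(data):
                results.append(rec.Result())
    # Flush whatever is still buffered in the recognizer.
    results.append(rec.FinalResult())
    return extract_text(results)
```

With a model in place, something like print(transcribe("output.wav")) should print the recognized text as one plain string instead of a sequence of JSON fragments.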