Bytefreaks.net – a place for hacks

Ubuntu 18.04 and VOSK Speech Recognition API

17 June 2020 in GNU/Linux tagged ubuntu / VOSK by Tux

Just some quick notes on how to install and use VOSK on Ubuntu 18.04 LTS.

Install using: pip3 install vosk
Get samples from https://github.com/alphacep/vosk-api/tree/master/python/example
Using ffmpeg create input audio files ffmpeg -i video.mkv -c:a pcm_s16le -ac 1 output.wav
It will create a file of the following type output.wav: RIFF (little-endian) data, WAVE audio, Microsoft PCM, 16 bit, mono 48000 Hz
Export text to JSON python3 ./test_simple.py output.wav

This post is also available in: Greek

Leave a ReplyCancel reply

This site uses Akismet to reduce spam. Learn how your comment data is processed.