Город МОСКОВСКИЙ
00:12:23

VOSK Offline Speech Recognition, Speech To Text for Linux Android iOS Mac OSX Windows

Аватар
JS Вдохновляющий код
Просмотры:
43
Дата загрузки:
29.11.2023 11:40
Длительность:
00:12:23
Категория:
Технологии и интернет

Описание

https://alphacephei.com/vosk/
https://alphacephei.com/vosk/models
https://medium.com/analytics-vidhya/offline-speech-recognition-made-easy-with-vosk-c61f7b720215
https://github.com/alphacep/vosk-api
https://kdenlive.org/en/2021/04/kdenlive-21-04-released/

Vosk is a speech recognition toolkit. The best things in Vosk are:
Supports 17 languages and dialects - English, Indian English, German, French, Spanish, Portuguese, Chinese, Russian, Turkish, Vietnamese, Italian, Dutch, Catalan, Arabic, Greek, Farsi, Filipino. More to come.
Works offline, even on lightweight devices - Raspberry Pi, Android, iOS
Installs with simple pip3 install vosk
Portable per-language models are only 50Mb each, but there are much bigger server models available.
Provides streaming API for the best user experience (unlike popular speech-recognition python packages)
There are bindings for different programming languages, too - java/csharp/javascript etc.
Allows quick reconfiguration of vocabulary for best accuracy.
Supports speaker identification beside simple speech recognition.

Рекомендуемые видео