Commit Graph

12 Commits

Author SHA1 Message Date
b-ak
851fb4ea90 Adding streaming API Support to the GUI Tool
Changes:
1. Added streaming API support to the GUI tool
2. Minor modifciations to how models are loaded upon repeated transcriptions
3. Updated to Deepspeech v0.3.0
4. Image in the documentation changed

Changes v2:
1. Added streaming support to cmd interface also
2018-11-17 00:00:16 +05:30
bhargav-ak
89620bc448 Transcribing longer audio clips
Prerequisites
-------------
~/Deepspeech$ sudo apt install virtualenv
~/Deepspeech$ cd examples/vad_transcriber
~/Deepspeech/examples/vad_transcriber$ virtualenv -p python3 venv
~/Deepspeech/examples/vad_transcriber$ source venv/bin/activate
(venv) ~/Deepspeech/examples/vad_transcriber$ pip3 install -r requirements.txt

Command line tool
-----------------
The command line tool processes a wav file of any duration and returns a trancript
which will the saved in the same directory as the input audio file.

(venv) ~/Deepspeech/examples/vad_transcriber
$ python3 audioTranscript_cmd.py --aggressive 1 --audio ./audio/guido-van-rossum.wav --model ./models/0.2.0/

Minimalistic GUI
----------------
The GUI tool does the same job as the CLI tool. The VAD is fixed at an aggressiveness of 1.
The output is displayed in the transcription window and saved into the directory as the input
audio file as well.

(venv) ~/Deepspeech/examples/vad_transcriber
$ python3 audioTranscript_gui.py

Changes(v1):
1. Using Deepspeech python module instead of subprocess
2. Moved VAD code to a module
3. Moved all files to bin/ and renamed README.md to Audio_Transcription.md

Changes(v2):
Renamed files

Changes (v2.1):
1. Refactoring between CMD and GUI code
2. Documenting pre-requisites with a virtualenv
3. Loading model only once per long wav file
4. CMD and GUI tool do the same job, perform VAD and consolidate the output.
5. Chunks are not saved in the disk. Using a numpy interger array to store them.

Changes (v2.2):
1. Argparse module for command line arguments
2. Everything in virtualenv, with a requirements.txt
3. Older APIs aligned with 0.2.0 release
4. Moved all files into examples/vad_transcriber

Changes (v2.3)
1. Updated requirements.txt
2018-10-03 02:29:26 +05:30
Reuben Morais
498b66867e Switch docs theme from alabaster to classic 2017-08-29 21:33:16 +02:00
Graham Voysey
388d04beab updated requirements.txt and moved to project root 2017-03-22 17:06:43 -04:00
Pannous
d399fad9d5 ammended https://github.com/kpu/kenlm/archive/master.zip 2017-02-28 15:51:40 +01:00
Reuben Morais
dad0b53a3b Add missing math contexts in DeepSpeech.rst 2017-02-03 09:16:30 -02:00
Reuben Morais
ee50d82f8c Add requirements.txt so readthedocs.org can import our code 2017-02-02 23:42:36 -02:00
Reuben Morais
af8d3ed675 Add Sphinx configuration files, documentation index and makefiles 2017-02-02 23:42:36 -02:00
Reuben Morais
56eee9adaf Convert documentation to Sphinx RST 2017-02-02 23:42:36 -02:00
Reuben Morais
dc13d4be06 Remove readme2tex infrastructure 2017-02-02 23:42:25 -02:00
Tilman Kamp
fe1abe90cc Math support for markdown documents 2017-01-28 18:23:16 -02:00
Reuben Morais
88f0e3543b Add extended documentation to doc/ directory 2017-01-28 18:22:23 -02:00