DeepSpeech

mirror of https://github.com/mozilla/DeepSpeech.git synced 2025-10-26 11:19:39 +00:00

Author	SHA1	Message	Date
b-ak	851fb4ea90	Adding streaming API Support to the GUI Tool Changes: 1. Added streaming API support to the GUI tool 2. Minor modifications to how models are loaded upon repeated transcriptions 3. Updated to Deepspeech v0.3.0 4. Image in the documentation changed Changes v2: 1. Added streaming support to cmd interface also	2018-11-17 00:00:16 +05:30
bhargav-ak	89620bc448	Transcribing longer audio clips Prerequisites ------------- ~/Deepspeech$ sudo apt install virtualenv ~/Deepspeech$ cd examples/vad_transcriber ~/Deepspeech/examples/vad_transcriber$ virtualenv -p python3 venv ~/Deepspeech/examples/vad_transcriber$ source venv/bin/activate (venv) ~/Deepspeech/examples/vad_transcriber$ pip3 install -r requirements.txt Command line tool ----------------- The command line tool processes a wav file of any duration and returns a transcript which will be saved in the same directory as the input audio file. (venv) ~/Deepspeech/examples/vad_transcriber $ python3 audioTranscript_cmd.py --aggressive 1 --audio ./audio/guido-van-rossum.wav --model ./models/0.2.0/ Minimalistic GUI ---------------- The GUI tool does the same job as the CLI tool. The VAD is fixed at an aggressiveness of 1. The output is displayed in the transcription window and saved into the same directory as the input audio file as well. (venv) ~/Deepspeech/examples/vad_transcriber $ python3 audioTranscript_gui.py Changes(v1): 1. Using Deepspeech python module instead of subprocess 2. Moved VAD code to a module 3. Moved all files to bin/ and renamed README.md to Audio_Transcription.md Changes(v2): Renamed files Changes (v2.1): 1. Refactoring between CMD and GUI code 2. Documenting pre-requisites with a virtualenv 3. Loading model only once per long wav file 4. CMD and GUI tool do the same job, perform VAD and consolidate the output. 5. Chunks are not saved on disk. Using a numpy integer array to store them. Changes (v2.2): 1. Argparse module for command line arguments 2. Everything in virtualenv, with a requirements.txt 3. Older APIs aligned with 0.2.0 release 4. Moved all files into examples/vad_transcriber Changes (v2.3) 1. Updated requirements.txt	2018-10-03 02:29:26 +05:30
Reuben Morais	498b66867e	Switch docs theme from alabaster to classic	2017-08-29 21:33:16 +02:00
Graham Voysey	388d04beab	updated requirements.txt and moved to project root	2017-03-22 17:06:43 -04:00
Pannous	d399fad9d5	amended https://github.com/kpu/kenlm/archive/master.zip	2017-02-28 15:51:40 +01:00
Reuben Morais	dad0b53a3b	Add missing math contexts in DeepSpeech.rst	2017-02-03 09:16:30 -02:00
Reuben Morais	ee50d82f8c	Add requirements.txt so readthedocs.org can import our code	2017-02-02 23:42:36 -02:00
Reuben Morais	af8d3ed675	Add Sphinx configuration files, documentation index and makefiles	2017-02-02 23:42:36 -02:00
Reuben Morais	56eee9adaf	Convert documentation to Sphinx RST	2017-02-02 23:42:36 -02:00
Reuben Morais	dc13d4be06	Remove readme2tex infrastructure	2017-02-02 23:42:25 -02:00
Tilman Kamp	fe1abe90cc	Math support for markdown documents	2017-01-28 18:23:16 -02:00
Reuben Morais	88f0e3543b	Add extended documentation to doc/ directory	2017-01-28 18:22:23 -02:00

12 Commits