DeepSpeech

mirror of https://github.com/mozilla/DeepSpeech.git synced 2025-10-26 11:19:39 +00:00

Author	SHA1	Message	Date
Reuben Morais	97941db3d8	Move .NET bindings to native_client folder	2019-04-11 13:09:45 -03:00
Igor Fritzsch	e8169160b6	Improved Nodejs streaming inference with VAD and FFmpeg	2019-03-18 14:32:33 +01:00
lissyx	481dd7d2c6	Merge pull request #1901 from jorxster/feature/examples_mic_vad_streaming_input_rate Feature/resample examples mic vad streaming input rate	2019-03-18 11:32:03 +01:00
Carlos Fonseca M	eb94aadb76	Add NET Framework targets	2019-03-14 21:45:36 -06:00
Alexandre Lissy	75149f33df	Building on Windows Fixes #1793 Fixes #1794	2019-03-12 22:21:01 +01:00
Carlos Fonseca M	5ff17dfa03	Remove unused configurations from CSharp projects Fixes: https://discourse.mozilla.org/t/compiling-for-windows/32939/65?u=carlfm01	2019-03-04 21:39:53 -06:00
Carlos Fonseca M	a20c17bc48	Change CSharp clients to use disposable interface	2019-03-04 21:21:03 -06:00
Carlos Fonseca M	3a3b8e33d2	Add CSharp interface docs	2019-03-04 21:18:02 -06:00
lissyx	dca8c40ae9	Merge pull request #1900 from dsteinman/nodejs-example add simple nodejs example	2019-03-04 12:03:19 +01:00
Jordan Olafsen	77b2f8f3ec	Address pull request feedback from @lissyx	2019-03-04 23:00:00 +13:00
Dan Steinman	7c3d56b8c0	Add a Nodejs wav file example	2019-03-03 18:58:27 -05:00
Carlos Fonseca M	21fbbc689b	Remove unnecessary read key The readkey is keeping DS process alive.	2019-02-28 13:09:28 -06:00
Reuben Morais	95290530e2	Add NuPKG spec files and folder structure	2019-02-28 11:55:45 -03:00
Carlos Fonseca M	f9f9e4643d	Add Windows build doc	2019-02-23 00:01:50 -06:00
Jordan Olafsen	dedf2911da	Add resampling to microphone stream if different from processing sample rate. Uses scipy.signal	2019-02-23 13:35:45 +13:00
Jordan Olafsen	56df4ebf03	Add Input Rate to examples/mic_vad_streaming. Add -r for input device sample rate and -d for device index by PyAudio	2019-02-21 22:20:05 +13:00
Davud Kakaie	0ef26b450f	Removed leftover comment	2019-01-30 22:51:16 +03:30
Davud Kakaie	7900ae5272	Process speech result as UTF-8	2019-01-30 22:30:27 +03:30
Reuben Morais	366a82d7e7	Update decoder parameter names in native client	2019-01-15 09:32:31 -02:00
Alexandre Lissy	3dc3651384	Bump VERSION to 0.4.1	2019-01-10 14:41:00 +01:00
Reuben Morais	ff2db35aea	Bump version to 0.4.0	2019-01-03 12:21:16 -02:00
Reuben Morais	fa7cb1a983	Update decoder hyperparameters	2018-12-28 16:12:09 -02:00
Carlos Fonseca Murillo	fe2963158d	Merge branch 'master' into master	2018-12-15 17:45:59 +00:00
Carlos Fonseca M	33f5b640e4	Add CSharp client + examples	2018-12-14 14:44:46 -06:00
Mikhail Kuznetcov	f9a70a1b29	Update README.md add note for MacOS users how to install portaudio	2018-12-08 19:46:03 +01:00
ifr	6f3c48c428	Create README.MD	2018-12-03 13:04:57 +01:00
ifr	52d262c367	Add package.json	2018-12-03 13:03:18 +01:00
ifr	f869862f85	Add index.js	2018-12-03 13:01:52 +01:00
lissyx	fd56082c04	Merge pull request #1718 from daanzu/pr-example Add example for Python streaming from mic with VAD	2018-11-22 13:43:57 +01:00
Benjamin ABEL	cdd65d6d4b	Example Vad Add note about wav file format	2018-11-22 11:22:12 +01:00
daanzu	74cebb83b6	Add example for Python streaming from mic with VAD	2018-11-22 02:46:37 -05:00
b-ak	851fb4ea90	Adding streaming API Support to the GUI Tool Changes: 1. Added streaming API support to the GUI tool 2. Minor modifciations to how models are loaded upon repeated transcriptions 3. Updated to Deepspeech v0.3.0 4. Image in the documentation changed Changes v2: 1. Added streaming support to cmd interface also	2018-11-17 00:00:16 +05:30
kdavis-mozilla	111262a3ec	Fixed #1649 (Update README's for 0.3.0)	2018-10-15 16:52:21 +02:00
kdavis-mozilla	c38dc099d3	Fixed #1638 (Update Hyperparameters for 0.3.0)	2018-10-12 17:26:32 +02:00
bhargav-ak	89620bc448	Transcribing longer audio clips Prerequisites ------------- ~/Deepspeech$ sudo apt install virtualenv ~/Deepspeech$ cd examples/vad_transcriber ~/Deepspeech/examples/vad_transcriber$ virtualenv -p python3 venv ~/Deepspeech/examples/vad_transcriber$ source venv/bin/activate (venv) ~/Deepspeech/examples/vad_transcriber$ pip3 install -r requirements.txt Command line tool ----------------- The command line tool processes a wav file of any duration and returns a trancript which will the saved in the same directory as the input audio file. (venv) ~/Deepspeech/examples/vad_transcriber $ python3 audioTranscript_cmd.py --aggressive 1 --audio ./audio/guido-van-rossum.wav --model ./models/0.2.0/ Minimalistic GUI ---------------- The GUI tool does the same job as the CLI tool. The VAD is fixed at an aggressiveness of 1. The output is displayed in the transcription window and saved into the directory as the input audio file as well. (venv) ~/Deepspeech/examples/vad_transcriber $ python3 audioTranscript_gui.py Changes(v1): 1. Using Deepspeech python module instead of subprocess 2. Moved VAD code to a module 3. Moved all files to bin/ and renamed README.md to Audio_Transcription.md Changes(v2): Renamed files Changes (v2.1): 1. Refactoring between CMD and GUI code 2. Documenting pre-requisites with a virtualenv 3. Loading model only once per long wav file 4. CMD and GUI tool do the same job, perform VAD and consolidate the output. 5. Chunks are not saved in the disk. Using a numpy interger array to store them. Changes (v2.2): 1. Argparse module for command line arguments 2. Everything in virtualenv, with a requirements.txt 3. Older APIs aligned with 0.2.0 release 4. Moved all files into examples/vad_transcriber Changes (v2.3) 1. Updated requirements.txt	2018-10-03 02:29:26 +05:30

35 Commits