35 Commits

Author SHA1 Message Date
Reuben Morais
97941db3d8 Move .NET bindings to native_client folder 2019-04-11 13:09:45 -03:00
Igor Fritzsch
e8169160b6 Improved Nodejs streaming inference with VAD and FFmpeg 2019-03-18 14:32:33 +01:00
lissyx
481dd7d2c6
Merge pull request #1901 from jorxster/feature/examples_mic_vad_streaming_input_rate
Feature/resample examples mic vad streaming input rate
2019-03-18 11:32:03 +01:00
Carlos Fonseca M
eb94aadb76 Add .NET Framework targets 2019-03-14 21:45:36 -06:00
Alexandre Lissy
75149f33df Building on Windows
Fixes #1793
Fixes #1794
2019-03-12 22:21:01 +01:00
Carlos Fonseca M
5ff17dfa03 Remove unused configurations from CSharp projects
Fixes:
https://discourse.mozilla.org/t/compiling-for-windows/32939/65?u=carlfm01
2019-03-04 21:39:53 -06:00
Carlos Fonseca M
a20c17bc48 Change CSharp clients to use disposable interface 2019-03-04 21:21:03 -06:00
Carlos Fonseca M
3a3b8e33d2 Add CSharp interface docs 2019-03-04 21:18:02 -06:00
lissyx
dca8c40ae9
Merge pull request #1900 from dsteinman/nodejs-example
add simple nodejs example
2019-03-04 12:03:19 +01:00
Jordan Olafsen
77b2f8f3ec Address pull request feedback from @lissyx 2019-03-04 23:00:00 +13:00
Dan Steinman
7c3d56b8c0 Add a Nodejs wav file example 2019-03-03 18:58:27 -05:00
Carlos Fonseca M
21fbbc689b Remove unnecessary read key
The readkey is keeping DS process alive.
2019-02-28 13:09:28 -06:00
Reuben Morais
95290530e2 Add NuPKG spec files and folder structure 2019-02-28 11:55:45 -03:00
Carlos Fonseca M
f9f9e4643d Add Windows build doc 2019-02-23 00:01:50 -06:00
Jordan Olafsen
dedf2911da Add resampling to microphone stream if different from processing sample rate. Uses scipy.signal 2019-02-23 13:35:45 +13:00
Jordan Olafsen
56df4ebf03 Add Input Rate to examples/mic_vad_streaming.
Add -r for input device sample rate and -d for device index by PyAudio
2019-02-21 22:20:05 +13:00
Davud Kakaie
0ef26b450f Removed leftover comment 2019-01-30 22:51:16 +03:30
Davud Kakaie
7900ae5272 Process speech result as UTF-8 2019-01-30 22:30:27 +03:30
Reuben Morais
366a82d7e7 Update decoder parameter names in native client 2019-01-15 09:32:31 -02:00
Alexandre Lissy
3dc3651384 Bump VERSION to 0.4.1 2019-01-10 14:41:00 +01:00
Reuben Morais
ff2db35aea Bump version to 0.4.0 2019-01-03 12:21:16 -02:00
Reuben Morais
fa7cb1a983 Update decoder hyperparameters 2018-12-28 16:12:09 -02:00
Carlos Fonseca Murillo
fe2963158d
Merge branch 'master' into master 2018-12-15 17:45:59 +00:00
Carlos Fonseca M
33f5b640e4 Add CSharp client + examples 2018-12-14 14:44:46 -06:00
Mikhail Kuznetcov
f9a70a1b29
Update README.md
add note for MacOS users how to install portaudio
2018-12-08 19:46:03 +01:00
ifr
6f3c48c428
Create README.MD 2018-12-03 13:04:57 +01:00
ifr
52d262c367
Add package.json 2018-12-03 13:03:18 +01:00
ifr
f869862f85
Add index.js 2018-12-03 13:01:52 +01:00
lissyx
fd56082c04
Merge pull request #1718 from daanzu/pr-example
Add example for Python streaming from mic with VAD
2018-11-22 13:43:57 +01:00
Benjamin ABEL
cdd65d6d4b
Example Vad Add note about wav file format 2018-11-22 11:22:12 +01:00
daanzu
74cebb83b6 Add example for Python streaming from mic with VAD 2018-11-22 02:46:37 -05:00
b-ak
851fb4ea90 Adding streaming API Support to the GUI Tool
Changes:
1. Added streaming API support to the GUI tool
2. Minor modifications to how models are loaded upon repeated transcriptions
3. Updated to Deepspeech v0.3.0
4. Image in the documentation changed

Changes v2:
1. Added streaming support to cmd interface also
2018-11-17 00:00:16 +05:30
kdavis-mozilla
111262a3ec Fixed #1649 (Update README's for 0.3.0) 2018-10-15 16:52:21 +02:00
kdavis-mozilla
c38dc099d3 Fixed #1638 (Update Hyperparameters for 0.3.0) 2018-10-12 17:26:32 +02:00
bhargav-ak
89620bc448 Transcribing longer audio clips
Prerequisites
-------------
~/Deepspeech$ sudo apt install virtualenv
~/Deepspeech$ cd examples/vad_transcriber
~/Deepspeech/examples/vad_transcriber$ virtualenv -p python3 venv
~/Deepspeech/examples/vad_transcriber$ source venv/bin/activate
(venv) ~/Deepspeech/examples/vad_transcriber$ pip3 install -r requirements.txt

Command line tool
-----------------
The command line tool processes a wav file of any duration and returns a transcript,
which will be saved in the same directory as the input audio file.

(venv) ~/Deepspeech/examples/vad_transcriber
$ python3 audioTranscript_cmd.py --aggressive 1 --audio ./audio/guido-van-rossum.wav --model ./models/0.2.0/
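
The invocation above can be sketched as a small argparse front end. This is a minimal sketch, not the actual audioTranscript_cmd.py: the flag names come from the command line shown above, while the helper names (build_parser, transcript_path, main), the 0 to 3 aggressiveness range, the default of 1, and the .txt output name are assumptions made for illustration.

```python
import argparse
from pathlib import Path


def build_parser():
    """Build a parser for the three flags used in the invocation above."""
    parser = argparse.ArgumentParser(
        description="Transcribe a long wav file using VAD-based chunking."
    )
    # Assumption: aggressiveness is an integer from 0 to 3, defaulting to 1.
    parser.add_argument("--aggressive", type=int, choices=range(4), default=1,
                        help="VAD aggressiveness, 0 (least) to 3 (most)")
    parser.add_argument("--audio", required=True,
                        help="path to the input wav file")
    parser.add_argument("--model", required=True,
                        help="directory holding the model files")
    return parser


def transcript_path(audio_path):
    """Return a transcript path next to the input audio file.

    Hypothetical naming: same directory and stem as the audio, .txt suffix.
    """
    return Path(audio_path).with_suffix(".txt")


def main(argv=None):
    """Parse arguments and report where the transcript would be saved."""
    args = build_parser().parse_args(argv)
    out = transcript_path(args.audio)
    print(f"aggressiveness={args.aggressive} model={args.model} output={out}")
    return out
```

Calling main(["--audio", "./audio/guido-van-rossum.wav", "--model", "./models/0.2.0/"]) parses the flags and returns the transcript path; the VAD and inference steps are left out on purpose.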

Minimalistic GUI
----------------
The GUI tool does the same job as the CLI tool. The VAD is fixed at an aggressiveness of 1.
The output is displayed in the transcription window and also saved in the same directory as the input
audio file.

(venv) ~/Deepspeech/examples/vad_transcriber
$ python3 audioTranscript_gui.py

Changes(v1):
1. Using Deepspeech python module instead of subprocess
2. Moved VAD code to a module
3. Moved all files to bin/ and renamed README.md to Audio_Transcription.md

Changes(v2):
Renamed files

Changes (v2.1):
1. Refactoring between CMD and GUI code
2. Documenting pre-requisites with a virtualenv
3. Loading model only once per long wav file
4. CMD and GUI tool do the same job, perform VAD and consolidate the output.
5. Chunks are not saved to disk. A numpy integer array is used to store them.

Changes (v2.2):
1. Argparse module for command line arguments
2. Everything in virtualenv, with a requirements.txt
3. Older APIs aligned with 0.2.0 release
4. Moved all files into examples/vad_transcriber

Changes (v2.3):
1. Updated requirements.txt
2018-10-03 02:29:26 +05:30