DeepSpeech

mirror of https://github.com/mozilla/DeepSpeech.git synced 2025-10-26 11:19:39 +00:00

DeepSpeech is an open-source, embedded (offline, on-device) speech-to-text engine that can run in real time on devices ranging from a Raspberry Pi 4 to high-power GPU servers.

deep-learning deepspeech embedded machine-learning neural-networks offline on-device speech-recognition speech-to-text tensorflow

Reuben Morais b85ad3ea74 Refactor train.py into separate scripts Currently train.py is overloaded with many independent features. Understanding the code and what will be the result of a training call requires untangling the entire script. It's also an error prone UX. This is a first step at separating independent parts into their own scripts.		2020-12-23 13:06:23 +00:00
.circleci	Move linting job to CircleCI	2020-12-08 13:21:05 +02:00
.github	Add lock bot config	2018-12-28 19:37:01 -02:00
bin	Remote training I/O once more (#3437)	2020-12-07 13:07:34 +01:00
data	Add small bytes output mode scorer for tests	2020-10-06 18:19:34 +02:00
doc	Bump version to v0.10.0-alpha.0	2020-12-18 23:29:54 +00:00
examples	Revert "Merge branch 'rename-real'"	2020-08-26 11:46:09 +02:00
images	Updating Geometry	2019-12-02 11:04:27 +01:00
kenlm@689a25aae9	Fix #3299: Build KenLM on CI	2020-09-25 13:25:38 +02:00
native_client	Bump version to v0.10.0-alpha.0	2020-12-18 23:29:54 +00:00
taskcluster	Use payload.env instead of forwarding variables manually	2020-12-18 17:00:00 +00:00
tensorflow@23ad988fcd	Updating commit of submodule	2020-08-25 13:21:12 +02:00
tests	Revert "Merge pull request #3237 from lissyx/rename-training-package"	2020-08-26 11:46:08 +02:00
training/deepspeech_training	Refactor train.py into separate scripts	2020-12-23 13:06:23 +00:00
util	Revert "Merge pull request #3237 from lissyx/rename-training-package"	2020-08-26 11:46:08 +02:00
.cardboardlint.yml	Update cardboardlint configuration	2019-10-04 13:56:41 +02:00
.compute	Fix .compute for packaged training code	2020-04-01 16:24:36 +02:00
.gitattributes	Address review comments and update docs	2020-02-11 19:44:36 +01:00
.gitignore	Enable static build of DeepSpeech iOS framework	2020-12-04 15:19:49 +02:00
.gitmodules	Fix #3299: Build KenLM on CI	2020-09-25 13:25:38 +02:00
.isort.cfg	Sort importer imports with isort	2020-03-31 13:43:00 +02:00
.pylintrc	Fix linter errors	2020-02-11 19:44:36 +01:00
.readthedocs.yml	Re-enable readthedocs.io	2019-09-24 10:55:26 +02:00
.taskcluster.yml	Separate branch and tag	2020-12-19 09:23:32 +00:00
bazel.patch	Proper re-use of Bazel cache	2018-01-31 18:50:36 +01:00
BIBLIOGRAPHY.md	Branding cleanup	2020-12-07 10:07:39 +02:00
build-python-wheel.yml-DISABLED_ENABLE_ME_TO_REBUILD_DURING_PR	Move to ARMbian Buster	2019-08-21 22:58:10 +02:00
CODE_OF_CONDUCT.md	Add Mozilla Code of Conduct file	2019-03-29 14:58:39 -07:00
CODE_OWNERS.rst	Add paragraph on expected behavior from module owners	2020-12-17 08:59:36 +00:00
CONTRIBUTING.rst	Add listing of code owners/reviewers and reference from contribution guidelines	2020-12-04 15:17:09 +02:00
DeepSpeech.py	Revert "Merge pull request #3237 from lissyx/rename-training-package"	2020-08-26 11:46:08 +02:00
Dockerfile.build.tmpl	Fix #3321: Update NCCL dep to 2.7 following NVIDIA update	2020-09-15 13:40:17 +02:00
Dockerfile.train.tmpl	Use correct 1.15.4 docker image	2020-09-28 12:43:12 +02:00
ds_generic.supp	Fix #3355: Add valgrind runs	2020-10-01 15:31:21 +02:00
ds_lib.supp	Fix #3355: Add valgrind runs	2020-10-01 15:31:21 +02:00
ds_openfst.supp	Fix #3355: Add valgrind runs	2020-10-01 15:31:21 +02:00
ds_sox.supp	Fix #3355: Add valgrind runs	2020-10-01 15:31:21 +02:00
evaluate_tflite.py	Revert "Merge branch 'rename-real'"	2020-08-26 11:46:09 +02:00
evaluate.py	Revert "Merge pull request #3237 from lissyx/rename-training-package"	2020-08-26 11:46:08 +02:00
GRAPH_VERSION	Revert "Merge pull request #3237 from lissyx/rename-training-package"	2020-08-26 11:46:08 +02:00
ISSUE_TEMPLATE.md	Revert "Merge pull request #3243 from lissyx/rename-stt-master"	2020-08-26 11:45:06 +02:00
LICENSE	Added LICENSE	2016-09-20 19:12:29 +02:00
lm_optimizer.py	Revert "Merge pull request #3237 from lissyx/rename-training-package"	2020-08-26 11:46:08 +02:00
Makefile	Revert "Merge pull request #3239 from lissyx/rename-circleci"	2020-08-26 11:46:08 +02:00
parse_valgrind_suppressions.sh	Fix #3355: Add valgrind runs	2020-10-01 15:31:21 +02:00
README.rst	Use HTTPS in README.md	2020-10-27 11:04:32 +01:00
RELEASE.rst	Move from Markdown to reStructuredText	2019-10-04 12:07:32 +02:00
requirements_eval_tflite.txt	Revert "Merge branch 'rename-real'"	2020-08-26 11:46:09 +02:00
requirements_tests.txt	Converting importers from multiprocessing.dummy to multiprocessing	2020-03-18 11:04:36 +01:00
requirements_transcribe.txt	remove bad reference to requirements.txt	2020-05-11 18:27:49 -07:00
setup.py	Branding cleanup	2020-12-07 10:07:39 +02:00
stats.py	Revert "Merge pull request #3237 from lissyx/rename-training-package"	2020-08-26 11:46:08 +02:00
SUPPORT.rst	Revert "Merge pull request #3243 from lissyx/rename-stt-master"	2020-08-26 11:45:06 +02:00
tensorflow_full_runtime.supp	Fix #3355: Add valgrind runs	2020-10-01 15:31:21 +02:00
tensorflow_tflite_runtime.supp	Fix #3355: Add valgrind runs	2020-10-01 15:31:21 +02:00
transcribe.py	Revert "Merge pull request #3237 from lissyx/rename-training-package"	2020-08-26 11:46:08 +02:00
VERSION	Revert "Merge pull request #3237 from lissyx/rename-training-package"	2020-08-26 11:46:08 +02:00

README.rst

Project DeepSpeech
==================


.. image:: https://readthedocs.org/projects/deepspeech/badge/?version=latest
   :target: https://deepspeech.readthedocs.io/?badge=latest
   :alt: Documentation


.. image:: https://community-tc.services.mozilla.com/api/github/v1/repository/mozilla/DeepSpeech/master/badge.svg
   :target: https://community-tc.services.mozilla.com/api/github/v1/repository/mozilla/DeepSpeech/master/latest
   :alt: Task Status


DeepSpeech is an open-source Speech-To-Text engine, using a model trained by machine learning techniques based on `Baidu's Deep Speech research paper <https://arxiv.org/abs/1412.5567>`_. Project DeepSpeech uses Google's `TensorFlow <https://www.tensorflow.org/>`_ to make the implementation easier.

Documentation for installation, usage, and training models is available on `deepspeech.readthedocs.io <https://deepspeech.readthedocs.io/?badge=latest>`_.

For pre-trained models and checkpoints, `see the latest release on GitHub <https://github.com/mozilla/DeepSpeech/releases/latest>`_.
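
As a rough illustration of how a pre-trained model might be used from Python, here is a minimal sketch, not the project's reference client. It assumes the ``deepspeech`` and ``numpy`` packages are installed and that the input is 16-bit mono PCM audio at the model's sample rate. The helper names ``read_pcm16`` and ``transcribe`` and all file paths are hypothetical; the documentation linked above remains the authoritative source for the API.

```python
import wave


def read_pcm16(wav_path):
    """Return (raw 16-bit PCM bytes, sample rate) from a mono WAV file.

    Hypothetical helper for this sketch; it is not part of DeepSpeech.
    """
    with wave.open(wav_path, "rb") as wav:
        if wav.getsampwidth() != 2 or wav.getnchannels() != 1:
            raise ValueError("expected 16-bit mono PCM audio")
        return wav.readframes(wav.getnframes()), wav.getframerate()


def transcribe(model_path, wav_path, scorer_path=None):
    """Transcribe a WAV file with a pre-trained model (hypothetical wrapper)."""
    # Imported lazily so read_pcm16 stays usable without third-party packages.
    import numpy as np
    from deepspeech import Model

    model = Model(model_path)
    if scorer_path is not None:
        # The external scorer (language model) is optional.
        model.enableExternalScorer(scorer_path)

    pcm, rate = read_pcm16(wav_path)
    if rate != model.sampleRate():
        # This sketch does not resample; it rejects mismatched audio instead.
        raise ValueError(f"audio is {rate} Hz, model expects {model.sampleRate()} Hz")

    # The model consumes a one-dimensional array of 16-bit samples.
    return model.stt(np.frombuffer(pcm, dtype=np.int16))
```

A call such as ``transcribe("model.pbmm", "audio.wav", "model.scorer")`` would then return the recognized text as a string; the three file names here are placeholders for files downloaded from a release.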

For contribution guidelines, see `CONTRIBUTING.rst <CONTRIBUTING.rst>`_.

For contact and support information, see `SUPPORT.rst <SUPPORT.rst>`_.