Commit Graph

87 Commits

Author SHA1 Message Date
Reuben Morais
cc2763e0b7 Add small bytes output mode scorer for tests 2020-10-06 18:19:34 +02:00
Alexandre Lissy
bf5ae9cf8a Fix #3299: Build KenLM on CI 2020-09-25 13:25:38 +02:00
Reuben Morais
da55cfae86 Revert "Merge pull request #3237 from lissyx/rename-training-package"
This reverts commit 3dcb3743ac, reversing
changes made to 457198c88d.
2020-08-26 11:46:08 +02:00
Reuben Morais
c62a604876 Revert "Merge pull request #3248 from lissyx/rtd-rename"
This reverts commit ce71910ab4, reversing
changes made to 7c6108a199.
2020-08-26 11:45:06 +02:00
Alexandre Lissy
fffc6ad455 Update name of readthedocs 2020-08-13 22:50:57 +02:00
Alexandre Lissy
6f84bd1996 Rename deepspeech_training package 2020-08-10 16:58:18 +02:00
Reuben Morais
2835151951 Remove external scorer file and documentation and flag references 2020-07-27 21:09:32 +02:00
Reuben Morais
65915c7f57 Address review comments 2020-07-02 14:09:42 +02:00
Reuben Morais
8f6106b35d Update docs to refer to new generate_scorer_package 2020-06-30 16:47:41 +02:00
Reuben Morais
f82c77392d Rewrite data/lm/generate_package.py into native_client/generate_scorer_package.cpp 2020-06-30 09:52:44 +02:00
Alexandre Lissy
6c2cbbd725 Fix #3053: Check output stream when producing scorer 2020-06-16 23:28:01 +02:00
Reuben Morais
99c34df368 Update dangling reference to removed scorer scripts docs
See #3016
2020-05-26 17:35:08 +02:00
Reuben Morais
4356a2764b Add data/lm doc to RTD, and some general doc improvements and fixes 2020-05-24 15:35:10 +02:00
Reuben Morais
a590e3726b Add link to RTD, actually exit on error 2020-05-18 19:04:33 +02:00
Reuben Morais
e8647aa5fa Add missing import in generate_package.py 2020-05-18 18:52:30 +02:00
Reuben Morais
361e216297 Only ignore (expected) missing trie error in generate_package.py 2020-05-18 17:30:49 +02:00
david gauchard
117324e665 Add a --discount_fallback option to generate_lm.py (#2945) 2020-04-28 11:58:41 +02:00
kdavis-mozilla
f3b4943e18 Updated alpha and beta 2020-04-23 07:19:37 +02:00
DanBmh
bfe778482c Refactor generate_package.py (#2903)
* Improve formatting and paths in LM README

* Improve logging in generate_package.py

Co-authored-by: Daniel <daniel@mail.de>
2020-04-17 21:20:45 +02:00
Reuben Morais
c27387fd98 README tweaks 2020-04-14 13:07:44 +02:00
Daniel
f82a77f249 Update readme. 2020-04-14 13:00:23 +02:00
Daniel
8c73bf6fbf Small fixes. 2020-04-09 16:58:22 +02:00
Daniel
00e4dbe3fd Merge remote-tracking branch 'upstream/master' 2020-04-08 20:27:43 +02:00
Daniel
c29c0beb72 Default to required params. 2020-04-08 20:23:04 +02:00
Daniel
a291e23041 Update readme. 2020-04-04 10:50:28 +02:00
Daniel
e16b72ff28 Use os.join and kenlm parameter usage description. 2020-04-03 17:58:52 +02:00
Daniel
e862cd41db Read from input.txt.gz again. 2020-04-01 17:29:02 +02:00
Daniel
38afe38f0b Implement some change request. 2020-04-01 17:15:52 +02:00
Daniel
b27e0347b1 Add more parameters.
Implement some change request.
2020-04-01 16:54:58 +02:00
Reuben Morais
83d22e591b Merge pull request #2856 from reuben/training-install
Package training code to avoid sys.path hacks
2020-03-31 15:42:42 +02:00
Reuben Morais
09673581a4 Error early in generate_package.py if no alphabet was specified and not using UTF-8 mode
X-DeepSpeech: NOBUILD
2020-03-30 18:16:56 +02:00
Daniel
f97c79e0e8 Make generate_lm.py language independent. 2020-03-29 12:29:18 +02:00
Reuben Morais
a05baa35c9 Package training code to avoid sys.path hacks 2020-03-25 21:34:50 +01:00
Daniel
f808720b5b Update readme. 2020-03-09 16:34:04 +01:00
Daniel
9c73700ac7 Add error hint and default values for alpha and beta. 2020-03-09 16:26:57 +01:00
Daniel
ef095881ca Fix too many arguments for format string. 2020-03-03 16:58:47 +01:00
Daniel
c6109c30f3 Add some statistics. 2020-03-03 16:49:52 +01:00
Daniel
c9a433486f Add more arguments. Rename file variables. 2020-03-03 16:48:43 +01:00
Daniel
c505a4ec6c Update some comments. 2020-02-27 17:46:16 +01:00
Daniel
15a75c77ff Rewrite generate_lm.py to allow usage with other languages. 2020-02-27 17:18:19 +01:00
JRMeyer
5bba9ea5d1 Transfer-learning support 2020-02-17 08:29:10 +01:00
Reuben Morais
8dedda7759 Address review comments 2020-02-11 19:44:36 +01:00
Reuben Morais
1d3b3a31a1 Address review comments and update docs 2020-02-11 19:44:36 +01:00
Reuben Morais
3b54f54524 Fix linter errors
X-DeepSpeech: NOBUILD
2020-02-11 19:44:36 +01:00
Reuben Morais
1e2eb96248 Update all API consumers 2020-02-11 19:44:36 +01:00
Reuben Morais
a156d28504 Switch smoke test scorer to new format 2020-02-11 19:44:36 +01:00
Reuben Morais
b34723588d Switch to new scorer format 2020-02-11 19:44:29 +01:00
Reuben Morais
16d5632d6f Write default values for alpha and beta into trie header 2020-02-11 19:44:26 +01:00
Reuben Morais
214b50f490 Add generate_package tool to create combined scorer package 2020-02-11 19:44:26 +01:00
Reuben Morais
7c0354483e Stop including vocabulary data in LM.binary. 2020-02-11 19:44:19 +01:00