DeepSpeech/data
lm Update trie files to renenerated versions 2019-06-21 23:24:21 -03:00
smoke_test Update trie files to renenerated versions 2019-06-21 23:24:21 -03:00
ted Merge of pull requests #49, #50, and #52. Fixes issues #2, #4, #11, #12, #46, #47, and #48 2016-10-13 15:15:39 -04:00
alphabet.txt Support custom alphabet mappings (Fixes #692) (#797) 2017-08-31 11:51:15 +02:00
README.md Added reference about generate_trie for clarity 2019-07-18 16:59:40 +05:30

Language-Specific Data

This directory contains language-specific data files. The most important ones are:

  1. A list of unique characters for the target language (e.g. English) in data/alphabet.txt
  2. A binary n-gram language model compiled by KenLM in data/lm/lm.binary
  3. A trie model compiled by generate_trie in data/lm/trie

For more information on how to build these resources from scratch, see data/lm/README.md
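
As a rough illustration of how the character list in data/alphabet.txt might be consumed, the sketch below reads such a file into an ordered list of labels and a character-to-index mapping. The file format is an assumption that this README does not spell out: one character per line, lines starting with '#' treated as comments, and '\#' standing for a literal '#'. The function names parse_alphabet, build_index, and load_alphabet are hypothetical helpers written for this example, not part of the project's API.

```python
from typing import Dict, Iterable, List


def parse_alphabet(lines: Iterable[str]) -> List[str]:
    """Return the characters listed in an alphabet file, in file order.

    Assumed format (not confirmed by this README): one character per line,
    lines starting with '#' are comments, and '\\#' is a literal '#'.
    """
    labels: List[str] = []
    for line in lines:
        # Drop only the line terminator so a line holding a single space survives.
        line = line.rstrip("\r\n")
        if line.startswith("\\#"):
            labels.append("#")      # escaped hash is a real label
        elif line.startswith("#"):
            continue                # comment line, not a label
        elif line:
            labels.append(line)     # skip completely empty lines
    return labels


def build_index(labels: List[str]) -> Dict[str, int]:
    """Map each character to its position in the alphabet."""
    return {ch: i for i, ch in enumerate(labels)}


def load_alphabet(path: str = "data/alphabet.txt") -> List[str]:
    """Hypothetical convenience wrapper that reads the file from disk."""
    with open(path, encoding="utf-8") as f:
        return parse_alphabet(f)
```

Calling build_index(load_alphabet()) from the repository root would then give the integer label for each character, under the format assumptions stated above.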