DeepSpeech/data
2019-11-15 13:28:45 +01:00
..
lm Switch to --prune 0 0 1 model and move generation code to a script 2019-11-15 13:28:45 +01:00
smoke_test UTF-8 target 2019-11-11 11:36:16 +01:00
ted Merge of pull requests #49, #50, and #52. Fixes issues #2, #4, #11, #12, #46, #47, and #48 2016-10-13 15:15:39 -04:00
alphabet.txt Support custom alphabet mappings (Fixes #692) (#797) 2017-08-31 11:51:15 +02:00
README.rst Move from Markdown to reStructuredText 2019-10-04 12:07:32 +02:00

Language-Specific Data
======================

This directory contains language-specific data files. Most importantly, you will find here:

1. A list of unique characters for the target language (e.g. English) in `data/alphabet.txt`

2. A binary n-gram language model compiled by `kenlm` in `data/lm/lm.binary`

3. A trie model compiled by `generate_trie <https://github.com/mozilla/DeepSpeech#using-the-command-line-client>`_ in `data/lm/trie`

For more information on how to build these resources from scratch, see `data/lm/README.md`