mirror of
https://github.com/mozilla/DeepSpeech.git
synced 2025-10-26 11:19:39 +00:00
| .. | ||
| lm | ||
| smoke_test | ||
| ted | ||
| alphabet.txt | ||
| README.rst | ||
Language-Specific Data ====================== This directory contains language-specific data files. Most importantly, you will find here: 1. A list of unique characters for the target language (e.g. English) in `data/alphabet.txt` 2. A scorer package (`data/lm/kenlm.scorer`) generated with `data/lm/generate_package.py`. The scorer package includes a binary n-gram language model generated with `data/lm/generate_lm.py`. For more information on how to build these resources from scratch, see `data/lm/README.md`