DeepSpeech/data
Reuben Morais 83d22e591b
Merge pull request #2856 from reuben/training-install
Package training code to avoid sys.path hacks
2020-03-31 15:42:42 +02:00
..
lm Merge pull request #2856 from reuben/training-install 2020-03-31 15:42:42 +02:00
smoke_test Transfer-learning support 2020-02-17 08:29:10 +01:00
ted Merge of pull requests #49, #50, and #52. Fixes issues #2, #4, #11, #12, #46, #47, and #48 2016-10-13 15:15:39 -04:00
alphabet.txt Support custom alphabet mappings (Fixes #692) (#797) 2017-08-31 11:51:15 +02:00
README.rst Address review comments 2020-02-11 19:44:36 +01:00

Language-Specific Data
======================

This directory contains language-specific data files. Most importantly, you will find here:

1. A list of unique characters for the target language (e.g. English) in `data/alphabet.txt`

2. A scorer package (`data/lm/kenlm.scorer`) generated with `data/lm/generate_package.py`. The scorer package includes a binary n-gram language model generated with `data/lm/generate_lm.py`.

For more information on how to build these resources from scratch, see `data/lm/README.md`