Commit Graph

70 Commits

Author SHA1 Message Date
Kelly Davis
4765953b99 Fixed #412 2017-03-07 15:05:49 +01:00
Kelly Davis
8f93127298 Fixed #403 2017-02-24 10:36:23 +01:00
Alexandre Lissy
3838c0a9ce Upgrade to run on Tensorflow 1.0.0 2017-02-22 14:48:47 +01:00
Kelly Davis
e00655321e Addressed review comments 2017-02-22 14:39:07 +01:00
Kelly Davis
9e266f2fc3 Temp fix of #8 until tensorflow/tensorflow#6034 is fixed 2017-02-20 07:10:51 +01:00
Reuben Morais
8f341cf2b7 Merge pull request #383 from mozilla/issue340_unused_importers
Don't create DataSet objects for data sets we will not use
2017-02-14 08:42:47 -02:00
Reuben Morais
6903cd3f3c Don't create DataSet objects for data sets we will not use 2017-02-13 20:37:42 -02:00
Alexandre Lissy
2f63b2e89a Move some data to be stored on /data
Setting temp directory to be in /data, and DATA_DIR to /data,
defaulting to XDG_DATA_HOME.

Fixes #374
2017-02-13 14:25:58 +01:00
Reuben Morais
d6fb444287 Convert code comments to Sphinx RST docstrings 2017-02-02 23:42:36 -02:00
Kelly Davis
e912621100 Addressed review comments 2017-01-16 11:08:11 +01:00
Kelly Davis
0f8242ad39 Fixed #340 2017-01-16 09:45:45 +01:00
Andre Natal
3e7d3a725a Fixing reviewer comments 2017-01-03 12:17:02 -08:00
Andre
3cbb29ee31 deleted swb 2017-01-03 12:17:02 -08:00
Andre Natal
c92f986690 Switchboard importer 2017-01-03 12:17:02 -08:00
Andre Natal
501783501d Switchboard importer 2017-01-03 12:17:02 -08:00
Andre Natal
32a436309e Switchboard importer 2017-01-03 12:17:02 -08:00
Reuben Morais
bc8b046605 Merge pull request #84 from mozilla/issue3_fisher
Implement Fisher corpus importer
2016-12-30 09:29:36 -02:00
Reuben Morais
5fc818b87a Remove unnecessary file existence check and comment 2016-12-30 09:26:16 -02:00
Reuben Morais
5dc2e98c8f Add code to manually fix broken transcript 2016-12-23 13:12:59 -02:00
Reuben Morais
64724815e7 Remove unneeded check from _maybe_split_wav 2016-12-23 13:12:33 -02:00
Reuben Morais
3ef07ce7d0 Adapt Fisher importer to other importer API changes 2016-12-21 14:38:17 -02:00
Reuben Morais
33c9521a6f Add validation and cleanup function to util/text.py 2016-12-21 14:37:21 -02:00
Tilman Kamp
8e99eeffdd Fix #284; BiRNN with stride 2 2016-12-19 16:21:39 +01:00
Reuben Morais
d4dcd312ce Update Fisher importer to the new next_batch API 2016-12-19 11:47:24 -02:00
Reuben Morais
e7bbfbf703 Convert Fisher importer to new input system 2016-12-19 11:47:24 -02:00
Reuben Morais
a40df7251e Convert each channel individually before splitting wav files 2016-12-19 11:47:24 -02:00
Reuben Morais
d8aaffce62 Address review comments and do further filtering and cleanup on the transcription data 2016-12-19 11:47:24 -02:00
Reuben Morais
c7eaf9939b Implement Fisher corpus importer 2016-12-19 11:47:24 -02:00
Alexandre Lissy
6ec9cc91d5 Restore checkpoint automatically based on some rules
Fixes #250
2016-11-29 14:15:05 +01:00
Kelly Davis
06bb017c15 Fixed #239 2016-11-28 09:44:41 +01:00
Tilman Kamp
e83b49ee3e Fix #176; Training, validation and test each in separate sessions and own graphs 2016-11-24 18:07:50 +01:00
Kelly Davis
9d8a261a1d Revert "Merge pull request #192 from Cwiiis/validation-graph-duplication-rebased"
This reverts commit 08064e7120, reversing
changes made to 757fc74e1e.
2016-11-18 15:17:26 +01:00
Alexandre Lissy
58f0609afa Tracking GPU Usage
Fixes #178
2016-11-17 11:49:20 +01:00
Chris Lord
77100ed1df Don't duplicate graph to do validation 2016-11-16 15:37:42 +00:00
Kelly Davis
04af407011 Merge pull request #171 from mozilla/issue164
Fixed #164
2016-11-11 06:26:27 +01:00
Kelly Davis
8e90c84337 Modified to have a uniform importer API 2016-11-10 17:54:36 +01:00
Kelly Davis
763ce2b329 Fixed #164 2016-11-10 16:14:39 +01:00
Alexandre Lissy
b3f4f40869 Do not remove existing checkpoint dirs, update training parameters
This breaks restore feature. We should let code overwrite existing
checkpoints. This fixes #166.

Change training parameters, to address issue #169.
2016-11-10 15:54:09 +01:00
Kelly Davis
0707ad89d2 Merge branch 'master' into issue109_inputops 2016-11-09 12:26:18 +01:00
Tilman Kamp
fc94a2e532 Merge pull request #146 from mozilla/issue144
Fix #144; Ability to limit amount of samples in TEDLIUM
2016-11-09 08:30:21 +01:00
Reuben Morais
6fb628521d Normalize non-ASCII chars in the importers 2016-11-08 17:07:46 -02:00
Tilman Kamp
b1fc959f7a Fix #144; Ability to limit amount of samples in TEDLIUM 2016-11-08 16:27:36 +01:00
Reuben Morais
d989e8de09 Make sure the initializer passed to tf.scan doesn't break the API contract
We need to make sure the initializer shape matches the return value
of the callable passed to tf.scan.

This also adds an assertion on the shape of labels and the values
in label_lengths that enforces a condition that is needed for
ctc_label_dense_to_sparse to work.
2016-11-08 12:19:42 -02:00
Alexandre Lissy
fa0fc14439 Verify date of merges when pulling list of changes
Fixes #141
2016-11-08 14:11:34 +01:00
Reuben Morais
dba8f219f7 Only start the importers' queue threads after we initialize variables 2016-11-08 09:39:27 -02:00
Chris Lord
6178c31a20 Write a Tensorflow Serving client 2016-11-08 11:45:28 +01:00
Reuben Morais
c0bb34cfd3 Undo extraction of shared DataSets code from importers 2016-11-08 03:51:26 -02:00
Reuben Morais
182e20187a Switch importers to new input pipeline 2016-11-08 02:35:50 -02:00
Reuben Morais
ca98c5aab8 Expose text_to_char_array in util/text.py 2016-11-07 16:12:58 -02:00
Alexandre Lissy
34848dcda7 Switch dependency from xdg to pyxdg
Fixes #129
2016-11-07 15:27:01 +01:00