
limit "t" and correct prev non blank so that task=search works #69


Open
mikel-zhobro wants to merge 1 commit into master from transducer_search_problem

Conversation

mikel-zhobro
Contributor

Now task=search should work as intended.

"am": {"class": "copy", "from": "am0" if search else "data:source"},

"prev_output_wo_b": {
"class": "masked_computation", "unit": {"class": "copy", "initial_output": 0},
"from": "prev:output_", "mask": "prev:output_emit", "initial_output": 0},
Member


I don't understand. Why is this needed? Especially in search, this should have no effect.

Contributor Author


prev:output_ doesn't guarantee non-blank labels during search. Both are sparse, but a blank label index messes up the embedding lookup that happens in slow_rnn.

I get something like this:

TensorFlow exception: indices[0] = 1056 is not in [0, 1056)
         [[node output/rec/slow_rnn/masked/input_embed/linear/embedding_lookup (defined at /Users/mikel/setups/rt4/returnn/returnn/tf/layers/basic.py:1468) ]]

Errors may have originated from an input operation.
Input Source operations connected to node output/rec/slow_rnn/masked/input_embed/linear/embedding_lookup:
 output/rec/slow_rnn/masked/input_embed/linear/py_print_1/Identity (defined at /Users/mikel/setups/rt4/returnn/returnn/tf/util/basic.py:6245)
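This class of error is easy to reproduce in isolation: an embedding lookup with an index equal to the table size is out of range. A minimal sketch (the table size 1056 mirrors the error above, everything else is made up; on CPU this raises tf.errors.InvalidArgumentError):

    import tensorflow as tf

    num_labels = 1056                          # table covers indices 0..1055
    table = tf.random.normal([num_labels, 8])

    ok = tf.nn.embedding_lookup(table, tf.constant([3, 17]))  # fine
    bad = tf.nn.embedding_lookup(table, tf.constant([1056]))  # indices[0] = 1056 is not in [0, 1056)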

Member


Hm, this is strange. Haven't we always used it like this in the other configs as well? Why did the problem never occur? Also, I have used exactly this config, and it did not occur for me. How can that be?

What TensorFlow version do you use?

Also, maybe we should fix MaskedComputationLayer instead? This can only happen for frames of slow_rnn which will actually not be used (due to the masking). It does not really matter what we calculate in those masked-out frames. We could simply fix the input for the masked-out frames (see the sketch below).
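To illustrate the idea (a sketch only, with hypothetical names; not the actual MaskedComputationLayer code): before the embedding lookup, indices in masked-out frames can be replaced by any in-range dummy value, since their outputs are discarded anyway.

    import tensorflow as tf

    def safe_ids_for_masked_frames(ids, mask, safe_id=0):
        """Replace label indices in masked-out frames with an in-range dummy value."""
        return tf.where(mask, ids, tf.fill(tf.shape(ids), safe_id))

    # A masked-out frame may now carry the blank index (== table size) without crashing:
    table = tf.random.normal([1056, 8])
    ids = tf.constant([3, 1056, 17])
    mask = tf.constant([True, False, True])
    emb = tf.nn.embedding_lookup(table, safe_ids_for_masked_frames(ids, mask))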

But first I want to understand better why this happens now and not before, and not for me.

Contributor Author
mikel-zhobro, Jun 1, 2021

> Haven't we always used it like this in the other configs as well?

It looks the same in the other configs; I don't get it either.

> What TensorFlow version do you use?

2.4.1

> This can only happen for frames of slow_rnn which will actually not be used (due to the masking)

Exactly.

> But first I want to understand better why this happens now and not before, and not for me.

Here are my logs.

mikel-zhobro force-pushed the transducer_search_problem branch from dcae584 to 0862712 on June 1, 2021 at 09:50