Speeding Up Neural Machine Translation Decoding by Cube Pruning

Publication
EMNLP 2018