Dr. Api
Thursday, 5 May 2016 at 5:44
Category: Spjaldið
"Surprisingly, the LSTM did not suffer on very long sentences, despite the recent experience of other researchers with related architectures. We were able to do well on long sentences because we reversed the order of words in the source sentence but not the target sentences in the training and test set. By doing so, we introduced many short term dependencies that made the optimization problem much simpler. As a result, SGD could learn LSTMs that had no trouble with long sentences. The simple trick of reversing the words in the source sentence is one of the key technical contributions of this work."
http://papers.nips.cc/paper/5346-sequence-to-sequence-learning-with-neural-networks.pdf
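For illustration, here is a minimal Python sketch of the reversal trick the quote describes: the source tokens are reversed during preprocessing while the target side is left untouched, so the first source words end up close to the first target words. The toy sentence pair and function name below are assumptions made for the example, not taken from the paper.

```python
# A minimal sketch of the source-reversal preprocessing described in the quote.
# The toy data is a hypothetical English-to-French pair; the paper itself uses
# WMT'14 English-to-French with an LSTM encoder-decoder.

def reverse_source(pairs):
    """Reverse the token order of each source sentence, leaving targets intact.

    This shortens the distance between early source words and early target
    words, introducing the short-term dependencies the quote mentions.
    """
    return [(list(reversed(src)), tgt) for src, tgt in pairs]


if __name__ == "__main__":
    pairs = [(["I", "am", "a", "student"], ["je", "suis", "étudiant"])]
    print(reverse_source(pairs))
    # [(['student', 'a', 'am', 'I'], ['je', 'suis', 'étudiant'])]
```

The target side is deliberately left in its original order, since only the encoder input is reversed in the trick the quote refers to.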

