LSTM-based Encoder-Decoder Network: Difference between revisions

Revision as of 04:52, 23 September 2020

Context:
- It can be trained by an LSTM-based Encoder/Decoder RNN Training System.
Example(s):
Counter-Example(s):
- a GRU-based Encoder-Decoder RNN.
See: Neural seq2seq, Bidirectional LSTM.

(Robertson, 2017) ⇒ Sean Robertson. (2017). “Translation with a Sequence to Sequence Network and Attention.” In: TensorFlow Tutorials
- QUOTE: A basic sequence-to-sequence model, as introduced in Cho et al., 2014, consists of two recurrent neural networks (RNNs): an encoder that processes the input and a decoder that generates the output. This basic architecture is depicted below.
  
  Each box in the picture above represents a cell of the RNN, most commonly a GRU cell or an LSTM cell (see the RNN Tutorial for an explanation of those). Encoder and decoder can share weights or, as is more common, use a different set of parameters. Multi-layer cells have been successfully used in sequence-to-sequence models too, e.g. for translation (Sutskever et al., 2014).
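
The encoder/decoder split described in the quote can be sketched in a few dozen lines of plain Python. This is a minimal, untrained illustration and not the tutorial's implementation: the names `LSTMCell`, `encode`, `decode` and `W_out`, the one-hot token inputs, the random weights, and the fixed-length greedy decoding loop (no end-of-sequence token) are all assumptions made for the example. It shows one LSTM cell reading the input sequence, its final hidden and cell states being handed to a second LSTM cell with separate parameters, and that second cell generating output tokens one step at a time.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


class LSTMCell:
    """A single LSTM cell with its own randomly initialised parameters."""

    def __init__(self, input_size, hidden_size, rng):
        self.hidden_size = hidden_size
        # Four gates (input, forget, candidate, output), each acting on the
        # concatenation [x, h_prev]; biases start at zero.
        n_in = input_size + hidden_size
        self.W = [[rng.uniform(-0.1, 0.1) for _ in range(n_in)]
                  for _ in range(4 * hidden_size)]
        self.b = [0.0] * (4 * hidden_size)

    def step(self, x, h_prev, c_prev):
        z = x + h_prev  # list concatenation of input and previous hidden state
        pre = [sum(w * v for w, v in zip(row, z)) + b
               for row, b in zip(self.W, self.b)]
        n = self.hidden_size
        i = [sigmoid(p) for p in pre[0:n]]            # input gate
        f = [sigmoid(p) for p in pre[n:2 * n]]        # forget gate
        g = [math.tanh(p) for p in pre[2 * n:3 * n]]  # candidate cell values
        o = [sigmoid(p) for p in pre[3 * n:4 * n]]    # output gate
        c = [fk * ck + ik * gk for fk, ck, ik, gk in zip(f, c_prev, i, g)]
        h = [ok * math.tanh(ck) for ok, ck in zip(o, c)]
        return h, c


def one_hot(index, size):
    return [1.0 if k == index else 0.0 for k in range(size)]


def encode(cell, tokens, vocab_size):
    """Run the encoder over the input tokens and return its final (h, c)."""
    h = [0.0] * cell.hidden_size
    c = [0.0] * cell.hidden_size
    for t in tokens:
        h, c = cell.step(one_hot(t, vocab_size), h, c)
    return h, c


def decode(cell, W_out, state, start_token, vocab_size, max_len):
    """Greedily generate max_len tokens, starting from the encoder state."""
    h, c = state
    token = start_token
    output = []
    for _ in range(max_len):
        h, c = cell.step(one_hot(token, vocab_size), h, c)
        # Linear projection of the hidden state to one score per vocabulary item.
        scores = [sum(w * v for w, v in zip(row, h)) for row in W_out]
        token = max(range(vocab_size), key=lambda k: scores[k])
        output.append(token)
    return output


rng = random.Random(0)
VOCAB, HIDDEN = 6, 4  # hypothetical toy sizes
encoder = LSTMCell(VOCAB, HIDDEN, rng)
decoder = LSTMCell(VOCAB, HIDDEN, rng)  # separate parameters from the encoder
W_out = [[rng.uniform(-0.1, 0.1) for _ in range(HIDDEN)] for _ in range(VOCAB)]

state = encode(encoder, [1, 2, 3], VOCAB)
generated = decode(decoder, W_out, state, start_token=0,
                   vocab_size=VOCAB, max_len=5)
print(generated)
```

Because the weights are random, the generated token ids are meaningless; the point is only the data flow from encoder state to decoder. Swapping `LSTMCell` for a GRU cell would give the counter-example listed above, and passing the same cell object as both encoder and decoder would correspond to the weight-sharing variant the quote mentions.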

@@ Line 3: / Line 3: @@
 ** It can be trained by a [[LSTM-based Encoder/Decoder RNN Training System]].
 * <B>Example(s):</B>
-** a [[LSTM-based Encoder-Decoder Machine Translation Model]].
+** an [[LSTM-based Encoder-Decoder Machine Translation Model]].
-** a [[LSTM-based Encoder-Decoder Text Error Correction Model]].
+** an [[LSTM-based Encoder-Decoder Text Error Correction Model]], such as an [[LSTM-based Encoder-Decoder WikiText Error Correction Model]].
 ** an [[LSTM+Attention-based Encoder-Decoder Model]].
 ** ...