LSTM-based Encoder-Decoder Network
An LSTM-based Encoder-Decoder Network is an RNN-based encoder-decoder model composed of LSTM models (an LSTM encoder and an LSTM decoder).
- Context:
  - It can be trained by an LSTM-based Encoder/Decoder RNN Training System.
- Example(s):
  - an LSTM-based Encoder-Decoder Machine Translation Model.
  - an LSTM-based Encoder-Decoder Text Error Correction Model, such as an LSTM-based Encoder-Decoder WikiText Error Correction Model.
  - an LSTM+Attention-based Encoder-Decoder Model.
  - …
- Counter-Example(s):
  - a GRU-based Encoder-Decoder RNN.
- See: Neural seq2seq, Bidirectional LSTM.
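A minimal sketch of such a network (illustrative only, not drawn from the cited sources; the class name, vocabulary sizes, and dimensions below are hypothetical) can be written in PyTorch as an LSTM encoder whose final hidden/cell state initializes an LSTM decoder, read out through a linear layer over the target vocabulary:

```python
import torch
import torch.nn as nn

class LSTMEncoderDecoder(nn.Module):
    """Toy LSTM encoder-decoder; all sizes are illustrative placeholders."""

    def __init__(self, src_vocab, tgt_vocab, emb_dim=64, hid_dim=128, num_layers=1):
        super().__init__()
        self.src_embed = nn.Embedding(src_vocab, emb_dim)
        self.tgt_embed = nn.Embedding(tgt_vocab, emb_dim)
        # Separate parameter sets for the LSTM encoder and the LSTM decoder.
        self.encoder = nn.LSTM(emb_dim, hid_dim, num_layers, batch_first=True)
        self.decoder = nn.LSTM(emb_dim, hid_dim, num_layers, batch_first=True)
        self.out = nn.Linear(hid_dim, tgt_vocab)

    def forward(self, src_ids, tgt_ids):
        # Encode the source sequence; keep only the final (hidden, cell) state.
        _, state = self.encoder(self.src_embed(src_ids))
        # Teacher-forced decoding: gold target tokens are the decoder inputs,
        # and the encoder's final state initializes the decoder's state.
        dec_out, _ = self.decoder(self.tgt_embed(tgt_ids), state)
        return self.out(dec_out)   # per-step logits over the target vocabulary

# Usage with random token ids: batch of 4, source length 7, target length 5.
model = LSTMEncoderDecoder(src_vocab=1000, tgt_vocab=1200)
src = torch.randint(0, 1000, (4, 7))
tgt = torch.randint(0, 1200, (4, 5))
logits = model(src, tgt)           # shape: (4, 5, 1200)
```

In this sketch the decoder receives the gold target tokens as inputs (teacher forcing); at inference time it would instead be fed its own previous predictions.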
References
2018
- (Brownlee, 2018) ⇒ Jason Brownlee. (2018). “Encoder-Decoder Recurrent Neural Network Models for Neural Machine Translation.” Blog Post.
- QUOTE: After reading this post, you will know:
  - The encoder-decoder recurrent neural network architecture is the core technology inside Google’s translate service.
  - The so-called “Sutskever model” for direct end-to-end machine translation.
  - The so-called “Cho model” that extends the architecture with GRU units and an attention mechanism.
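The “direct end-to-end” training mentioned above can be illustrated with a single teacher-forced optimization step in which the embedding, encoder LSTM, decoder LSTM, and output projection are updated jointly from a source/target batch. This is a hedged sketch with made-up sizes and random data, not code from the post or from Sutskever et al.:

```python
import torch
import torch.nn as nn

VOCAB, EMB, HID = 1000, 64, 128             # illustrative sizes only
embed = nn.Embedding(VOCAB, EMB)            # shared embedding, for brevity
encoder = nn.LSTM(EMB, HID, batch_first=True)
decoder = nn.LSTM(EMB, HID, batch_first=True)
proj = nn.Linear(HID, VOCAB)

params = (list(embed.parameters()) + list(encoder.parameters())
          + list(decoder.parameters()) + list(proj.parameters()))
optimizer = torch.optim.Adam(params, lr=1e-3)
loss_fn = nn.CrossEntropyLoss()

src = torch.randint(0, VOCAB, (8, 12))      # fake source batch
tgt = torch.randint(0, VOCAB, (8, 10))      # fake target batch (BOS/EOS assumed)

# One end-to-end step: encode, decode with teacher forcing, backpropagate through both.
_, state = encoder(embed(src))
dec_out, _ = decoder(embed(tgt[:, :-1]), state)   # inputs are gold tokens shifted right
loss = loss_fn(proj(dec_out).reshape(-1, VOCAB), tgt[:, 1:].reshape(-1))
optimizer.zero_grad()
loss.backward()
optimizer.step()
```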
2017
- (Robertson, 2017) ⇒ Sean Robertson. (2017). “Translation with a Sequence to Sequence Network and Attention.” In: TensorFlow Tutorials
- QUOTE: A basic sequence-to-sequence model, as introduced in Cho et al., 2014, consists of two recurrent neural networks (RNNs): an encoder that processes the input and a decoder that generates the output. This basic architecture is depicted below. ...
  ... Each box in the picture above represents a cell of the RNN, most commonly a GRU cell or an LSTM cell (see the RNN Tutorial for an explanation of those). Encoder and decoder can share weights or, as is more common, use a different set of parameters. Multi-layer cells have been successfully used in sequence-to-sequence models too, e.g. for translation Sutskever et al., 2014.
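The generation side described in the quote (the decoder produces the output one token at a time, conditioned on the encoder's final state) can be sketched as a greedy decoding loop. Everything below, including the BOS_ID/EOS_ID constants, the two-layer cells, and the untrained random weights, is an illustrative assumption rather than the tutorial's code:

```python
import torch
import torch.nn as nn

VOCAB, EMB, HID, BOS_ID, EOS_ID = 1000, 64, 128, 1, 2   # illustrative constants
embed = nn.Embedding(VOCAB, EMB)
encoder = nn.LSTM(EMB, HID, num_layers=2, batch_first=True)   # multi-layer cells
decoder = nn.LSTM(EMB, HID, num_layers=2, batch_first=True)   # its own parameter set
proj = nn.Linear(HID, VOCAB)

src = torch.randint(0, VOCAB, (1, 12))      # one fake source sentence

with torch.no_grad():
    _, state = encoder(embed(src))          # encoder summarizes the input in (h, c)
    token = torch.tensor([[BOS_ID]])
    output_ids = []
    for _ in range(20):                     # cap the output length
        dec_out, state = decoder(embed(token), state)
        token = proj(dec_out[:, -1]).argmax(dim=-1, keepdim=True)   # greedy pick
        if token.item() == EOS_ID:
            break
        output_ids.append(token.item())

print(output_ids)                           # token ids of the generated (random) output
```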
2014a
- (Sutskever et al., 2014) ⇒ Ilya Sutskever, Oriol Vinyals, and Quoc V. Le. (2014). “Sequence to Sequence Learning with Neural Networks.” In: Advances in Neural Information Processing Systems.