Final States
Learn about the final state output of an LSTM and BiLSTM.
Chapter Goals:
Learn about the final states for an LSTM and BiLSTM
A. The encoder
In an encoder-decoder model for seq2seq tasks, there are two components: the encoder and the decoder. The encoder is responsible for extracting useful information from the input sequence. For NLP applications, the encoder is normally an LSTM or BiLSTM.
B. LSTM final state
When using an LSTM or BiLSTM encoder, we need to pass the final state of the encoder into the decoder. The final state of an LSTM in TensorFlow is represented by two tensors: the final hidden state and the final cell state.
Python 3.5
import tensorflow as tf

# Keras layers need graph mode to run on placeholders (TF 2.x)
tf.compat.v1.disable_eager_execution()

# Input sequences (embedded)
# Shape: (batch_size, max_seq_len, embed_dim)
input_embeddings = tf.compat.v1.placeholder(tf.float32, shape=(None, None, 4))

cell = tf.keras.layers.LSTMCell(5)
rnn = tf.keras.layers.RNN(cell, return_state=True)
output = rnn(input_embeddings)

# With return_state=True, rnn returns the sequence outputs followed by
# two final state tensors: output is [outputs, hidden_state, cell_state]
final_state = (output[1], output[2])

# final_state is the output of our LSTM encoder.
# It contains all the information about our input sequence,
# which in this case is just a tf.compat.v1.placeholder object
print(final_state)
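Since this chapter also covers the BiLSTM, it helps to see its final states too. A BiLSTM runs two LSTMs over the sequence, one forward and one backward, so it produces two pairs of final states. Below is a minimal sketch using the Keras Bidirectional wrapper on a concrete random batch (the variable names are illustrative, not from the original lesson):

```python
import tensorflow as tf

# Bidirectional wraps an LSTM and runs it in both directions.
# With return_state=True, it returns:
#   [outputs, forward_h, forward_c, backward_h, backward_c]
bi_lstm = tf.keras.layers.Bidirectional(
    tf.keras.layers.LSTM(5, return_state=True))

# Concrete batch: 2 sequences, length 3, embedding size 4
inputs = tf.random.normal((2, 3, 4))
outputs, fw_h, fw_c, bw_h, bw_c = bi_lstm(inputs)

# Each final state has shape (batch_size, units) = (2, 5);
# outputs concatenates both directions, giving shape (2, 10)
print(fw_h.shape, fw_c.shape, bw_h.shape, bw_c.shape)
```

When feeding a decoder, the forward and backward states are commonly concatenated, e.g. `tf.concat([fw_h, bw_h], axis=-1)`, giving a state of size 2 * units.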
An ...