85. Bi-directional RNN and Multi-layer RNN #85

neutron0831 · 2023-02-14T12:27:52Z

85. Bi-directional RNN and Multi-layer RNN

Encode the input text using both forward and backward RNNs and train the model.

$$
\overleftarrow{h}_{T+1} = 0, \\
\overleftarrow{h}_t = {\rm \overleftarrow{RNN}}(\mathrm{emb}(x_t), \overleftarrow{h}_{t+1}), \\
y = {\rm softmax}(W^{(yh)} [\overrightarrow{h}_T; \overleftarrow{h}_1] + b^{(y)})
$$

Here, $\overrightarrow{h}_t \in \mathbb{R}^{d_h}$ and $\overleftarrow{h}_t \in \mathbb{R}^{d_h}$ are the hidden state vectors at time $t$ obtained by the forward and backward RNNs, respectively. ${\rm \overleftarrow{RNN}}(x,h)$ is the RNN unit that computes the state at the previous time step from the input $x$ and the hidden state $h$ at the next time step, $W^{(yh)} \in \mathbb{R}^{L \times 2d_h}$ is the matrix for predicting categories from the hidden state vector, and $b^{(y)} \in \mathbb{R}^{L}$ is the bias term. $[a; b]$ denotes the concatenation of the two vectors $a$ and $b$.
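The forward pass defined by these equations can be sketched in NumPy as follows. This is a minimal illustration, not a reference solution. The issue does not specify the cell, so the tanh (Elman-style) cell is an assumption, as is the forward recurrence $\overrightarrow{h}_0 = 0$, $\overrightarrow{h}_t = {\rm \overrightarrow{RNN}}(\mathrm{emb}(x_t), \overrightarrow{h}_{t-1})$, which is taken by symmetry with the backward one. The helper names (`rnn_step`, `birnn_predict`), the sizes, and the random weights are hypothetical, and `embs` stands in for the already-embedded sequence $\mathrm{emb}(x_1), \dots, \mathrm{emb}(x_T)$.

```python
import numpy as np


def rnn_step(x, h, W_x, W_h, b):
    """One RNN step with an assumed tanh cell: tanh(W_x x + W_h h + b)."""
    return np.tanh(W_x @ x + W_h @ h + b)


def softmax(z):
    """Numerically stable softmax over a 1-D vector."""
    e = np.exp(z - z.max())
    return e / e.sum()


def birnn_predict(embs, params):
    """Return y = softmax(W_yh [h_fw_T; h_bw_1] + b_y) for one sequence.

    embs has shape (T, d_w): row t-1 holds emb(x_t).
    """
    d_h = params["fw"][1].shape[0]

    # Forward RNN: start from a zero state, read x_1 .. x_T, end with h_fw_T.
    h_fw = np.zeros(d_h)
    for x in embs:
        h_fw = rnn_step(x, h_fw, *params["fw"])

    # Backward RNN: h_bw_{T+1} = 0, read x_T .. x_1, end with h_bw_1.
    h_bw = np.zeros(d_h)
    for x in embs[::-1]:
        h_bw = rnn_step(x, h_bw, *params["bw"])

    h = np.concatenate([h_fw, h_bw])  # [h_fw_T; h_bw_1], shape (2 * d_h,)
    return softmax(params["W_yh"] @ h + params["b_y"])


# Arbitrary illustrative sizes: embedding dim, hidden dim, categories, length.
d_w, d_h, L, T = 5, 4, 3, 6
rng = np.random.default_rng(0)


def init_cell():
    """Random (W_x, W_h, b) for one direction; separate weights per direction."""
    return (
        rng.normal(scale=0.1, size=(d_h, d_w)),
        rng.normal(scale=0.1, size=(d_h, d_h)),
        np.zeros(d_h),
    )


params = {
    "fw": init_cell(),
    "bw": init_cell(),
    "W_yh": rng.normal(scale=0.1, size=(L, 2 * d_h)),
    "b_y": np.zeros(L),
}
embs = rng.normal(size=(T, d_w))
y = birnn_predict(embs, params)
```

Here `y` is a length-$L$ probability vector. Note that the two directions use separate parameters, and only the final state of each direction enters the classifier.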

In addition, experiment with multi-layered bidirectional RNNs.
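One possible way to set up both the bidirectional and the multi-layer variants is PyTorch's `nn.RNN` with `bidirectional=True` and `num_layers` greater than 1. The sketch below is an assumption about tooling, since the issue names no framework. The class name `BiRNNClassifier`, all sizes, and the random batch are hypothetical, and the batch is kept at a fixed length so that no padding handling is needed. The softmax from the equation is folded into `nn.CrossEntropyLoss`, which expects raw logits.

```python
import torch
import torch.nn as nn


class BiRNNClassifier(nn.Module):
    """Embedding -> (multi-layer) bidirectional RNN -> linear layer over [h_fw_T; h_bw_1]."""

    def __init__(self, vocab_size, emb_dim, hidden_dim, num_classes, num_layers=1):
        super().__init__()
        self.emb = nn.Embedding(vocab_size, emb_dim)
        self.rnn = nn.RNN(
            emb_dim,
            hidden_dim,
            num_layers=num_layers,
            bidirectional=True,
            batch_first=True,
        )
        # W^(yh) has shape (L, 2 * d_h) because the two directions are concatenated.
        self.fc = nn.Linear(2 * hidden_dim, num_classes)

    def forward(self, token_ids):
        # h_n has shape (num_layers * 2, batch, hidden_dim), ordered by layer,
        # then direction, so the last two entries belong to the top layer.
        _, h_n = self.rnn(self.emb(token_ids))
        h_fw_last = h_n[-2]  # top-layer forward state after reading x_T
        h_bw_first = h_n[-1]  # top-layer backward state after reading x_1
        return self.fc(torch.cat([h_fw_last, h_bw_first], dim=1))  # logits


torch.manual_seed(0)
model = BiRNNClassifier(vocab_size=100, emb_dim=16, hidden_dim=8,
                        num_classes=4, num_layers=2)

# Hypothetical batch: 3 sequences of 7 token IDs each, with made-up labels.
token_ids = torch.randint(0, 100, (3, 7))
labels = torch.tensor([0, 1, 3])

optimizer = torch.optim.SGD(model.parameters(), lr=0.1)
loss_fn = nn.CrossEntropyLoss()  # applies log-softmax to the logits internally

# One training step.
logits = model(token_ids)
loss = loss_fn(logits, labels)
optimizer.zero_grad()
loss.backward()
optimizer.step()
```

Setting `num_layers=1` gives the single-layer bidirectional model from the equations. With variable-length batches, the forward direction's final state would need care (for example, packing the padded sequences) so that it is not read at a padding position.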

neutron0831 added the enhancement (New feature or request) label Feb 14, 2023

neutron0831 added this to the Chapter 9: RNN and CNN milestone Feb 14, 2023

neutron0831 self-assigned this Feb 14, 2023
