Tutorial 18: Train a GRU/LSTM model using the MNIST dataset