Research Article

Indonesian Lip-Reading Detection and Recognition Based on Lip Shape Using Face Mesh and Long-Term Recurrent Convolutional Network

Table 2

Neural network architecture.

ModelDetailed architecture
WordsPhrase

Conv-LSTMConvLSTM2D (8)ConvLSTM2D (8)
MaxPooling3D ()MaxPooling3D ()
ConvLSTM2D (16)ConvLSTM2D (16)
MaxPooling3D ()MaxPooling3D ()
Flatten ()Flatten ()
Dense (10)Dense (4)

LRCN-2ConvTimeDistributed (Conv2D (16))TimeDistributed (Conv2D (16))
TimeDistributed (MaxPooling2D ())TimeDistributed (MaxPooling2D ())
TimeDistributed (Dropout ())TimeDistributed (Dropout ())
TimeDistributed (Conv2D (32))TimeDistributed (Conv2D (32))
TimeDistributed (MaxPooling2D ())TimeDistributed (MaxPooling2D ())
TimeDistributed (Dropout ())TimeDistributed (Dropout ())
TimeDistributed (Flatten ())TimeDistributed (Flatten ())
LSTM (64)LSTM (64)
Dropout ()Dropout ()
Dense (10)Dense (4)

LRCN-3ConvTimeDistributed (Conv2D (16))TimeDistributed (Conv2D (16))
TimeDistributed (MaxPooling2D ())TimeDistributed (MaxPooling2D ())
TimeDistributed (Dropout ())TimeDistributed (Dropout ())
TimeDistributed (Conv2D (32))TimeDistributed (Conv2D (32))
TimeDistributed (MaxPooling2D ())TimeDistributed (MaxPooling2D ())
TimeDistributed (Dropout ())TimeDistributed (Dropout ())
TimeDistributed (Conv2D (64))TimeDistributed (Conv2D (64))
TimeDistributed (MaxPooling2D ())TimeDistributed (MaxPooling2D ())
TimeDistributed (Dropout ())TimeDistributed (Dropout ())
TimeDistributed (Flatten ())TimeDistributed (Flatten ())
LSTM (64)LSTM (64)
Dropout ()Dropout ()
Dense (10)Dense (4)