Research Article

Indonesian Lip-Reading Detection and Recognition Based on Lip Shape Using Face Mesh and Long-Term Recurrent Convolutional Network

Table 3

Performance results of IndoLR.

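| Model | Words: Val. acc. (%) | Words: Test acc. (%) | Words: Tt (s) | Words: Rt (ms) | Phrase: Val. acc. (%) | Phrase: Test acc. (%) | Phrase: Tt (s) | Phrase: Rt (ms) |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Conv-LSTM | 94.7 | 90.42 | 7 | 996.6±88 | 95 | 90.62 | 9 | 509.8±105 |
| LRCN-2Conv | 95.83 | 92.92 | | 600.3±64 | 99.37 | 95.00 | | 539.6±68 |
| LRCN-3Conv | 97.92 | 95.42 | | 727.3±62 | 99.37 | 95.63 | | 585.9±66 |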

Tt, training time in seconds; Rt, average recognition time for each video sample in milliseconds.
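
As an illustration of how a per-sample recognition time such as Rt might be measured, the following is a minimal sketch and not the authors' procedure. It assumes that the "±" in the table denotes a standard deviation, which the source does not state. The names `recognition_time_ms`, `format_rt`, and the `recognize` callable are hypothetical and stand in for whatever inference step is being timed.

```python
import statistics
import time


def recognition_time_ms(recognize, samples):
    """Return (mean, std) of per-sample wall-clock recognition time in ms.

    `recognize` is a hypothetical callable standing in for the model's
    inference step; `samples` is any iterable of inputs for it.
    """
    timings = []
    for sample in samples:
        start = time.perf_counter()
        recognize(sample)
        # perf_counter returns seconds; convert to milliseconds.
        timings.append((time.perf_counter() - start) * 1000.0)
    mean = statistics.mean(timings)
    # Assumption: spread is reported as the sample standard deviation.
    std = statistics.stdev(timings) if len(timings) > 1 else 0.0
    return mean, std


def format_rt(mean, std):
    """Format a mean and spread in the table's style, e.g. '600.3±64'."""
    return f"{mean:.1f}±{std:.0f}"
```

Calling `format_rt(*recognition_time_ms(model_fn, test_videos))`, with `model_fn` and `test_videos` supplied by the reader, would yield a string in the same form as the Rt entries above.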