Research Article

A Relationship: Word Alignment, Phrase Table, and Translation Quality

Table 7

Statistics of CWMT 2013 + UM-Corpus.

Languages      Tokens         Average length   Vocabularies
English        152,161,233    19.37            1,655,080
CharacterCE    229,110,265    29.16            397,442
CTBCE          123,917,395    15.78            1,331,505