Review Article

A Comparative Study of Some Automatic Arabic Text Diacritization Systems

Algorithm 1

create_Distribution_Of_Diacritized_Letters.
(i)Input: list of diacritized sentences s,
(ii)Output: distribution of diacritized letters lDist
(iii)SET words EQUAL TO EMPTY LIST
(iv)SET letters EQUAL TO EMPTY LIST
(v)SET lDist EQUAL TO EMPTY DICTIONARY
(vi)tool: = MyToolKit()//this is a toolkit class from the same work, it contains a group of methods
(vii)FOR EACH sent IN s
(viii) sentenceWords: = tool.words(sent)//splitting according to Arabic regular expression
(ix) words.extend(sentenceWords)
(x)FOR IN words
(xi) wordLetters: = tool.LettersDiac(w)//diacritized letters of the word w
(xii) letters.extend(wordLetters)
(xiii)lDist: = freqDist(letters)
(xiv)RETURN lDist