Review Article
A Comparative Study of Some Automatic Arabic Text Diacritization Systems
Algorithm 1
create_Distribution_Of_Diacritized_Letters.
(i) | Input: list of diacritized sentences s, | (ii) | Output: distribution of diacritized letters lDist | (iii) | SET words EQUAL TO EMPTY LIST | (iv) | SET letters EQUAL TO EMPTY LIST | (v) | SET lDist EQUAL TO EMPTY DICTIONARY | (vi) | tool: = MyToolKit()//this is a toolkit class from the same work, it contains a group of methods | (vii) | FOR EACH sent IN s | (viii) | sentenceWords: = tool.words(sent)//splitting according to Arabic regular expression | (ix) | words.extend(sentenceWords) | (x) | FOR IN words | (xi) | wordLetters: = tool.LettersDiac(w)//diacritized letters of the word w | (xii) | letters.extend(wordLetters) | (xiii) | lDist: = freqDist(letters) | (xiv) | RETURN lDist |
|