Research Article

Parsing of Research Documents into XML Using Formal Grammars

Algorithm 1

Parsing document to XML algorithm.
(1) Input: PDF proposal document
    Output: XML document
    Given: a set of document styles,
           a set of predefined text strings,
           an empty nested array serving as the empty parse tree object, [[]],
           a set of productions
(2) /* Extract text string tokens from the PDF document into the nested array object */
    for each text string token in the document do
(3)   if … then
(4)     /* Add extracted text string tokens to the nested array object */
        if … then
(5)       add the token to the nested array
(6)     end
(7)   end
(8) end
(9) /* Serialize the nested array (a parse tree) object into an XML file */
    return XML_Document
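
The control flow of Algorithm 1 can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the authors' implementation. PDF extraction is skipped: tokens are supplied directly as hypothetical (style, text) pairs. The two nested conditions are assumed to be a style-membership test and a match against the predefined text strings. The names STYLES, PRODUCTIONS, build_parse_tree, and serialize, the sample values, and the "proposal" root tag are all invented for the example.

```python
import xml.etree.ElementTree as ET

# Hypothetical inputs: none of these names or values come from the paper.
STYLES = {"heading", "bold"}       # set of document styles
PRODUCTIONS = {                    # predefined text string -> XML tag
    "Title": "title",
    "Abstract": "abstract",
    "Budget": "budget",
}


def build_parse_tree(tokens, styles, productions):
    """Collect [tag, text] pairs for tokens that pass both assumed checks."""
    tree = []  # nested array that serves as the parse tree
    for style, text in tokens:
        if style in styles:          # assumed outer condition
            if text in productions:  # assumed inner condition
                tree.append([productions[text], text])
    return tree


def serialize(tree, root_tag="proposal"):
    """Serialize the nested array into an XML string."""
    root = ET.Element(root_tag)
    for tag, text in tree:
        ET.SubElement(root, tag).text = text
    return ET.tostring(root, encoding="unicode")


# Hypothetical token stream standing in for text extracted from a PDF.
tokens = [
    ("heading", "Title"),
    ("body", "Some running text"),
    ("heading", "Abstract"),
    ("heading", "Unlisted heading"),
]
tree = build_parse_tree(tokens, STYLES, PRODUCTIONS)
xml_document = serialize(tree)
print(xml_document)
# prints <proposal><title>Title</title><abstract>Abstract</abstract></proposal>
```

Only tokens that pass both checks reach the parse tree, so the body text and the unlisted heading are dropped. A fuller version would also attach body text to the most recently opened element.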