Research Article
Parsing of Research Documents into XML Using Formal Grammars
Algorithm 1
Algorithm for parsing a document into XML.
(1) Input: PDF proposal document
    Output: XML document
    … = set of document styles
    … = set of predefined text strings
    … = empty nested array/set as the empty parse tree object, [[]]
    … /* set of productions */
(2) /* Extract text string tokens from … into the nested array object */
    for … do
(3)     if … then
(4)         /* Add extracted text string tokens to the nested array object */
            if … then
(5)             Add … to …
(6)         end
(7)     end
(8) end
(9) /* Serialize the nested array (a parse tree) object into an XML file */
    return XML_Document
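The overall flow of Algorithm 1 (collect text tokens into a nested array, then serialize that array as XML) can be illustrated with a minimal Python sketch. It is not the authors' implementation: PDF text extraction is assumed to have already produced (style, text) pairs, the grammar productions are replaced by a simple style and title check, and the names `HEADING_STYLES`, `SECTION_TITLES`, `build_parse_tree`, `to_xml`, and the XML tag names are all hypothetical.

```python
import xml.etree.ElementTree as ET

# Hypothetical stand-ins for the algorithm's inputs (not from the article).
HEADING_STYLES = {"Heading1", "Heading2"}               # set of document styles
SECTION_TITLES = {"Abstract", "Objectives", "Budget"}   # set of predefined text strings


def build_parse_tree(spans):
    """Group (style, text) spans into a nested list: [[title, body, ...], ...]."""
    tree = []  # empty nested array acting as the parse tree
    for style, text in spans:
        text = text.strip()
        if not text:
            continue  # skip empty tokens
        if style in HEADING_STYLES and text in SECTION_TITLES:
            tree.append([text])      # a recognised heading opens a new section node
        elif tree:
            tree[-1].append(text)    # other text is attached to the open section
    return tree


def to_xml(tree, root_tag="proposal"):
    """Serialize the nested list into an XML string."""
    root = ET.Element(root_tag)
    for title, *paragraphs in tree:
        section = ET.SubElement(root, "section", name=title)
        for paragraph in paragraphs:
            ET.SubElement(section, "para").text = paragraph
    return ET.tostring(root, encoding="unicode")


# Assumed output of a prior PDF text-extraction step.
spans = [
    ("Heading1", "Abstract"),
    ("Body", "We propose a grammar-based parser."),
    ("Heading1", "Budget"),
    ("Body", "Total cost: 10,000."),
]
print(to_xml(build_parse_tree(spans)))
```

Keeping the intermediate parse tree as a plain nested list mirrors the nested array object in the algorithm and keeps the serialization step independent of how the tokens were recognised.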