Research Article
Parsing of Research Documents into XML Using Formal Grammars
Table 3
Comparison of RX tool with other alternatives.
| Software tool | Successful XML conversion? | Meaningful XML? | Rich XML tree? | XML tag content? | Comments | Link |
| FormAI | Y | N | N | Y | XML structure has all the contents of the pdf file in a single tag | https://www.formx.ai | i2pdf | Y | N | Y | Y | XML structure was too complex with many verbose tags of the input file metadata | https://www.2pdf.com | Nanonets pdf_to_xml | N | NA | NA | NA | NA | https://www.nanonets.com | Vertopal | Y | N | Y | Y | XML structure was too complex with many verbose tags of the input file metadata | https://www.vertopal.com | Aconvert | Y | N | Y | Y | XML structure was too complex with many verbose tags of the input file metadata | https://www.aconvert.com | RX | Y | Y | Y | Y | The tags are meaningful and has meaningful content, often a text chunk of complete information | github repo |
|
|