Development of a converter of KIAM preprints from .docx format to HTML and JATS XML formats
Abstract:
Along with the traditional form of electronic presentation of full texts of scientific articles – PDF format – in recent years the HTML format has become widespread. HTML has a number of advantages for online publications due to the means it has for better structuring of material, adding multimedia content and implementation of various interactive and dynamic capabilities. The most common approach to generating an HTML version of an article is to first create its XML version in accordance with the JATS XML standard developed in the USA, which, in addition to the basis for creating HTML and PDF versions, is also a standard for exchanging and storing article contents. However, converting scientific articles with complex content, including a large number of formulas, tables and figures, authored in the most commonly used .docx and LaTeX formats into this format is not an easy task and the available software tools either do not cope with it in full or are quite expensive. The paper proposes an approach to creating a converter of scientific articles from the .docx format to HTML and JATS XML formats leveraging the open source tool Mammoth and describes a prototype converter of KIAM preprints to HTML with subsequent conversion to JATS XML, created based on this approach.
Keywords:
journal article HTML-version, JATS XML, conversion of scholarly article formats, KIAM preprints