You can develop your own document converter. Either in Java or .NET. Have a look at the required code. .NET You have to download and add GroupDocs.Conversion for .NET DLL reference in your project. In case of any issue, you can create a post http.//forum.groupdocs.com/ using (Converter converter = new Converter("sample.pdf")) { MarkupConvertOptions options = new MarkupConvertOptions ; converter.Convert("converted.rtf", options); } Java Download and add GroupDocs.Conversion for Java JAR file in your project. In case of any issue, you can create a post http.//forum.groupdocs.com/ String outputFile = "ConvertToHtml.html"; Converter converter = new Converter("sample.docx"); MarkupConvertOptions options = new MarkupConvertOptions ; converter.convert(outputFile, options); System.out.print("\nConversion to html completed successfully. \nCheck output in " + outputFile);
It can take up to several hours to do. ITAs also possible to have a program do the formatting. However, you need a lot of time. So you may as well take care of it on-line. If you only have a few articles to convert, then you should use RTF, and if not than PDF. Note: if the text is very long (like 500 or 1000 words), and only small parts of it is not needed to be displayed (like the logo or title) then you can use a .gif file.