Differences
This shows you the differences between two versions of the page.
Both sides previous revisionPrevious revisionNext revision | Previous revision |
publishing:digital:word_conversion [2021/03/20 12:37] – maarten | publishing:digital:word_conversion [2024/02/14 12:20] (current) – external edit 127.0.0.1 |
---|
Another method is using a site such as https://wordhtml.com/, but often the most reliable method is to use Calibre and to convert to [[publishing:digital:ebook|EPUB]], certainly if you have footnotes in the text, and from there [[publishing:digital:html|convert to HTML]]. | Another method is using a site such as https://wordhtml.com/, but often the most reliable method is to use Calibre and to convert to [[publishing:digital:ebook|EPUB]], certainly if you have footnotes in the text, and from there [[publishing:digital:html|convert to HTML]]. |
| |
With all methods try to have your initial layout in Word as simple and clean as possible to avoid bad HTML code. Make sure in Word the main titles are “Heading 1” and subtitles “Heading 2”, etc., but if that is not practical in larger documents with poor formatting, often it is easier/quicker to convert the ‘dirty’ Word file in Calibre first and edit the EPUB conversion in Sigil. When using Calibre, it can be useful to change the default setting and to tick “Do not split on page breaks” and to set 0 instead of the default 260kb for the split threshold, then split manually in Sigil according to the chapters (''Edit –> Split at cursor''). | With all methods try to have your initial layout in Word as simple and clean as possible to avoid bad HTML code. Make sure in Word the main titles are “Heading 1” and subtitles “Heading 2”, etc., but if that is not practical in larger documents with poor formatting, often it is easier/quicker to convert the ‘dirty’ Word file in Calibre first and [[publishing:digital:ebook|edit the EPUB conversion in Sigil]]. When using Calibre, it can be useful to change the default setting and to tick “Do not split on page breaks” and to set 0 instead of the default 260kb for the split threshold, then split manually in Sigil according to the chapters (''Edit –> Split at cursor''). |
| |
{{:publishing:digital:word-conversion.jpg|}} | {{:publishing:digital:word-conversion.jpg|}} |
| |
| The Calibre editor also has a useful ''Tools -> Remove unused CSS rules'' function that can clean up <span> tags and various classes without losing the italics and footnotes. Again, be careful and make sure you don't lose any formatting (you may have to get familiar with [[:regex|regular expressions]] to capture certain classes in order to convert them to italics before cleaning up). |
| |