|Conversion Options Dialog|
DocToHtml—How to Speed Up the Conversion Process
To access the original document being converted, DocToHtml uses
MS Word OLE automation. This approach provides independence
from the particular input document format, be it RTF, DOC, DOCX,
etc. Alas, there is a catch: sometimes the conversion speed may be
extremely low. The following information might be helpful in
improving the situation.
- DocToHtml has a 32-bit core conversion engine. Even if
your MS Word edition is 64 bit, DocToHtml’s 32-bit conversion
engine still will be used. Moreover, it will be necessary to
translate each parameter for each function call between the 32- and
64-bit domains, which may cause a significant slowdown. So if this
is the case, think about installing a 32-bit edition of MS
- Turn on the “No endnote formatting” and
“No footnote formatting” options via the
Body & Footer tab of the
Conversion Options dialog. If these options are on, any special
formatting (bold, underline, italic letters, etc.) will be omitted
from the text in footnotes and endnotes, but the conversion speed
will be more reasonable.
- At the bottom of the Advanced
tab, there is an option named “Do not preserve
clipboard content.” If it is turned off, DocToHtml will
always save and then restore clipboard content before and after
each operation involving the clipboard. (The program uses the
clipboard when processing images and some special characters.)
Turning this option on will speed up the conversion process, but
your data in the clipboard may be occasionally destroyed.
- You may want to turn off the “Additional check
for special chars” option at the bottom of the HTML Code tab to reduce the number of
clipboard operations, which might speed up the conversion. However,
in that case, less strict parameters will be used when handling
special character codes.
- Make sure that your PC has enough free physical memory. In most
cases, about 100 MB is enough.
- Make sure that you are not running other CPU-intensive tasks
during the conversion. Currently, DocToHtml cannot take advantage
of multi-core CPUs and does all processing in one thread. So at
least one CPU core should not be overloaded by other
- In general, the less formatting options you use, the faster is
the conversion. If you turn such an option on, DocToHtml will have
to query the respective formatting attribute for every paragraph
and character that are to be processed, which takes some time.