[l2h] Looking for the reverse procedure - html to (La?)Tex...

Graham Toal gtoal at gtoal.com
Mon Jun 14 20:57:29 CEST 2010


I hope html2latex isn't too off topic for this group.  (Moderator, feel free
to reject this post if it is.)

I have an ebook formatted in a fairly minimal HTML (a couple of <div
align="right"> statements are about the only thing in it that wouldn't have
been recognised by the original 1.0 NCSA browser!) so I was wondering what
would be the best tool to convert this to TeX?  No 8-bit chars in the
source, just entity-refs.  Should be easy.  I'm looking to see what's around
before I write my own...  Obviously I'm willing to do whatever tweaking is
necessary once converted to make it look good, but the more that can be done
automatically the better.
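
For illustration only, here is a rough sketch of what a hand-rolled converter
for HTML this minimal might look like.  It is not any existing tool: the
names HtmlToLatex and html_to_latex and the tag-to-macro mapping are
assumptions made up for this example.  It relies on Python's standard
html.parser module, which decodes entity refs to Unicode characters, so the
resulting file would need a UTF-8-aware LaTeX setup.

```python
from html.parser import HTMLParser

# LaTeX special characters and their escaped forms.
LATEX_ESCAPES = {
    "\\": r"\textbackslash{}",
    "&": r"\&", "%": r"\%", "$": r"\$", "#": r"\#",
    "_": r"\_", "{": r"\{", "}": r"\}",
    "~": r"\textasciitilde{}", "^": r"\textasciicircum{}",
}

# Hypothetical mapping from simple tags to (opening, closing) LaTeX text.
SIMPLE_TAGS = {
    "h1": ("\\section*{", "}\n\n"),
    "h2": ("\\subsection*{", "}\n\n"),
    "i": ("\\emph{", "}"),
    "em": ("\\emph{", "}"),
    "b": ("\\textbf{", "}"),
    "strong": ("\\textbf{", "}"),
}


class HtmlToLatex(HTMLParser):
    """Collects LaTeX fragments while the HTML is parsed."""

    def __init__(self):
        # convert_charrefs=True turns entity refs such as &amp; into characters.
        super().__init__(convert_charrefs=True)
        self.out = []
        self.div_stack = []  # True for each open <div align="right">

    def handle_starttag(self, tag, attrs):
        if tag in SIMPLE_TAGS:
            self.out.append(SIMPLE_TAGS[tag][0])
        elif tag == "div":
            align = dict(attrs).get("align") or ""
            right = align.lower() == "right"
            self.div_stack.append(right)
            if right:
                self.out.append("\\begin{flushright}\n")

    def handle_endtag(self, tag):
        if tag in SIMPLE_TAGS:
            self.out.append(SIMPLE_TAGS[tag][1])
        elif tag == "p":
            self.out.append("\n\n")  # blank line ends a LaTeX paragraph
        elif tag == "div" and self.div_stack:
            if self.div_stack.pop():
                self.out.append("\n\\end{flushright}\n")

    def handle_data(self, data):
        # Escape every character that LaTeX treats specially.
        self.out.append("".join(LATEX_ESCAPES.get(c, c) for c in data))


def html_to_latex(html):
    """Convert a fragment of minimal HTML to LaTeX body text."""
    parser = HtmlToLatex()
    parser.feed(html)
    parser.close()  # flush any text still buffered by the parser
    return "".join(parser.out)
```

Calling html_to_latex('<p>Tom &amp; Jerry</p>') yields body text only; the
preamble, images and table of contents would still have to be handled
separately.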

By the way, I've discovered that you can fairly easily create a Kindle ebook
that is just a file of rendered pages: simply alternate <IMG> tags with a
Mobipocket extension tag, <mbp:pagebreak />, and you get a regular
file that you can download and read like normal.  (My previous effort meant
that you had to download a directory of separate images in a zip file, which
was not very convenient.)
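
As a minimal sketch of the alternating layout described above, the HTML
could be generated along these lines.  The function name build_image_book,
the title parameter and the image file names are made up for this example;
only the <img> tag and the <mbp:pagebreak /> extension come from the
description above.

```python
from html import escape


def build_image_book(image_paths, title="Untitled"):
    """Return HTML with one page image per ebook page."""
    parts = [
        "<html>",
        "<head><title>%s</title></head>" % escape(title),
        "<body>",
    ]
    for i, path in enumerate(image_paths):
        if i > 0:
            # Mobipocket extension tag: start a new page before each later image.
            parts.append("<mbp:pagebreak />")
        parts.append('<img src="%s" />' % escape(path, quote=True))
    parts.append("</body>")
    parts.append("</html>")
    return "\n".join(parts)


if __name__ == "__main__":
    # Hypothetical file names, for illustration only.
    print(build_image_book(["page001.png", "page002.png"], title="My Book"))
```

The output would then be fed to whatever tool builds the Kindle file; that
step is not shown here.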

[And yes, I know that imaged pages don't allow the user to change font size
or justification options.  For my purposes, I can live with that...]

Btw, I'm not asking in order to avoid doing the research: I have identified
several tools that might do some of the job.  I'm posting to ask what you've
used and how good your preferred converter is in practice, and what pitfalls
there are that they don't warn you about on the web page.

thanks,

Graham
PS The ebook also has a simple table of contents and a few images, each on a
page by itself.  Nothing complicated.