[l2h] current state of unicode support

Janusz S. Bień jsbien at mimuw.edu.pl
Sun Jul 6 16:04:50 CEST 2003


On Sun, 6 Jul 2003  Ross Moore <ross at ics.mq.edu.au> wrote:

Thanks for your quick answer.


> On Sun, 6 Jul 2003, [iso-8859-2] Janusz S. Bieñ wrote:
> 
> >
> > Can latex2html accept some form of unicode (utf8 or utf16) as input?
> 
> Yes, and no.
> 
> The `no' means that there is nothing that is specifically designed
> to support this kind of input.
> 
> The `yes' means that the effect of supplying UTF8 *should* be that any
> bytes (nibbles?) in the upper range go through unchanged.
> 
> If this does not happen by default, then it is because the default
> charsets assume that upper-8-bit characters have a special meaning that
> can be translated into alternative TeX sequences, and perhaps require
> an image to be created.
> 
> To stop this you may need to specify on the commandline something like:
> 
>   latex2html -html_version 4.0,unicode  ...other-options...  <filename>
> or
>   latex2html -html_version 4.0,unicode,utf8  ......
> or even
>   latex2html -html_version 4.0,unicode,unicode  ......

[...]

The command line

        latex2html -html_version 4.0,unicode,utf8  ......

produces something strange, but both other variants works correctly.

Thanks again for your help.

Best regards

Janusz

-- 
                     ,   
dr hab. Janusz S. Bien, prof. UW
Prof. Janusz S. Bien, Warsaw Uniwersity
jsbien at mimuw.edu.pl, jsbien at uw.edu.pl
http://www.orient.uw.edu.pl/~jsbien/
http://www.mimuw.edu.pl/~jsbien/


More information about the latex2html mailing list