[l2h] Confused about Unicode support

Mariusz Pietrzak <mariuszp@polbox.pl>
Wed, 30 Jun 1999 17:39:44 +0200


Hi,

Ross MOORE wrote:
> Hmm. It certainly works correctly if you use \L and \l
> for the Polish L characters; so I presume that you are using
> upper-plane (129-255) characters directly in the source, yes ?

Yes

> OK, I think I see what is causing the problem.
> In the file  ...../versions/unicode.pl
> there is a line near the top:
> 
>         require("$LATEX2HTMLVERSIONS${dd}latin1.pl");
> 
> Change this to read:
> 
> require("$LATEX2HTMLVERSIONS${dd}latin1.pl") if ($CHARSET =~/iso\-8859\-1/);
> 

Thanks, the patch works, but ...
how about generating the Polish characters without an 8-bit font
(and without images), using the standard commands:
\k{a} \'c \k{e} \l{} \'n \'o \'s \'z \.z 
\k{A} \'C \k{E} \L{} \'N \'O \'S \'Z \.Z 
This worked with the "-html_version 3.2,latin2,unicode" switch.
Now (after the above patch) it works except for \'o and \'O: l2h can't
convert them into the available encodings. Is that expected? Before the
patch it could.
And when using Latin2 output ("-html_version 3.2,latin2"), the
characters generated as above appear as &#<latin2_number> (only \'o and
\'O appear as regular characters), so my Netscape, at least, can't
display them correctly. I think that &# requires the Unicode number
(regardless of the selected charset). Maybe in future it would be
possible to generate 8-bit characters rather than entities.
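
To illustrate the mismatch described above, here is a minimal sketch in
Python (not l2h code; the names POLISH, latin2_number, entity and
mismatches are made up for this example). It compares the Latin-2 byte
value of each Polish letter with its Unicode code point. Only \'o and \'O
have the same number in both, which would be consistent with those two
being the only ones that display correctly.

```python
# Polish letters produced by \k{a} \'c \k{e} \l{} \'n \'o \'s \'z \.z
# and their capitals (hypothetical test set for this illustration).
POLISH = "ąćęłńóśźżĄĆĘŁŃÓŚŹŻ"


def latin2_number(ch):
    """Return the byte value of ch in ISO-8859-2 (Latin-2)."""
    return ch.encode("iso8859_2")[0]


def entity(number):
    """Format a decimal numeric character reference."""
    return "&#%d;" % number


def mismatches(chars=POLISH):
    """Characters whose Latin-2 byte value differs from the Unicode code point."""
    return [c for c in chars if latin2_number(c) != ord(c)]


def table(chars=POLISH):
    """Rows of (code point label, entity from Latin-2 number, entity from Unicode)."""
    return [("U+%04X" % ord(c), entity(latin2_number(c)), entity(ord(c)))
            for c in chars]


if __name__ == "__main__":
    for label, from_latin2, from_unicode in table():
        print(label, from_latin2, from_unicode)
```

For example, \l{} is byte 179 in Latin-2 but code point 322 in Unicode,
so an entity built from the Latin-2 number points at a different
character, assuming the browser reads &# values as Unicode numbers.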

And one more question:
Is there a difference between \usepackage[latin2]{inputenc}
and setting latin2 using $CHARSET and $HTML_VERSION? Which
one is the better way?

PS: In the manual, page 15, I think there should be
$TITLES_LANGUAGE = 'french'; rather than $LANGUAGE_TITLES = ...

Regards

Mariusz Pietrzak
mariuszp@polbox.pl