[l2h] Converting emdashs and endashs?
James Howison
jhowison at syr.edu
Mon Aug 11 15:49:17 CEST 2003
Now I have curly quotes happening (yay!) I am wondering about the other
special characters. I realize that this will break back-wards
compatibility but that is not an issue for my needs.
I would like "---" to be converted to "—" as defined in the
unicode.pl file at 799 - but this doesn't seem to happen - instead it
is converted to "--". This is also what happens if I change --- to
{---}.
I'm not sure why some of the conversions in the unicode.pl file happen,
while others do not. I can't find an equivalent of the
$USE_CURLY_QUOTES in the source code that seems relevant to mdash ...
Any ideas on how to get a maximal set of the conversions in unicode.pl
actually happening? I notice that there is no do_cmd_textemdash in
unicode.pl - is that why?
Also I see from the source that converting single quotes is
tough---perhaps I'm naive but it would seem to me that this sequence
would work...
s/``/“/og
s/`/‘/og # once the `` is gone then the ` is only used for
open single quote right?
s/''/”/og
s/'/’/og # Will also replace apostrophes with close curly
single - not a bad thing.
i.e. ensure that one does the singles after the doubles ...
But there is probably a better algorithm in the source code for 'quoter'
http://www.dwheeler.com/quoter/
Thanks,
James
On Saturday, August 9, 2003, at 02:53 am, Ross Moore wrote:
> On Sat, 9 Aug 2003, James Howison wrote:
>
>> Hi all,
>>
>> I'd really like to convert the latex quotation marks, `` and '' to the
>> recommended HTML curly quotes, “ instead of `` and ” instead
>> of '' - standard codes that render the curly quotes beautifully.
>
> set
> $USE_CURLY_QUOTES =1;
> in an initialisation file.
>
> This is not the default, because not all browsers actually render
> these characters. (At least, that was the situation 3-4 years ago when
> the LaTex2HTML coding was written.)
>
>
> Hope this helps,
>
> Ross Moore
>
>
>>
>> I'm sure that this is possible through latex2html - the codes are
>> listed around unicode.pl:722 - but either I can't find the magic
>> incantation to have latex2html do the conversion or there is a bug
>> preventing this from working in my version (1.70) or set-up.
>>
>> I've tried:
>>
>> latex2html -html_version 4.0,unicode test.tex
>>
>> What is strange is that this does work for, say \v{Z} which converts
>> to
>> the code Ž (and that is definitely happening through unicode.pl
>> (I
>> changed the translation and it worked fine).
>>
>> So why doesn't the translation for `` (which is correctly listed in
>> the
>> unicode.pl as \`\`) and '' which is correctly listed as \'\' work?
>>
>> I've had a good hunt around for this - but I can't see why the other
>> codes are converted but not the quotes.
>>
>> Cheers,
>> James
>>
>> ps. minimal test.tex follows
>>
>> ----------
>>
>> \documentclass[11pt]{article}
>> \begin{document}
>> ``Why are these quotes not converted to unicode'' (they are in the
>> unicode.pl file)
>> While this symbol (also in the unicode.pl file) is? - \v{Z}
>> \end{document}
>>
>> _______________________________________________
>> latex2html mailing list
>> latex2html at tug.org
>> http://tug.org/mailman/listinfo/latex2html
>>
>
More information about the latex2html
mailing list