[tex4ht] curiosity about unicode.4hf

Karl Berry karl at freefriends.org
Mon Mar 13 22:53:47 CET 2017


Hi Matteo,

    I get "a.html" that contains:
    ...’...

I guess you're expecting the literal UTF-8 right single quote instead of
the entity syntax?

    AFAIK, ' and " are illegal in attributes, 

I have used those characters in attribute values. Anyway, how are
attributes related to the example?  I'm baffled here, sorry.

    (and #x2018 is not in the file - texlive2016).
    Does anyone know why &x2019; ended up in unicode.4hf?

I don't know why Eitan decided to translate ASCII ' to the Unicode
entity value and leave ASCII ` output as literal UTF-8 (with your options).
I don't know what the implications would be of changing it, either; not
something I would want to do lightly.

Briefly looking at the source file (tex4ht-fonts-4hf.tex), I don't see
any explanation. Could have missed it.

Does outputting the entity cause some problem?

    htlatex a "xhtml" " -cunihtf -utf8"

Why do you want to use those options in the first place?
(Just wondering.)

Thanks,
Karl


More information about the tex4ht mailing list