[tex4ht] curiosity about unicode.4hf

Michal Hoftich michal.h21 at gmail.com
Tue Mar 14 12:21:02 CET 2017


> I use tex4ht to transform some TeX fragments to XML
>
> For instance the authors names of some physics articles:
> \author{Francesco D'Eramo}
> <contrib><string-name>Francesco D&#x2019;Eramo</string-name>...
>
> which is correctly shown by any decent xml viewer as:
> <contrib><string-name>Francesco D’Eramo</string-name>...
>
> (for instance https://repo.scoap3.org/record/19196/files/main.xml)
>
> Sometimes, I need to compare the author's name from these XML to what
> we have in our DB, and what we have in our DB is always in the form
> Francesco D'Eramo
> (with simple ' instead of ’ or &#x2019;)
>
> This is not a big problem (I just replace &#x2019; with ' and do my
> comparison).

You can make a copy of unicode.4hf in your working dir and delete the
line with &#x2019;. It will be converted as a character then.

Michal



More information about the tex4ht mailing list