[XeTeX] traditional to simplified Chinese character conversion utility or data base

Daniel Greenhoe dgreenhoe at gmail.com
Sat Oct 22 09:17:50 CEST 2011


On Sat, Oct 22, 2011 at 2:11 PM, Zdenek Wagner <zdenek.wagner at gmail.com> wrote:
> \addfontfeatures{Mapping=mapname}.

Amazing. I tested it. It works. Amazing. Thanks! It's a really great solution.

Dan

On Sat, Oct 22, 2011 at 2:11 PM, Zdenek Wagner <zdenek.wagner at gmail.com> wrote:
> 2011/10/22 Daniel Greenhoe <dgreenhoe at gmail.com>:
>> On Tue, Oct 18, 2011 at 2:53 PM, Zdenek Wagner <zdenek.wagner at gmail.com> wrote:
>>> If you wish to do it on the fly in XeTeX,  you can write a TECkit map.
>>
>> I do have a map now. Can someone tell me how to do the conversion "on
>> the fly" in XeLaTeX? I did see the command line option
>> "-translate-file=TCXNAME", but for that it says "(ignored)".
>>
> \usepackage{fontspec}
> \setmainfont[Mapping=mapname]{fontname}
>
> or
>
> \fontspec[Mapping=mapname]{fontname}
>
> TCX tables are used in pdftex and the table is used for the whole
> document (and cannot be changed). TECkit map is applied in XeTeX per
> font and can even be replaced (eg in a group) by
> \addfontfeatures{Mapping=mapname}.
>
>> Dan
>>
>>
>> On Tue, Oct 18, 2011 at 2:53 PM, Zdenek Wagner <zdenek.wagner at gmail.com> wrote:
>>> 2011/10/17 Daniel Greenhoe <dgreenhoe at gmail.com>:
>>>> I know that this is not really the right mailing list for this
>>>> question, but I have so far not found the answer by any other means
>>>> ...
>>>>
>>>> I would like to find or write some a utility that would take an
>>>> unicode encoded file and map Chinese traditional characters to
>>>> simplified, while leaving all other code points (such  as those in the
>>>> Latin and IPA code spaces) untouched. For example, the traditional
>>>> character for horse (馬) is at unicode U+99AC, the simplified one (马)
>>>> is at unicode U+9A6C, and the Latin character for "A" is at U+0041. So
>>>> I want a utility that would change the 99AC to 9A6C, but leave the
>>>> 0041 unchanged.
>>>>
>>> If it is really that simple 1:1 mapping, you can just use tr, it does
>>> exactly that if you supply the map. If you wish to do it on the fly in
>>> XeTeX, you can write a TECkit map. Having the TECkit map you can also
>>> run txtconv from the command line.
>>>
>>>> Does anyone know of such a utility? Does anyone know of any data base
>>>> with a traditional to simplified character mapping such that I could
>>>> maybe write the utility myself?
>>>>
>>>> Many thanks in advance,
>>>> Dan
>>>>
>>>>
>>>>
>>>> --------------------------------------------------
>>>> Subscriptions, Archive, and List information, etc.:
>>>>  http://tug.org/mailman/listinfo/xetex
>>>>
>>>
>>>
>>>
>>> --
>>> Zdeněk Wagner
>>> http://hroch486.icpf.cas.cz/wagner/
>>> http://icebearsoft.euweb.cz
>>>
>>>
>>>
>>> --------------------------------------------------
>>> Subscriptions, Archive, and List information, etc.:
>>>  http://tug.org/mailman/listinfo/xetex
>>>
>>
>>
>>
>> --------------------------------------------------
>> Subscriptions, Archive, and List information, etc.:
>>  http://tug.org/mailman/listinfo/xetex
>>
>
>
>
> --
> Zdeněk Wagner
> http://hroch486.icpf.cas.cz/wagner/
> http://icebearsoft.euweb.cz
>
>
>
> --------------------------------------------------
> Subscriptions, Archive, and List information, etc.:
>  http://tug.org/mailman/listinfo/xetex
>



More information about the XeTeX mailing list