[pdftex] ToUnicode CMap

Hans Hagen pragma at wxs.nl
Thu Feb 22 10:57:34 CET 2001


At 11:39 AM 2/22/01 +0300, A.V. Kuznetsov wrote:
>As known, there are two aspects of the language problem in pdf and
>ps documents: showing of a text and determination of a text
>content. The former is successfully solved in pdftex, dvips and
>dvipdfm by embedding of needed fonts but the latter is still open
>for Cyrillic and many other languages.
>
>To solve the latter problem one should embed ToUnicode CMap with
>the font (PDF 1.3 Reference 5.9 sec.ed.). This CMap relates codes
>of font's glyphs to Unicode codes, it can easily be made when such
>relations are known.
>
>It seems to be necessary to embed ToUnicode CMap in embedded font
>program. Data for ToUnicode CMap generation can be obtained from a
>separate file containing code-to-code relations. This file can be
>connected with corresponding font like encoding file in font-map.
>Is it possible in pdftex's future?

This can be done by defining pdf font resources but hooking it into your
font system could be non trivial [i implemented in context a couple of
month on behalf of central european users] 

Hans

-------------------------------------------------------------------------
                                  Hans Hagen | PRAGMA ADE | pragma at wxs.nl
                      Ridderstraat 27 | 8061 GH Hasselt | The Netherlands
 tel: +31 (0)38 477 53 69 | fax: +31 (0)38 477 53 74 | www.pragma-ade.com
-------------------------------------------------------------------------




More information about the pdftex mailing list