[Fontinst] Re-encoded ligatures and searching

Vladimir Volovich vvv at vsu.ru
Fri Oct 22 17:36:11 CEST 2004


"LH" == Lars Hellström writes:

 LH> 1. The nice mechanism for mapping glyphs to characters (for
 LH> searching) in a PDF file is via something called a ToUnicode
 LH> CMap. This can be part of any Font dictionary, but at least
 LH> pdfTeX doesn't ever seem to generate any (it certainly hasn't got
 LH> any source for the information).

But LaTeX does have a source for the information (in the form of font
encoding), and can issue the commands to pdftex to include the
corresponding CMap file for each font.

The cmap package on CTAN:macros/latex/contrib/cmap should be able to
do this. If your document uses the T1 font encoding, then simply
including the \usepackage{cmap} in the preamble before
\usepackage[T1]{fontenc} should be able to solve this problem,
i.e. the font should be associated with the CMap encoding and the PDF
file should become searchable even if the font uses inconsistent glyph
names.

Best,
v.



More information about the fontinst mailing list