[Fontinst] Re-encoded ligatures and searching
Vladimir Volovich
vvv at vsu.ru
Fri Oct 22 17:36:11 CEST 2004
"LH" == Lars Hellström writes:
LH> 1. The nice mechanism for mapping glyphs to characters (for
LH> searching) in a PDF file is via something called a ToUnicode
LH> CMap. This can be part of any Font dictionary, but at least
LH> pdfTeX doesn't ever seem to generate any (it certainly hasn't got
LH> any source for the information).
But LaTeX does have a source for the information (in the form of the
font encoding), and can issue commands to pdfTeX to include the
corresponding CMap file for each font.
The cmap package on CTAN:macros/latex/contrib/cmap should be able to
do this. If your document uses the T1 font encoding, then simply
adding \usepackage{cmap} to the preamble before
\usepackage[T1]{fontenc} should solve this problem: each font is
then associated with the CMap for its encoding, and the PDF file
should become searchable even if the font uses inconsistent glyph
names.
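For instance, a minimal test document might look like the sketch
below. The document class and the sample sentence are only
illustrative assumptions; any text containing fi/fl-type ligatures
would do. What matters is that cmap is loaded before fontenc.

```latex
\documentclass{article}
\usepackage{cmap}          % load first, so the CMap gets attached to each font
\usepackage[T1]{fontenc}   % the T1 encoding is what cmap maps to Unicode
\begin{document}
% Sample words chosen to produce the ffi, ffl, fi and fl ligatures.
Office, shuffle, find, flow.
\end{document}
```

After running this through pdflatex, searching the resulting PDF for
a word such as "Office" is a quick way to check whether the ligature
glyphs are being mapped back to their characters.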
Best,
v.