[OS X TeX] converting ligatures into text

Lawrence Paulson lp15 at cam.ac.uk
Fri Apr 22 15:37:09 CEST 2005


  I have to extract text from a large number of PDF documents produced 
using TeX. Because (I presume) of TeX's non-standard font encodings, 
cut and paste often goes wrong. In particular, ligatures get garbled: I 
get di±cult instead of difficult.

Does anybody know of a program (or of a definitive set of replacements 
that could be given to Perl) for cleaning up such text?

Larry Paulson

--------------------- Info ---------------------
Mac-TeX Website: http://www.esm.psu.edu/mac-tex/
           & FAQ: http://latex.yauh.de/faq/
TeX FAQ: http://www.tex.ac.uk/faq
List Post: <mailto:MacOSX-TeX at email.esm.psu.edu>





More information about the macostex-archives mailing list