[Tugindia] Closeup of word spaces in PDF text extraction
Suresh Avvarulakshmi, Integra-PDY, IN
suresh.avvarulakshmi at integra.co.in
Wed Apr 6 06:24:07 CEST 2011
On extracting text from PDF (created from dvips driver) certain word spaces are getting closed up, like the characters followed by 'W'. This happens only for serif fonts like Times-Bold, and not with the sans serif fonts like Helvetica.
Can anyone advice, how to get the correct extraction of text from the LaTeX generated PDFs?
Note: Effective immediately my email address has changed to suresh.avvarulakshmi at integra.co.in; kindly update your address book to reflect this change. Thank you.
This email and any accompanying attachments is for the sole use of the intended recipient(s) and may contain confidential and privileged information. Any unauthorized review, use, disclosure, distribution, or copying is strictly prohibited. If you are not the intended recipient of this communication or received the email by mistake, please notify the sender and destroy all copies. Integra Software Services Pvt Ltd. reserves the right, subject to applicable local law, to monitor and review the content of any electronic message or information sent to or from its company allotted employee email address/ID without informing the sender or recipient of the message.
More information about the tugindia