<div dir="ltr">Dear David,<div><br></div><div>It would work better if you started from some other source that </div><div>generated the PDF, for example: tex, Word, InDesign, ... </div><div><br></div><div>You mention, towards the end of your short sentence, that the </div><div>paper includes a "source document". What is the format of this </div><div>source? That may be your best option.</div><div><br></div><div>But if you do NOT have the sources that produce it, one of the best </div><div>(second) options to start with is a PDF text extractor.</div><div><br></div><div>There are tons of them for Windows, MacOS and Linux and even</div><div>apps like:</div><div><br></div><div> <a href="https://www.pdftext.net/">https://www.pdftext.net/</a></div><div><br></div><div>I use "pdftotext", part of the popular Poppler suite.</div><div><br></div><div>That should work for the majority of PDF files out there, but not all. </div><div>The ones that are produced by scanners without OCR passes will</div><div>not contain text and there is nothing to extract that way. In this case</div><div>your option is to pass it by and OCR.</div><div><br></div><div>Please give us more detail, and we will probably be able to help you better.</div><div><br></div><div>Best,</div><div>Paulo Ney</div><div><br></div><div><br></div></div><br><div class="gmail_quote"><div dir="ltr" class="gmail_attr">On Fri, May 6, 2022 at 8:22 AM David Jonah via texhax <<a href="mailto:texhax@tug.org">texhax@tug.org</a>> wrote:<br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">I want to convert a .pdf document to a LaTeX document. The paper has superscripts, an index, and a source document. <br>
<br>
Sent from my iPad<br>
</blockquote></div>