[tex-hyphen] [lltx] spell checking TeX files (was: [Trennmuster] Replacing the old hyphenation patterns with new ones)
adityam at umich.edu
Wed Oct 3 22:15:38 CEST 2012
On Wed, 3 Oct 2012, Stephan Hennig wrote:
> [CC'ing lualatex-dev at tug.org and tex-hyphen at tug.org,
> since spell checking is of international concern.
> Please reply to lualatex-dev at tug.org.]
> Am 02.10.2012 16:01, schrieb Pander:
>> You can mention that the Dutch patterns are being processed by OpenTaal.
>> They are put on hold since we are working very hard on the next version
>> of spell checking at the moment.
> You're speaking about spell checking, not hyphenation, right? Could you
> please elaborate a bit?
> I've recently thought about spell checking of TeX documents and came up
> with the following idea that requires LuaTeX's node list manipulations:
> 1. In the first LuaTeX run, write all typeset text into a UTF-8 encoded
> text file.
> 2. Feed that text file to your favourite spell checker, generating a
> list of bad words.
> 3. In the second run, LuaTeX reads-in the list of bad words and puts a
> red wavy line under all bad words in the document. A possible approach
> is to mark nodes corresponding to a bad word in pre_linebreak_filter
> with an attribute so that they can be identified later.
FWIW, you can also look at how ConTeXt implements the on the fly spell
The implementation details are in lang-wrd.mkiv.
More information about the tex-hyphen