[tex-hyphen] [lltx] spell checking TeX files (was: [Trennmuster] Replacing the old hyphenation patterns with new ones)

Aditya Mahajan adityam at umich.edu
Wed Oct 3 22:15:38 CEST 2012


On Wed, 3 Oct 2012, Stephan Hennig wrote:

> [CC'ing lualatex-dev at tug.org and tex-hyphen at tug.org,
> since spell checking is of international concern.
> Please reply to lualatex-dev at tug.org.]
>
> Am 02.10.2012 16:01, schrieb Pander:
>
>> You can mention that the Dutch patterns are being processed by OpenTaal.
>> They are put on hold since we are working very hard on the next version
>> of spell checking at the moment.
>
> You're speaking about spell checking, not hyphenation, right?  Could you
> please elaborate a bit?
>
> I've recently thought about spell checking of TeX documents and came up
> with the following idea that requires LuaTeX's node list manipulations:
>
> 1. In the first LuaTeX run, write all typeset text into a UTF-8 encoded
> text file.
>
> 2. Feed that text file to your favourite spell checker, generating a
> list of bad words.
>
> 3. In the second run, LuaTeX reads-in the list of bad words and puts a
> red wavy line under all bad words in the document.  A possible approach
> is to mark nodes corresponding to a bad word in pre_linebreak_filter
> with an attribute so that they can be identified later.

FWIW, you can also look at how ConTeXt implements the on the fly spell 
checking:

http://wiki.contextgarden.net/Spellchecker

The implementation details are in lang-wrd.mkiv.

Aditya


More information about the tex-hyphen mailing list