[tex-hyphen] UTF-ization of hyphenation patterns

Mojca Miklavec mojca.miklavec.lists at gmail.com
Fri May 16 01:27:21 CEST 2008

On Thu, May 15, 2008 at 11:38 PM, Arthur Reutenauer wrote:
>> To clean up some mess in loading the patterns, it would be nice to go
>> the other way around, and start from proper Unicode (UTF-8) patterns
>> and let (pdf)TeX interpret UTF-8 patterns in its own way instead of
>> both XeTeX and LuaTeX having to deal with some really weird encodings
>> in patterns.
>  Great!  Could we also try to get away from the naming mess and use
> standards tags for languages?  The relevant document is IETF Best
> Current Practice 47 (http://tools.ietf.org/html/bcp47) -- currently it's
> RFC 4646 and 4647, but it's under revision.  It's already well on its
> way, but some tags could be modified.  I can prepare the complete list
> of tags for languages in language.dat

Oh, thanks. That would be really welcome. I forgot about that document already.
German needs some improvements in naming indeed. The same is true for
"a variant of US patterns" (ushyphmax). Have fun :)


More information about the tex-hyphen mailing list