[luatex] hyphenating ancient Greek

Sat Jun 12 22:55:56 CEST 2010

On 12 June 2010 Khaled Hosny wrote:

 > On Sat, Jun 12, 2010 at 07:16:48PM +0100, Robin Fairbairns wrote:
 > > Patrick Gundlach <patrick at gundla.ch> wrote:
 > > 
 > > > ... to set lccode for every greek letter (as far as understand it, but someone else should confirm).
 > > > 
 > > > Underfull \hbox (badness 10000) in paragraph at lines 29--29
 > > > [][] \EU2/cmr/m/n/12 χα-λε-πὰ
 > > > 
 > > > ... whishing that this could be done at the engine level....
 > > 
 > > what?  setting uc/lccode tables?  that would require a further primitive
 > > to specify the language (and probably also writing scheme -- i'm not
 > > enough of a linguist to know), since the meanings and relations of
 > > glyphs differ in some languages.
 > > 
 > > type example:
 > > 
 > > in english, the uccode of i gives you I
 > > in turkish, the uccode of i gives you \.I
 > >        (and the lccode of I gives you \i)
 > 
 > Unicode have uppercase/lowe case matching some where in its data, so we
 > could just follow whatever unicode suggests.

Sure, but I fear that Robin's objection is still valid.  Unicode
treats Turkish as an exception too, see:

   http://unicode.org/Public/UNIDATA/CaseFolding.txt

As far as I understand you have to know in advance whether you are
typesetting Turkish.  A static table for all languages isn't
sufficient.

It would be much more convenient if case folding wouldn't depend on
the language, i.e. the Turkish "i" had a separate code point.

However, I think it's solvable.

Regards,
  Reinhard

-- 
----------------------------------------------------------------------------
Reinhard Kotucha			              Phone: +49-511-3373112
Marschnerstr. 25
D-30167 Hannover	                      mailto:reinhard.kotucha at web.de
----------------------------------------------------------------------------
Microsoft isn't the answer. Microsoft is the question, and the answer is NO.
----------------------------------------------------------------------------