[luatex] [XeTeX] Assignment of codes (particularly \catcode) based on Unicode data

Wed May 6 22:24:23 CEST 2015

On 06/05/2015 21:06, David Carlisle wrote:
> On 6 May 2015 at 20:15, Philip Taylor <P.Taylor at rhul.ac.uk> wrote:
>>
>>
>> Apostolos Syropoulos wrote:
>>
>>> It seems to me that most people have no idea what Unicode is and what is really
>>> involved.
>>
>> OK, so if we restrict the Universe of Discourse to the set of native
>> Hellenic speakers who know what Unicode is, know the importance of being
>> able to use it to identify the correct upper case of (for example)
>> 'GREEK SMALL LETTER EPSILON WITH PSILI', and hold an informed opinion on
>> the matter, would you expect that 100% of these would agree that the
>> uppercase is 'GREEK CAPITAL LETTER EPSILON' and not 'GREEK CAPITAL
>> LETTER EPSILON WITH PSILI', or would you expect that some percentage
>> (perhaps small) would hold the opposite point of view?
>>
>> ** Phil.
>>
> 
> I don't think that's the right question. Even if everyone, including
> the Unicode technical committee, agreed some properties are incorrect
> for some characters, it isn't clear we should change them at this
> level.
> 
> I think that unicode-letters.def makes most sense as a
> fully automated representation of the UCD data files in TeX syntax.
> 
> That way everyone knows what data is in there.
> 
> Individual language packages have far fewer characters to worry about
> and can override the base settings where appropriate.

Indeed: provided hyphenation is correct then we are OK. (LuaTeX of
course is rather more flexible there than XeTeX.)
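
To make the "fully automated representation" idea concrete, here is a
minimal sketch in Python of deriving TeX code assignments mechanically
from records in UnicodeData.txt format. It is an illustration only, not
the script that actually generates unicode-letters.def: the function
name tex_assignments, the choice to give every general category L*
character \catcode 11, and the use of bare \catcode/\lccode/\uccode
primitives are all assumptions made for the example.

```python
# Three records in UnicodeData.txt format (semicolon-separated fields).
# Field 2 is the general category; fields 12 and 13 are the simple
# uppercase and simple lowercase mappings (empty when there is none).
SAMPLE = """\
0395;GREEK CAPITAL LETTER EPSILON;Lu;0;L;;;;;N;;;;03B5;
1F10;GREEK SMALL LETTER EPSILON WITH PSILI;Ll;0;L;03B5 0313;;;;N;;;1F18;;1F18
1F18;GREEK CAPITAL LETTER EPSILON WITH PSILI;Lu;0;L;0395 0313;;;;N;;;;1F10;
"""


def tex_assignments(ucd_text):
    """Return TeX assignments for every letter found in ucd_text.

    Hypothetical helper: it copies the UCD mappings verbatim and applies
    no language-specific judgement.
    """
    lines = []
    for record in ucd_text.splitlines():
        if not record.strip():
            continue
        fields = record.split(";")
        code, category = fields[0], fields[2]
        if not category.startswith("L"):
            continue  # assumption: only letters are given codes here
        upper = fields[12] or code  # no mapping: character maps to itself
        lower = fields[13] or code
        lines.append(f'\\catcode"{code}=11')
        lines.append(f'\\lccode"{code}="{lower}')
        lines.append(f'\\uccode"{code}="{upper}')
    return lines


if __name__ == "__main__":
    print("\n".join(tex_assignments(SAMPLE)))
```

With data generated this way, U+1F10 uppercases to U+1F18 because that
is what the UCD records. A language package that prefers the form
without the psili could then override that single entry itself, for
example with \uccode"1F10="0395, and leave the generated file untouched.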
--
Joseph Wright