[tex-hyphen] hyphenation with ligatures in input

Jonathan Kew jfkthame at googlemail.com
Thu Jan 16 00:11:27 CET 2014


On 15/1/14 20:00, Philip Taylor wrote:
>
>
> Arthur Reutenauer wrote:
>
>>    You should.  Characters such as U+FB01 are deprecated and shouldn't be
>> used in text.
>
> /Characters/ ... : yes.  But consider a Unicode-in/Unicode-out
> preprocessor; might it not generate ﬁ in the output stream,
> since it thinks it is generating glyphs,

A preprocessor generating such a "Unicode-out" stream would necessarily
be confused, because Unicode is not a glyph encoding; it is a character
encoding.

If a preprocessor wants to generate a stream of glyphs rather than 
characters, it needs to do so according to some glyph encoding standard 
(which is usually font-specific), and this can no longer be expected to 
work in conjunction with Unicode character-based hyphenation patterns.
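
As a rough sketch of how a pipeline stage could turn stray ligature
code points back into plain characters before character-based
hyphenation patterns are applied: the helper name fold_ligatures is
hypothetical, and restricting the folding to U+FB00..U+FB06 (the Latin
ligatures in the Alphabetic Presentation Forms block) is an assumption
made here to avoid the other changes that full NFKC normalization
would bring.

```python
import unicodedata

def fold_ligatures(text: str) -> str:
    """Replace Latin ligature code points (U+FB00..U+FB06) with their letters.

    Hypothetical helper: only this small range is folded, using each
    code point's compatibility decomposition via NFKC. All other
    characters pass through untouched.
    """
    out = []
    for ch in text:
        if "\ufb00" <= ch <= "\ufb06":
            out.append(unicodedata.normalize("NFKC", ch))
        else:
            out.append(ch)
    return "".join(out)

print(fold_ligatures("de\ufb01ne"))  # define
```

With the input folded this way, patterns written in terms of the
characters f and i can match again; whether such folding belongs in
the pipeline at all is a separate design decision.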

JK

> yet in a pipeline
> environment that output might get re-used as TeX input ...
>
> ** Phil.
> .
>


