taco at elvenkind.com
Sat May 16 00:37:41 CEST 2009
Reinhard Kotucha wrote:
> But in this case, however, they *are* avoidable. There are a few
> broken editors, but there are many other ones which support UTF-8
> perfectly. Is this really a luatex issue?
> I'm convinced that a BOM in a UTF-8 file is a severe bug
The Unicode spec disagrees with you (16.8):
In UTF-8, the BOM corresponds to the byte sequence <EF16 BB16 BF16>.
Although there are never any questions of byte order with UTF-8 text,
this sequence can serve as signature for UTF-8 encoded text where the
character set is unmarked.
More information about the luatex