[luatex] BOM

Taco Hoekwater taco at elvenkind.com
Sat May 16 00:37:41 CEST 2009


Reinhard Kotucha wrote:
> 
> But in this case, however, they *are* avoidable.  There are a few
> broken editors, but there are many other ones which support UTF-8
> perfectly.  Is this really a luatex issue?
>
> I'm convinced that a BOM in a UTF-8 file is a severe bug

The Unicode spec disagrees with you (16.8):

   In UTF-8, the BOM corresponds to the byte sequence <EF16 BB16 BF16>.
   Although there are never any questions of byte order with UTF-8 text,
   this sequence can serve as signature for UTF-8 encoded text where the
   character set is unmarked.

Best wishes,
Taco


More information about the luatex mailing list