[luatex] BOM

Yannis Haralambous yannis.haralambous at telecom-bretagne.eu
Thu May 14 18:36:08 CEST 2009


This has probably already been brought up, but please take care of the  
BOM character: it must
be ignored by the LuaTeX engine.

Here is why: BOM is useful when writing in UCS-16 (or UTF-16) to find  
out whether the file is written in
big-endian or small-endian way. In UTF-8 it makes no sense because  
UTF-8 is written bite-wise, in logical order.

Nevertheless software like M$ Notepad (under Vista) will  
systematically insert a BOM at file begin (and I didn't found any way  
to prevent it).

Other text editors, such as Ultra-Edit (Win) or BBEdit (Mac) will let  
the user choose, but by default they will still insert a BOM.

LaTeX then sees a character at file begin which is not a backslash or  
a comment, and stops because there should
be no text character before \begin{document}.

If one could, once and for all, decide to ignore that character, it  
would be the best. Using lua code for that would be a waste of time  
and energy....

cheers

yh



More information about the luatex mailing list