# [luatex] BOM

Yannis Haralambous yannis.haralambous at telecom-bretagne.eu
Thu May 14 18:36:08 CEST 2009

This has probably already been brought up, but please take care of the
BOM (byte order mark, U+FEFF): it must be ignored by the LuaTeX engine
when it appears at the very beginning of an input file.

Here is why: the BOM is useful when writing in UCS-2 (or UTF-16) to find
out whether the file is written in
big-endian or little-endian order. In UTF-8 it makes no sense because
UTF-8 is written byte-wise, in logical order.
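
To make the difference concrete, here is a small Python sketch (an illustration only, not anything LuaTeX does) that encodes U+FEFF in each scheme. The helper name `describe_bom` is made up for this example, and UTF-32 is deliberately left out.

```python
# U+FEFF encoded in each scheme: only UTF-16 has two possible byte orders.
BOM = "\ufeff"

utf16_be = BOM.encode("utf-16-be")  # bytes FE FF
utf16_le = BOM.encode("utf-16-le")  # bytes FF FE
utf8 = BOM.encode("utf-8")          # bytes EF BB BF, always in this order


def describe_bom(data: bytes) -> str:
    """Report what a leading BOM, if any, says about the data.

    Hypothetical helper for illustration; UTF-32 is not handled.
    """
    if data.startswith(utf8):
        return "UTF-8 signature (carries no byte-order information)"
    if data.startswith(utf16_be):
        return "UTF-16 big-endian"
    if data.startswith(utf16_le):
        return "UTF-16 little-endian"
    return "no BOM"


if __name__ == "__main__":
    for sample in (utf16_be, utf16_le, utf8, b"\\documentclass"):
        print(sample[:3].hex(" "), "->", describe_bom(sample))
```

In UTF-16 the two byte sequences distinguish the two byte orders; in UTF-8 there is only one possible sequence, so the mark tells the reader nothing about ordering.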

Nevertheless, software like M$ Notepad (under Vista) will
systematically insert a BOM at the beginning of the file (and I didn't
find any way to prevent it).

Other text editors, such as Ultra-Edit (Win) or BBEdit (Mac) will let
the user choose, but by default they will still insert a BOM.

LaTeX then sees a character at the beginning of the file which is not a
backslash or a comment, and stops because there should
be no text character before \begin{document}.

If one could decide, once and for all, to ignore that character, that
would be best. Using Lua code for that would be a waste of time
and energy...
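
For clarity, what "ignoring that character" amounts to can be sketched in a few lines of Python. This is only an illustration of the desired behaviour, applied to raw file bytes outside the engine; it is not how LuaTeX reads its input, and the names `UTF8_BOM` and `strip_utf8_bom` are invented for the example.

```python
# The UTF-8 encoding of U+FEFF, as written by Notepad at the start of a file.
UTF8_BOM = b"\xef\xbb\xbf"


def strip_utf8_bom(data: bytes) -> bytes:
    """Drop a UTF-8 BOM if, and only if, it is the very first thing in the data."""
    if data.startswith(UTF8_BOM):
        return data[len(UTF8_BOM):]
    return data


# A file as saved by an editor that prepends a BOM.
source = UTF8_BOM + b"\\documentclass{article}\n"
cleaned = strip_utf8_bom(source)

if __name__ == "__main__":
    print(cleaned)
```

Only a leading mark is removed; a U+FEFF occurring later in the file is left alone, since there it is an ordinary (if unusual) character.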

cheers

yh


