[XeTeX] handling malformed UTF-8 input

Marcin Woliński wolinski at mimuw.edu.pl
Wed Feb 20 13:07:26 CET 2008


Jonathan,

> OK, motivated by this I have just committed a patch to the xetex  
> repository that checks for valid UTF-8 sequences (when reading a file  

Thank you very much for the quick reaction.

> as UTF-8, of course). If an invalid sequence is encountered, it will  
> give a warning (in the log, unless \tracingonline is positive), and  
> read the remainder of the file as "bytes". This will often be wrong,  

What does "bytes" exactly mean?  That the rest of file will be
interpreted as ISO 8859-1?

Marcin




More information about the XeTeX mailing list