[XeTeX] xetex doesn't recognize/replace all invalid utf8 bytes

Peter Dyballa Peter_Dyballa at Web.DE
Wed Dec 30 11:07:36 CET 2009


Am 30.12.2009 um 01:54 schrieb Herbert Schulz:

> Is there a ``common'' name for that encoding?


It could be "CP1252" as Mac OS X provides for example these:

/usr/share/locale/ru_RU.CP1251
/Applications/Adobe Reader.app/Contents/MacOS/Resource/TypeSupport/ 
Unicode/Mappings/win/CP874.TXT
/Applications/Adobe Reader.app/Contents/MacOS/Resource/TypeSupport/ 
Unicode/Mappings/win/CP932.TXT
/Applications/Adobe Reader.app/Contents/MacOS/Resource/TypeSupport/ 
Unicode/Mappings/win/CP936.TXT
/Applications/Adobe Reader.app/Contents/MacOS/Resource/TypeSupport/ 
Unicode/Mappings/win/CP949.TXT
/Applications/Adobe Reader.app/Contents/MacOS/Resource/TypeSupport/ 
Unicode/Mappings/win/CP950.TXT
/Applications/Adobe Reader.app/Contents/MacOS/Resource/TypeSupport/ 
Unicode/Mappings/win/CP1250.TXT
/Applications/Adobe Reader.app/Contents/MacOS/Resource/TypeSupport/ 
Unicode/Mappings/win/CP1251.TXT
/Applications/Adobe Reader.app/Contents/MacOS/Resource/TypeSupport/ 
Unicode/Mappings/win/CP1252.TXT
/Applications/Adobe Reader.app/Contents/MacOS/Resource/TypeSupport/ 
Unicode/Mappings/win/CP1253.TXT
/Applications/Adobe Reader.app/Contents/MacOS/Resource/TypeSupport/ 
Unicode/Mappings/win/CP1254.TXT
/Applications/Adobe Reader.app/Contents/MacOS/Resource/TypeSupport/ 
Unicode/Mappings/win/CP1255.TXT
/Applications/Adobe Reader.app/Contents/MacOS/Resource/TypeSupport/ 
Unicode/Mappings/win/CP1256.TXT
/Applications/Adobe Reader.app/Contents/MacOS/Resource/TypeSupport/ 
Unicode/Mappings/win/CP1257.TXT
/Applications/Adobe Reader.app/Contents/MacOS/Resource/TypeSupport/ 
Unicode/Mappings/win/CP1258.TXT
/Developer/SDKs/MacOSX10.5.sdk/System/Library/Frameworks/ 
Ruby.framework/Versions/1.8/usr/lib/ruby/1.8/rexml/encodings/CP-1252.rb

You could also invoke on the command line:

	iconv -l | grep CP

The iconv utility is meant to convert file contents between many  
encodings. It lists, among others:

	CP1252 MS-ANSI WINDOWS-1252

--
Greetings

   Pete

Almost anything is easier to get into than out of.
				– Allen's Law



More information about the XeTeX mailing list