[XeTeX] Follow-up on CJK (Unicode) and XeTeX (xelatex)

Roger Hart rhart at mail.utexas.edu
Thu Feb 24 18:00:58 CET 2005


Dear Jonathan,

Thank you, again, so very much for your expert and most generous 
assistance, and for your very prompt work-around to wrap CJK 
characters.  It does work, at least as a temporary solution. I was so 
amazed by XeTeX's capabilities that I spent the entire rest of the day 
testing various fonts and packages to try to assess how difficult it 
might be to immediately convert all my work to xelatex!

In offering the following observations, I should note that I am not in 
any way an expert but just an end-user very grateful for all the 
incredible work you and others have done:

First, XeTeX is absolutely amazing in how easy it is to set up fonts. 
For CJK, at least to my knowledge, the only font that can be used is 
Bitstream Cyberbit: the larger character sets in CJK are incompatible 
with many of the usual tools for setting up TeX fonts, and setting up 
even Cyberbit requires considerable effort, resulting in a kind of 
Catch-22, where someone wanting to start using LaTeX with CJK must 
already know a considerable amount about LaTeX just to get CJK 
installed. Using XeTex, simply by typing in a font name, it is possible 
to change, for example, to the 60,000+ character font Simsun (Foundry 
Extended), or to any other style of Chinese characters.

All the Truetype fonts work perfectly as far as I can tell. I should 
note that I did have several problems with the Type 1 (MOE Sung 
Regular, MOE Kai Regular) and some of the OpenType (STSong Std Acro 
Light and MSung Std Acro Light) fonts I have installed on my computer. 
I could not identify the cause of the problem, but the results were 
unpredictable, possible relating to memory errors, for example, 3/4 of 
the characters not printing, English printing without font 
characteristics, English without spaces, etc. But these fonts seem to 
me to be inferior, covering too few characters, and not worth worrying 
about. It seems that the OpenType font Hiragino worked fine. I don't 
have a good PostScript or OpenType traditional Chinese font, and was 
wondering if others might have found ones that work?

For bibliographies, I briefly tested natbib and jurabib. natbib seems 
to work perfectly and writes .bbl files in UTF-8; jurabib does *not* 
write files in UTF-8 and thus failed when CJK was included in entries. 
I don't know if there is a way to fix this.

Unfortunately, the wonderful package ps4pdf does not work under 
xelatex.  I assume it would not be hard to fix or find a work-around.

Did I overlook something simple here, but is there a way to make 
xelatex in Computer Modern (no roman font specified) recognize the 
Unicode characters that seem to work under pdflatex, such as m-dashes, 
smart quotes, German and French characters?

Finally, if I might very humbly submit a request -- and I am truly 
humbled by the work you've done on XeTeX -- could you please make 
proper wrapping of CJK a priority for the next release? I think that 
line-wrapping is a very basic capability, it is easy for someone 
installing XeTeX, on finding CJK lines don't wrap, to just assume that 
XeTeX is not compatible with CJK.

Again, many, many thanks,

Roger

> One thing on my "to-do" list for XeTeX is to add an option to 
> automatically allow line-breaking within runs of CJK characters, 
> similarly to automatic hyphenation of English (but naturally, not 
> inserting hyphens!) If/when I get this done, it'll be easy to deal 
> with text like this.
>
> Meanwhile, a possible approach would be to make the CJK ideographs 
> into "active" characters that insert skip and/or penalty items around 
> themselves, to permit line-breaking. This isn't really a good 
> long-term solution, but it may be useful for the time being. See the 
> attached extended version of the CJK example.
>
********************************

Roger Hart
Assistant Professor, Departments of History and Asian Studies
University of Texas at Austin

office: Room 405, Garrison Hall
office phone: 512-475-7258
department fax: 512-475-7222
email: rhart at mail.utexas.edu
http://uts.cc.utexas.edu/~rhart

*********************************



More information about the XeTeX mailing list