<p dir="ltr">Can anyone point me to docs on TeX--XeT? A Google search the other day failed to turn up anything useful.</p>
<p dir="ltr">Also: polyglossia appears to be doing some amount of LTR/RTL directionality switching based on the character block. Can anyone offer advice on how to avoid fighting with that, if I'm implementing my own bidi algorithm?</p>
<p dir="ltr">Finally: any advice on using CJK languages with polyglossia? Embedded CJK is quite common. Should I be writing gloss-ja etc. files to set the right directionality and font and to get the appropriate CJK support packages loaded?<br>
--scott</p>
<div class="gmail_quote">On Dec 5, 2013 5:42 AM, "Jonathan Kew" &lt;<a href="mailto:jfkthame@googlemail.com">jfkthame@googlemail.com</a>&gt; wrote:<br type="attribution"><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
On 4/12/13 13:24, C. Scott Ananian wrote:<br>
<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
The goal is to match the Unicode bidi algorithm, because that is how the<br>
web page displays and thus how the original author saw the text as they<br>
wrote.<br>
</blockquote>
<br>
This would be a nice enhancement, but would require a significant amount of work (or in other words, it's not likely to get implemented quickly, if at all).<br>
<br>
Currently, typesetting bidi text with xetex requires correct use of the TeX--XeT bidi commands (\beginR, \endR, \beginL, \endL) to mark up the text direction. These could be used directly, or via higher-level markup that's tagging script and language, but you definitely need them to be present in some way.<br>
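<br>
A minimal sketch of how a converter script might emit those primitives. The helper name wrap_direction is hypothetical, and the sketch assumes TeX--XeT has already been switched on (\TeXXeTstate=1, set either directly or by a package) and that the text contains no unescaped TeX special characters:<br>

```python
def wrap_direction(text, direction):
    """Wrap text in the TeX--XeT primitives for one direction.

    direction is "L" (left-to-right) or "R" (right-to-left).
    The empty groups {} stop TeX from swallowing a space that
    directly follows the control word.
    """
    if direction not in ("L", "R"):
        raise ValueError("direction must be 'L' or 'R'")
    return "\\begin" + direction + "{}" + text + "\\end" + direction + "{}"


if __name__ == "__main__":
    # An English title embedded in right-to-left text.
    print(wrap_direction("Hello world", "L"))
```

The helper only produces the markup. Deciding which runs need wrapping is a separate step.<br>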
<br>
Sorry, that's not what you want to hear, but it's how things are. At this point, I think the most practical way forward in your situation is probably to implement this as part of whatever tool is taking the wikipedia content and converting it to (Xe)LaTeX markup - that tool could inspect the content of each element it's processing, and add any necessary direction controls for XeTeX.<br>
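<br>
One way such a conversion tool could add the direction controls is sketched below. This is a deliberate simplification and not the Unicode bidi algorithm (UAX #9): it assumes an RTL paragraph, treats digits like neutrals, ignores explicit embedding characters, and does not escape TeX special characters. The function names are hypothetical. Spaces and punctuation between two LTR characters stay inside the LTR run, so the words of an embedded English title keep their order:<br>

```python
import unicodedata


def strong_direction(ch):
    """Return "L" or "R" for strongly directional characters, else None."""
    bidi_class = unicodedata.bidirectional(ch)
    if bidi_class == "L":
        return "L"
    if bidi_class in ("R", "AL"):
        return "R"
    return None  # spaces, punctuation, digits: handled as neutral here


def mark_ltr_runs(text):
    """Wrap embedded LTR runs of an RTL paragraph in \\beginL ... \\endL."""
    out = []      # finished output pieces
    run = []      # characters of the LTR run being collected
    pending = []  # neutrals seen since the last LTR character

    def flush_run():
        if run:
            out.append("\\beginL{}" + "".join(run) + "\\endL{}")
            run.clear()

    for ch in text:
        direction = strong_direction(ch)
        if direction == "L":
            # Neutrals enclosed by LTR characters join the LTR run.
            run.extend(pending)
            pending.clear()
            run.append(ch)
        elif direction == "R":
            # Close the LTR run; trailing neutrals stay in RTL context.
            flush_run()
            out.extend(pending)
            pending.clear()
            out.append(ch)
        elif run:
            pending.append(ch)
        else:
            out.append(ch)

    flush_run()
    out.extend(pending)
    return "".join(out)


if __name__ == "__main__":
    hebrew = "\u05e9\u05dc\u05d5\u05dd"
    print(mark_ltr_runs(hebrew + " Hello world " + hebrew))
```

Numbers and mixed-direction punctuation need more care than this. A real tool would have to follow the full algorithm to match what a browser displays.<br>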
<br>
JK<br>
<br>
<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
Guessing the proper language tag to use is likely infeasible;<br>
note that the example given contains titles in Turkish as well as<br>
English. The safest option is probably to treat embedded LTR text in an<br>
RTL context as 'exotic' and not to attempt hyphenation.<br>
<br>
I've heard it said that LuaTeX has "better bidi support". What does<br>
that mean, exactly? Should I be considering switching?<br>
--scott<br>
<br>
On Dec 4, 2013 4:08 AM, "Keith J. Schultz" &lt;<a href="mailto:schultzk@uni-trier.de" target="_blank">schultzk@uni-trier.de</a>&gt; wrote:<br>
<br>
Hi Scott,<br>
<br>
On 03.12.2013 at 19:42, C. Scott Ananian &lt;<a href="mailto:cscott@cscott.net" target="_blank">cscott@cscott.net</a>&gt; wrote:<br>
<br>
><br>
> But in the XeLaTeX/polyglossia/bidi output, the "soft space" weak<br>
> directionality of the Unicode BiDi algorithm doesn't seem to be<br>
> honored (or implemented?) and so the English article titles appear<br>
> with the individual words in RTL order, which is a mess. Manually<br>
> tagging the language of the article title is probably the Right thing,<br>
> but infeasible for the entire wikipedia.<br>
Well, without proper tagging you cannot expect any system to<br>
work properly or as expected!<br>
For most entries a simple script should do the trick to add the<br>
language tags to the article titles.<br>
<br>
Hope this helps<br>
regards<br>
Keith.<br>
<br>
<br>
--------------------------------------------------<br>
Subscriptions, Archive, and List information, etc.:<br>
<a href="http://tug.org/mailman/listinfo/xetex" target="_blank">http://tug.org/mailman/listinfo/xetex</a><br>
<br>
</blockquote>
<br>
</blockquote></div>