On Tue, Dec 15, 2009 at 5:50 AM, Juan Acevedo <span dir="ltr"><<a href="mailto:juan.acevedo.juan@gmail.com">juan.acevedo.juan@gmail.com</a>></span> wrote:<br><div class="gmail_quote"><blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">
<div dir="ltr">Dear all,<br><br>I have unfortunately forgotten, and now fail to find the answer to this basic RTL text question:<br><br>Talking about inputing Arabic Unicode text, what is the preferred input typing sequence, to obtain a better final output?; here is a case:<br>
<br>The word "al-.hayyu" (لحيُّ) ends with ya-shadda-damma<br>No matter my typing sequence, the rendering will be the same (damma on top of shadda), </div></blockquote><div><br>That would be an error. See Unicode 5.2 Section 2.11, p. 43. For Arabic in particular, section 8.2, p. 240: "The Unicode Standard does not specify a sequence order in case of multiple harakat applied to the same Arabic base character, as there is no possible ambiguity of interpretation."<br>
</div><blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;"><div dir="ltr"><br>Similar cases include combinations that involve shadda+tanwin combinations, like<br>
".hayyaN" (حيًّا)<br></div></blockquote></div><br>Proper sequencing is tanween, then alif, but unfortunately the reverse order is more common even in the Arab world, due to the incorrect notion that the alif is the "seat" of the tanween.<br>
<br>It's not just about typesetting but also about search.<br><br>-gregg<br>