[XeTeX] Incorrect rendering of Vedic Sanskrit accents

ShreeDevi Kumar shreeshrii at gmail.com
Sun May 24 07:30:09 CEST 2015


​Please see http://www.unicode.org/versions/Unicode7.0.0/ch12.pdf
Page 443 and 444
Rule R10

"
Modifier Mark Rules.

In addition to vowel signs, three other types of combining marks may be
applied to a component of an orthographic syllable or to the syllable as a
whole: nukta, bindus, and svaras.

R10

Other modifying marks, in particular bindus and svaras, apply to the
orthographic syllable as a whole and should follow (in the memory
representation) all other characters that constitute the syllable.* The
bindus should follow any vowel signs, and the svaras should come last.*
"

As per the above, "\V{Violet} marks a semantically invalid reordering of
the Unicode characters that happens to produce acceptable output." is
actually the correct ordering of unicode characters:

ta̍ṃ & ta + udatta + anusvara & \R{\1त॑ं}\\


taṃ̍ & ta + anusvara + udatta & \V{\1तं॑} \\


ta̱ṃ & ta + anudatta + anusvara & \R{\1त॒ं}\\


taṃ̱ & ta + anusvara + anudatta & \V{\1तं॒} \\[\medskipamount]


ta̍m̐ & ta + udatta + candrabindu & \R{\1त॑ँ}\\


tam̐̍ & ta + candrabindu + udatta & \V{\1तँ॑} \\


ta̱m̐ & ta + anudatta + candrabindu & \R{\1त॒ँ}\\


tam̱̐ & ta + candrabindu + anudatta & \V{\1तँ॒} \\[\medskipamount]


ta̍ḥ & ta + udatta + visarga & \R{\1त॑ः}\\


taḥ̍ & ta + visarga + udatta & \B{\1तः॑} \\


ta̱ḥ & ta + anudatta + visarga & \R{\1त॒ः}\\


taḥ̱ & ta + visarga + anudatta & \B{\1तः॒} \\[\medskipamount]


​

ShreeDevi
____________________________________________________________
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com

On Fri, May 22, 2015 at 7:44 PM, David M. Jones <dmj at dmj.ams.org> wrote:

> I posted this to the XeTeX sourceforge tracker a couple of weeks ago,
> and it was suggested that I also mention it here.  The sourceforge
> ticket is
>
>     https://sourceforge.net/p/xetex/bugs/111/
>
> and the zip file is
>
>
> https://sourceforge.net/p/xetex/bugs/111/attachment/xetex-indic-bug.zip
>
> Cheers,
> David.
>
> P.S. There's actually a third class of bug that is clearly visible in
> the table at the top of my document, but which I didn't mention
> explicitly: XeTeX won't typeset one of the Devanagari combining
> characters in isolation without adding a prothetic dotted circle
> (U+25CC).
>
> ===========================================================================
>
> On a lark, I've been experimenting with typesetting Vedic Sanskrit,
> specifically verses from the Ṛg-Veda, using the Murty Hindi font from
> Tiro Typeworks.  So far, XeTeX works beautifully, with two exceptions
> involving the Vedic udatta and anudatta accent marks:
>
> a) They don't attach properly to the Devanagari digits 1 and 3, and
>
> b) They don't combine properly with the candrabindu, anusvara, and
>    visarga signs.
>
> luaTeX does render these correctly, so it doesn't appear to be a
> problem with the font itself.
>
> I've included a zip file with a test file and sample output from
> luaTeX and XeTeX.
>
> Is this a bug, or am I missing something obvious?
>
>
> --------------------------------------------------
> Subscriptions, Archive, and List information, etc.:
>   http://tug.org/mailman/listinfo/xetex
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://tug.org/pipermail/xetex/attachments/20150524/3c01081a/attachment.html>


More information about the XeTeX mailing list