[XeTeX] Hyphenated, transliterated Sanskrit.

Dominik Wujastyk wujastyk at gmail.com
Sat Nov 20 21:54:06 CET 2010


I've been banging my head against this for a while today, without resolving
things.  I see that the UTF8 hyph-sa.tex file contains the rules for
hyphenating Sanskrit in several scripts, including Roman (Latin?).  The way
this should work, I believe, is that as long as I flag my words as being in
Sanskrit, then they'll get appropriately hyphenated whichever of these
scripts I use.

But I can't find a way to get Polyglossia to accept Sanskrit written in
Roman script.

If I say

 \setotherlanguage{sanskrit}

\newfontfamily\devanagarifont[Script=Latin]{Gentium Basic}


or even

\newfontfamily\devanagarifont{Gentium Basic}


and then


\setlength\textwidth{1cm} % or whatever, to get lots of hyphenating.

\textsanskrit{manum ekāgram āsīnam abhigamya maharṣayaḥ |}


I'm really not sure what I'm getting as a result. It looks as if it's roman
script being hyphenated as if it were Devanagari. The initial a- of several
words, like arhasi, gets separated (a-rhasi), which might just about look
okay in Nagari, but not in romanisation. Am I actually getting the right
thing, but I just need to crank up the first argument of hyphenmins? (This
does seem to improve things.)


Why do I have to pretend that this is Devanagari (\devanagarifont)? I would
like to be able to define \sanskritfont and then define the script, e.g.,


\newfontfamily\sanskritfont[Script=[Devanagari|Latin|Malayalam|Bengali]{Code2000}


I think I'll stop. You can see I'm in a terrible muddle.


Has someone got this running already? Yves? Jonathan?


Best,

Dominik
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://tug.org/pipermail/xetex/attachments/20101120/f3c26c11/attachment.html>


More information about the XeTeX mailing list