<html><body style="word-wrap: break-word; -webkit-nbsp-mode: space; -webkit-line-break: after-white-space; ">
Thanks to Dominik for presenting my needs for hyphenating romanised Sanskrit according to the syllabic division of Sanskrit traditional phonetics. For a number of reasons, in my philologically-oriented work I prefer to typeset Sanskrit words as faithfully as possible to the sources, and the hyph-sa.tex fulfils this need.<div><div><br></div><div>Yet, I think I understand Dominik on the need for a reader-friendly hyphenation of Sanskrit, particularly in texts with less strict philological needs, and in English essays with occasional Sanskrit terms. In this regard, Dominik's suggestion of adopting the customs of the academic tradition makes sense. But how consistently are such customs applied? And, how many of them are the informed choice of scholars, and not the product of typographers' tastes, dictionaries of modern languages, or software-specific algorithms? In any case, I think that readibility judgements on hyphenation of Sanskrit are largely influenced by one's own habits in hyphenating English, Italian, or any other language, so it is difficult to set a universal standard other than the Devanagari-conforming one.</div><div><br></div><div>As for Italian typesettingt, hyphenation of Sanskrit words is probably as irregularly applied as in English literature. It is just that, in respect to English, some consonantic clusters commonly found also in Sanskrit (pr, pl, st etc.) are not broken in Italian hyphenation (e.g. ca-sti-tà vs. chas-ti-ty); thus, by adopting Italian hyphenating patterns, one probably gets slightly better results as far as traditional syllabic division of Sanskrit.</div><div><br></div><div>Best,</div><div>Alessandro Graheli<br><div><br></div><div><br><div><br><div><div>Il giorno 12/set/11, alle ore 12:58, Dominik Wujastyk ha scritto:</div><br class="Apple-interchange-newline">I've just had a stimulating conversation about this with my friend and fellow Sanskritist, Alessandro Graheli (who also reads this XeTeX list, and is doing critical editions of Sanskrit texts with XeTeX). <br><br>Alessandro was concerned that I overstated the case. He has used the existing Codet/Kew hyph-sa.tex patterns, and prefers them even for romanised Sanskrit. Word-division after a vowel fits with the forms of recitation and caesura that Alessandro learned when he was a student in India working extensively with traditional Sanskrit pandits. He also said that Italian typesetting of Sanskrit in romanisation hyphenates this way, rather than in the etymological manner that I was asserting. <br> <br>We need more study to sort out some of these issues, but it looks prima facie as if both styles of hyphenating romanised Sanskrit should be preserved, since there are different usage-groups out there. While the hyphenation style for romanised Sanskrit that I describe below reflects widespread usage in good printing over the last century or more, mainly in British texts and journals, and may be required in future too, there are also people who are comfortable with "Devanagari-style" hyphenation in Romanised text too.<br> <br>Best,<br>Dominik<br><br><div class="gmail_quote">On 11 September 2011 20:40, Dominik Wujastyk <span dir="ltr"><<a href="mailto:wujastyk@gmail.com">wujastyk@gmail.com</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex;"> Sanskrit is hyphenated differently in Devanagari and in Roman script. If you use the hyph-sa.tex patterns, you get Roman hyphenated <i>as if it were Devanagari,</i> which is not acceptable in scholarly circles. The last 150 years of European writing on Sanskrit, using Romanisation, has developed hyphenation rules based on Sanskrit etymology, paying attention to compound words, internal sandhi, etc. (i.e., like German in some respects). The Devanagari hyphenation uses a much simpler idea, basically hyphenate after almost any vowel.<br> <br>To get appropriate hyphenation in Romanisation, we need to go down the Patgen path. So we need to develop a large lexicon of appropriately-hyphenated romanised Sanskrit words in UTF8 encoding, and when that list is reasonably long, process it through Patgen to make patterns.<br> <br>I am slowly developing such a list, but it would be great to collaborate. <br><br>While the list is in the making, it can still be used, by using \hyphenation. <br><br>Thus:<br><br>\documentclass{article}<br><br>polyglossia, xltxtra, whatnot<br> ... <br>\setotherlanguage{sanskrit} % for transliterated Sanskrit<br>\newfontfamily\sanskritfont{TeX Gyre Pagella}<br><br>% Define \sansk{} which is the same as \emph{}, except that it causes appropriate hyphenation<br> % for Sanskrit words. Use \sansk{} for Sanskrit and \emph{} for English.<br> \newcommand{\sansk}[1]{\emph{\textsanskrit{#1}}}<br>... <br>\begin{document}<br><br>\input{sanskrit-hyphenations.tex} % see attached file. <br><br>Blah English blah. \sansk{āyurveda, avicchinnasampradāyatvād}. <br> <br> \end{document}<br> <br> <br>Best,<br><font color="#888888">Dominik<br> </font></blockquote></div><br><div style="margin-top: 0px; margin-right: 0px; margin-bottom: 0px; margin-left: 0px; min-height: 14px; "><br></div><div style="margin-top: 0px; margin-right: 0px; margin-bottom: 0px; margin-left: 0px; min-height: 14px; "><br></div><div style="margin-top: 0px; margin-right: 0px; margin-bottom: 0px; margin-left: 0px; ">--------------------------------------------------</div><div style="margin-top: 0px; margin-right: 0px; margin-bottom: 0px; margin-left: 0px; ">Subscriptions, Archive, and List information, etc.:</div><div style="margin-top: 0px; margin-right: 0px; margin-bottom: 0px; margin-left: 0px; "><span class="Apple-converted-space"> </span><a href="http://tug.org/mailman/listinfo/xetex">http://tug.org/mailman/listinfo/xetex</a></div> </div><br></div></div></div></div></body></html>