[XeTeX] XeTeX Digest, Vol 94, Issue 12

Shiva Shankar shivably04sdst at gmail.com
Mon Jan 9 06:10:07 CET 2012


2012/1/8 <xetex-request at tug.org>

> Today's Topics:
>
>   1. Re: Preventing a line break at a given set of characters
>      (Zdenek Wagner)
>   2. HI (chandrasekhar s)
>   3. Re: HI (Zdenek Wagner)
>   4. Re: HI (Yves Codet)
>
>
> ---------- Forwarded message ----------
> From: Zdenek Wagner <zdenek.wagner at gmail.com>
> To: Unicode-based TeX for Mac OS X and other platforms <xetex at tug.org>
> Cc:
> Date: Sat, 7 Jan 2012 14:50:14 +0100
> Subject: Re: [XeTeX] Preventing a line break at a given set of characters
> 2012/1/7 Paul Isambert <zappathustra at free.fr>:
> > Shiva Shankar <shivably04sdst at gmail.com> wrote:
> >>
> >> Hi,
> >>
> >> Is it possible to prevent the linebreak at a given set of characters?
> >> for example TeX will never break a line at fullstop(.) I mean
> >> fullstop alone never goto next line (unless there is a space behind
> >> it). Similarly for comma(,), semicolon(;) etc.
> >
> This is not the right question. TeX breaks lines:
> 1. at a glue unless it is preceded by a discardable item
> 2. at a kern if it is immediately followed by a glue
> 3. at a penalty no matter what follows or precedes
> 4. at a \discretionary, either explicit or resulting from analysis of
> hyphenation patterns.
>
> A hyphen is roughly treated as
> \penalty\exhyphenpenalty\discretionary{-}{}{-}. Thus there is no such
> thing in the TeX world as breaking at a character. In order to give good
> advice I need to know what exactly you wish to achieve. If you look at
> the above rules, you see that if you put \kern0pt in between two
> characters, TeX will never break there. However, the kern will
> suppress any implicit kern that would otherwise be inserted by
> examining the font metrics and in addition will create a word boundary,
> so that hyphenation patterns will not properly match the remaining
> parts of the word. If you mark feasible hyphenation points with \-,
> TeX will not examine hyphenation patterns for that word and uses only
> \- when optimizing paragraph breaking. The trick with \lccode will
> work too because hyphenation works only if the characters have
> \catcode equal to 11 and nonzero \lccode.
>

Here is what I want to achieve.

We have a package called kanlel.sty for typesetting Kannada in TeX/LaTeX.
At present we do not have hyphenation patterns for this package.
For the time being, I want to minimize line breaks at inappropriate
places, which would ease many of our tasks. In Kannada there are
characters (conjuncts) that have to appear below another character
(a consonant) and cannot appear independently. There are about 32 such
characters, plus nine vowel diacritics. If I can prevent breaks at
these characters as well, I expect the output will be good.

I would also like to build hyphenation patterns for Kannada (for the
kanlel package). We have a huge Kannada corpus, and I can create a
hyphenated list of unique words from it. We also have a rule for
hyphenation: leftmin should be 2 and rightmin should also be 2.
What is the next step?
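
For reference, the 2/2 rule above corresponds to the two TeX parameters
shown below. This is only a minimal sketch: it assumes the values are set
directly in the document, and a language package such as babel may reset
them when it switches languages.

```tex
% At least two characters must stay before a hyphen and at least two after it.
\lefthyphenmin=2
\righthyphenmin=2
```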

After generating the hyphenation patterns, must I use babel in order to
use them, or can I use them independently?
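
For context, here is a rough sketch of what using patterns without babel
might look like with a plain TeX format. It is written under assumptions,
not tested with kanlel: the names \kannadalang and kanhyph-sketch.tex are
hypothetical, and \patterns is accepted only while a format is being built
(an iniTeX run such as "xetex -ini"), not in an ordinary document run.

```tex
% Format source, processed once by iniTeX.
\input plain                 % plain TeX; its own patterns go into language 0
\newlanguage\kannadalang     % allocate a new language number (hypothetical name)
\language=\kannadalang       % patterns read from here on belong to that language
% Every character used in the patterns needs a nonzero \lccode at this point.
\input kanhyph-sketch.tex    % hypothetical file containing \patterns{...}
\language=0                  % return to the default language
\dump                        % write the format file
```

A document typeset with such a format could then select the patterns with
\language=\kannadalang, without loading babel.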

Regards
Shivashankar



>
> > You can always set \lccode to 0 for the characters you want to be
> > unbreakable, but that will mess with word recognition globally, so you
> > won't have hyphenation where you don't want it, but you won't have it
> > either in many places where expected. A solution might be to add a
> > glue -- preceded by an infinite penalty -- after the character to mark
> > the legitimate end of a word; you might be able to do that automatically
> > with interchartoks; that won't solve the problem with \lefthyphenmin/\righthyphenmin,
> > though. See e.g.:
> >
> >    % Normal hyphenation.
> >    \showhyphens{absolutely}
> >
> >    % Bad solution.
> >    \lccode`\t=0
> >    \showhyphens{absolutely}
> >
> >    % Better solution.
> >    \XeTeXinterchartokenstate=1
> >    \XeTeXcharclass`\t=10
> >    \XeTeXinterchartoks 10 0 = {\penalty10000\hskip0pt}
> >    \showhyphens{absolutely}
> >
> > I'm not sure this is sound.
> >
> > Of course, you can also declare exceptions with \hyphenation, provided
> > you don't have too many of them.
> >
> > Best,
> > Paul
> >
> >
> >
> > --------------------------------------------------
> > Subscriptions, Archive, and List information, etc.:
> >  http://tug.org/mailman/listinfo/xetex
>
>
>
> --
> Zdeněk Wagner
> http://hroch486.icpf.cas.cz/wagner/
> http://icebearsoft.euweb.cz
>
>
>
>
> ---------- Forwarded message ----------
> From: chandrasekhar s <chandru.mcc at gmail.com>
> To: Unicode-based TeX for Mac OS X and other platforms <xetex at tug.org>
> Cc:
> Date: Sun, 8 Jan 2012 02:32:33 +0530
> Subject: [XeTeX] HI
>
> [image: enter image description here]
>
>
> Can the above image be produced using the devanagari package? To produce
> it I used the skt package, but I am curious to know if this image (that is,
> the chandrabindu along with the halanth) can be produced using the
> devanagari package. This symbol is very often used in the "Rudram:
> Namakam-Camakam" mantra.
>
> Chandrasekhar
>
>
>
>
> --
>
>
>
>
>
> ---------- Forwarded message ----------
> From: Zdenek Wagner <zdenek.wagner at gmail.com>
> To: Unicode-based TeX for Mac OS X and other platforms <xetex at tug.org>
> Cc:
> Date: Sat, 7 Jan 2012 23:10:50 +0100
> Subject: Re: [XeTeX] HI
> 2012/1/7 chandrasekhar s <chandru.mcc at gmail.com>
>
>>
>> [image: enter image description here]
>>
>>
>> Can the above image be produced using the devanagari package? To produce
>> it I used the skt package, but I am curious to know if this image (that is,
>> the chandrabindu along with the halanth) can be produced using the
>> devanagari package. This symbol is very often used in the "Rudram:
>> Namakam-Camakam" mantra.
>>
>
> Yes, {\dn\dnsamaveda hi\@\ }
> But this is not a XeTeX question. I do not know how to produce it in
> XeTeX.
>
>>
>> Chandrasekhar
>>
>>
>>
>>
>> --
>>
>>
>>
>>
>>
>>
>>
>>
>
>
> --
> Zdeněk Wagner
> http://hroch486.icpf.cas.cz/wagner/
> http://icebearsoft.euweb.cz
>
>
> ---------- Forwarded message ----------
> From: Yves Codet <yves.codet at sfr.fr>
> To: Unicode-based TeX for Mac OS X and other platforms <xetex at tug.org>
> Cc:
> Date: Sun, 8 Jan 2012 07:36:31 +0100
> Subject: Re: [XeTeX] HI
> On 7 Jan 2012, at 23:10, Zdenek Wagner wrote:
>
> > 2012/1/7 chandrasekhar s <chandru.mcc at gmail.com>
> >
> >
> >
> >
> > Can the above image be produced using the devanagari package? To produce
> > it I used the skt package, but I am curious to know if this image (that is,
> > the chandrabindu along with the halanth) can be produced using the
> > devanagari package. This symbol is very often used in the "Rudram:
> > Namakam-Camakam" mantra.
> >
> > Yes, {\dn\dnsamaveda hi\@\ }
> > But this is not a XeTeX question. I do not know how to produce it in
> > XeTeX.
>
> You need a font including the last character (U+A8F3 DEVANAGARI SIGN
> CANDRABINDU VIRAMA). There's only Siddhanta as far as Alan Wood knows:
> http://www.alanwood.net/unicode/devanagari-extended.html
>
> Regards,
>
> Yves
>
>
>
> _______________________________________________
> XeTeX mailing list
> XeTeX at tug.org
> http://tug.org/mailman/listinfo/xetex
>
>


-- 
Regards
Shivashankar
Sriranga Digital Software Technologies Pvt. Ltd.,
Srirangapatna