[tex-hyphen] Adding turkmen hyphenation and language definition to babel

François Charette firmicus at ankabut.net
Mon Mar 15 22:57:27 CET 2010


On 14/03/2010 21:21, Mojca Miklavec wrote:
> On Sun, Mar 14, 2010 at 20:14, Nazar Annagurban wrote:
>    
>> Hi guys,
>>
>> I found the mailing-list through google. I'd like to ask you about adding a
>> new hyphenation language. I have prepared the hyphenation patterns for the
>> turkmen language (Turkmenistan) and tested it using the java implementation
>> of Frank Liangs algorithm. However, I'd like to test that the charset (our
>> special character that are not in latin1) works correctly under tex (latex)
>> before I submit it. I looked through the Babel files to find the file
>> containing language definition to hyphen patterns binding, but I couldn't
>> find it. If you guys could give me some directions? Or anyway I could latex
>> use my patterns for hyphenating.
>>
>> (Turkmenistan hasn't had much latex exposure, but I'm using it to prepare
>> digital versions of older books, and maybe more people will start using it
>> seeing the quality of the result)
>>      
> Hello,
>
> It is very nice to see new contributors.
>
> It is quite possible that nobody has written Babel support for you
> language yet. From what I can read, people might use both Latin and
> Cyrillic script (as well as Arabic), so you might want to keep that in
> mind when writing support, so that it would be easy enough to plug in
> the other script if one would need or request it in future.
>
> Maybe you could also contact François Charette for help with
> Polyglossia or the author of Babel if you need help. To start with
> just take a file *.ldf and try to adapt it for your language.
>    

I would be glad indeed to add Turkmen support to polyglossia! It would 
be nice to also find contributors for the other important Central Asian 
languages (Uzbek, Kazakh, Kirghiz, Tajik, etc.)!

Concerning Polyglossia: I am currently preparing v1.2 which will 
introduce a new simplified interface for writing gloss-*.ldf files.
Contact me if you are interested. From v1.2 onward there will most 
probably be one gloss file per language+script combination. That means 
we would need gloss-turkmen.ldf for the common macros and definitions, 
and gloss-turkmen_cyr.ldf, gloss-turkmen_lat.ldf and perhaps even 
gloss-turkmen_ara.ldf, for each script.

BTW, Mojca, someone has posted new hyphenation patterns for Malayalam 
and Tamil today. Should I forward them to you or to this list?

Best regards,
François


More information about the tex-hyphen mailing list