[XeTeX] Velthuis to Roman translit
Dominik Wujastyk
wujastyk at gmail.com
Thu Feb 3 19:31:26 CET 2011
Is this the issue you have come across?
-
http://cikitsa.blogspot.com/2010/09/xelatex-velthuis-encoding-and-palatal.html
Or is it something simpler? Perhaps you can just delete the line in the sed
file that produces the unwanted conversion? Sed files are dead easy to
understand and to write. They're just glorified search-and-replace
statements.
Wait a minute. Reading your note more carefully, I think the sed script is
giving you English text with unwanted "\ñ" strings here and there. Is that
right? If so, you could just add a last line to the sed script to right
that wrong ("s/\\ñ/ñ/g").
Yes, you are right about TEC/map files. I think you should use the existing
velthuis map files and replace the Devanagari character codes for output
with Latin, just as you say. Once you get to understand the conjunct
consonants coding, it can be stripped out. Etc. I don't know where the
documentation for TEC files is, but probably on the SIL website.
The TEC mechanism is also a glorified search-and-replace system, but with
more intelligence about context-sensitive processing of strings.
Best,
Dominik <https://www.dropbox.com/referrals/NTIzNzI2MTY5>
On 3 February 2011 18:58, Neal Delmonico <ndelmonico at sbcglobal.net> wrote:
> Thanks Dominik,
>
> I tried the sed file and it works wonderfully, in fact a little too well.
> My files tend to be mixed Sanskrit and translation in which I use the
> standard LaTeX diacritic codes. "\~n" gets picked up by the sed file and
> converted and then XeLaTeX chokes on it. Is there a way around this? I
> can, of course, search and replace those unwanted conversions, but it is a
> little less convenient.
>
> What is involved in writing a XeTeX TEC file? I've looked at the map
> files. Is it mostly a matter of substituting the Unicode codes with other
> Unicode codes to produce Romanized output instead of Devanagari? Is it
> possible for a semi-computer literate person like myself to do that? Where
> would I start?
>
> Thanks again for your help.
>
> Best
>
> Neal
>
>
> On Thu, 03 Feb 2011 06:05:51 -0600, Dominik Wujastyk <wujastyk at gmail.com>
> wrote:
>
> I tend to do this just with a sed script, for the file as a whole (sed
>> file
>> attached, originally from Richard Mahoney, but edited by me). It wouldn't
>> be that hard to write a xetex TEC file to do this, but I'm not aware of
>> anyone actually having done it yet. It would be nice to have, I agree.
>> Dominik
>> <https://www.dropbox.com/referrals/NTIzNzI2MTY5>
>>
>>
>>
>> On 2 February 2011 23:34, Neal Delmonico <ndelmonico at sbcglobal.net>
>> wrote:
>>
>> Hi All,
>>>
>>> Is there any easy way to get from Velthuis Devanagari encoding to Roman
>>> transliteration? I have lots of documents in Velthuis that I would like
>>> to
>>> switch to Roman transliteration sometimes without having to type them in
>>> again. If there is a way to just substitute some LaTex codes, that would
>>> be
>>> tremendous.
>>>
>>> Thanks,
>>>
>>> Neal
>>>
>>> --
>>> Using Opera's revolutionary email client: http://www.opera.com/mail/
>>>
>>>
>>> --------------------------------------------------
>>> Subscriptions, Archive, and List information, etc.:
>>> http://tug.org/mailman/listinfo/xetex
>>>
>>>
>
> --
> Using Opera's revolutionary email client: http://www.opera.com/mail/
>
>
> --------------------------------------------------
> Subscriptions, Archive, and List information, etc.:
> http://tug.org/mailman/listinfo/xetex
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://tug.org/pipermail/xetex/attachments/20110203/af0ecce0/attachment-0001.html>
More information about the XeTeX
mailing list