[XeTeX] xetex and the unicode bidirectional algorithm.

Zdenek Wagner zdenek.wagner at gmail.com
Mon Dec 9 14:55:06 CET 2013


2013/12/9 Philip Taylor <P.Taylor at rhul.ac.uk>:
> Keith -- could you possible supply an example of
> "a properly encoded utf-8 string" from which it
> can be unambiguously determined whether the string
> "sang" is an English word (the past tense of "sing")
> or a Vietnamese word meaning "to", "posh" or "knowingly"

And it may be a Danish noun meaning "a song" or a past tense from
Danish verb "synge".

> in English ?  Could you also paste that string into
> Richard Ishida's Unicode String Analyser :
>
>         http://rishida.net/tools/analysestring/
>
> and let us know what information it returns ?
>
> Philip Taylor
> --------
>
> Keith J. Schultz wrote:
>
>> Unfortunately, for efficiency reasons, utf-8 strings are not properly
>> encoded and programs assume a particular language, to save space.
> --
> Windows 8 ? Just say "no".
>
>
> --------------------------------------------------
> Subscriptions, Archive, and List information, etc.:
>   http://tug.org/mailman/listinfo/xetex



-- 
Zdeněk Wagner
http://hroch486.icpf.cas.cz/wagner/
http://icebearsoft.euweb.cz



More information about the XeTeX mailing list