[tex-hyphen] ptex-specific patterns

Tak Yato (ZR) zrbabbler at yahoo.co.jp
Tue Jun 1 16:02:00 CEST 2010


Hello Mojca,

--- Mojca Miklavec <mojca.miklavec.lists at gmail.com> wrote:

> PS: Since it seems that we won't be able to use any "smart approach"
Unfortunately, you cannot make a Japanese character active,
which makes conversion based on active character expansion
nearly impossible. The last resort left to you would be
to construct a looping macro, which would penalize everyone
with drastic slow down.

Another comment to add:
You probably miss the point that when you use pTeX then
obviously “UTF-8 to legacy 8-bit” conversion at the first
step is also running on pTeX.That is, the byte sequences
which will be contaminated by Japanese characters is those
in UTF-8, not legacy 8-bit encodings.

For example, think of convering UTF-8 byte sequence <C4 8D 69>
“&#269;i” to T1 <A3 69>; what pTeX in sjis mode would see is an
8-bit character <C4> followed by a Japanese character <8D69>
(that is U+7D5E in Unicode). It is extremely hard to devise
a procedure to successfully convert this sequence to the
desired result.

Best regards,
Tak Yato (aka "Z.R.")

--------------------------------------
2010 FIFA World Cup News [Yahoo!Sports/sportsnavi]
http://pr.mail.yahoo.co.jp/southafrica2010/


More information about the tex-hyphen mailing list