<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN">

<html>

  <head>

    <meta content="text/html; charset=UTF-8" http-equiv="Content-Type">

    <title></title>

  </head>

  <body bgcolor="#ffffff" text="#000000">

    Am 16.06.2011 01:41, schrieb <a class="moz-txt-link-abbreviated" href="mailto:mskala@ansuz.sooke.bc.ca:">mskala@ansuz.sooke.bc.ca:</a>

    <blockquote

      cite="mid:alpine.LNX.2.00.1106151828370.2191@tetsu.ansuz.sooke.bc.ca"

      type="cite">

      <pre wrap="">I thought the original poster was talking about segments of text written

in romanized Japanese as the only script - not phonetic guide texts

(furigana) attached to Japanese script, nor equivalents in other

languages.  The issues you describe are interesting for knowing how to

break furigana when words are split at the end of a line, but I don't

think they're relevant to the original poster's question; it sounded like

they had a pretty clear, and simple, idea of the hyphenation they wanted

to use for romanized text, and it would be a fair bit simpler than the

existing algorithm currently used for languages like English.

I'm not sure that romanized Japanese is used enough for texts of more than

a few words, to justify a lot of development effort going into figuring

out how to hyphenate it beyond the original poster's immediate

application.  Do any established standards or traditions exist for such

hyphenation at all?</pre>

    </blockquote>

    <br>

    Hello,<br>

    <br>

    yes, I thought exactly of such a few words in a western text.<br>

    <br>

    For example, in situations like this: “A town where many hot springs

    (<i>onsen</i>) are located is Beppu in Kyūshū. ”<br>

    Here, you have three Japanese words in a text: onsen, Beppu, Kyūshū.

    The hyphenation rules would be quite easy: on-sen, bep-pu, kyū-shū.

    <br>

    Of course, you can do this all manually, but I think if one writes a

    text in japan studies or somewhere like that, occurrences of

    romanized Japanese can be quite often. Also, if you have a Japanese

    book in a reference section, you may need to write the complete

    title in romanization, where hyphenation may be needed as well:

    “Wakabayashi Masahiro: Taiwan - Henyō shi chūcho suru aidentiti.”<br>

    This is often not so much of a problem, because many Japanese words

    are not that long (but there are some longer words!), but still, the

    space will not be used perfectly. <br>

    <br>

    Because the hyphenation rules seem to be very easy, I think it

    shouldn’t be much of a problem to create rules for it. I am not sure

    if there exist standards or traditions for that, but I can imagine

    it. Especially Pinyin has many rules, so I think there would be

    romanization for that. Japanese on the other side is quite easy to

    structure, so I think hyphenation, even in absence of specific

    rules, is clearly straightforward.<br>

    <br>

    Furigana hyphenation etc. is an entire different field: For that, we

    would first need Furigana support, which seems to be very difficult

    (or at least needs some work). But romanization hyphenation seems

    easy.<br>

    <br>

    By the way, I think this romanization hyphenation is not only

    necessary for Japanese or Chinese, but for any other language as

    well (Arabic, Greek, Russian, Thai, etc.). For Japanese and Chinese

    the advantage is that you have a universal romanization, but for

    other languages there seem to be romanization specific to the target

    language (e.g. Arabic romanization for German). Still, there also

    seems to be some scientific romanization which could be the

    standard. After all, I guess Latex (and Xelatex) is most often used

    for western scientific texts, which often do include these romanized

    foreign terms (if they deal with these areas).  <br>

    <br>

    Gerrit<br>

    <br>

    <br>

  </body>

</html>