<html>
<head>
<meta content="text/html; charset=UTF-8" http-equiv="Content-Type">
</head>
<body bgcolor="#FFFFFF" text="#000000">
<div class="moz-cite-prefix">Mojca,<br>
<br>
As for patching ICU, here is what they did for me:
<a class="moz-txt-link-freetext" href="http://bugs.icu-project.org/trac/changeset/34229">http://bugs.icu-project.org/trac/changeset/34229</a><br>
<br>
It should be 'copy-and-paste' for many parts, but might be wise to
have someone more experienced in programming than myself double
check and compile things...<br>
<br>
Or better yet, is there someone who would be willing to mentor me
through the steps, so that I can get some experience in this
field? (I'd like to get a better feel for programming, but have
minimal experience in C/C++) If so, though it may take more work,
could help me in the future submit patches more easily. I'd like
to do something similar for firefox, inkscape, openoffice,
libreoffice, etc. The more I can do for them upfront, the better.<br>
<br>
ICU does various boundary detections, but if only the word break
iterator is being used, then a solution for syllables would be
needed for hyphenation... <br>
<br>
Thankfully, I have a Lao Syllable List attached (ods spreadsheet),
as well as a document (pdf) describing how things work (if a regex
method would be preferred). Could this be useful for hyphenation?
What is the best approach?<br>
<br>
Sometimes, especially for foreign words, syllables break the
'standard' spelling conventions... And as of yet, there isn't an
accepted standard for these cases (though the Lao government is
working on such a standard). So may end up with some hiccups with
foreign words (possibly unavoidable until more data is acquired).
I have attempted, in helping collect this data, that all variant
spellings (whether standard or not) be included. This, in my
opinion, would make for less immediate problems for users who may
not use the standard spellings.<br>
<br>
Anyways, let me know your thoughts!<br>
<br>
<div class="moz-signature">Respectfully,<br>
<br>
<b>Robert M Campbell</b><br>
IT Specialist for ADRA Laos & Open Source Advocate<br>
Lao Cell: +856 207 616 7299<br>
US Phone: +1 270 681 0399<br>
<a class="moz-txt-link-abbreviated" href="mailto:robert.rcampbell@gmail.com">robert.rcampbell@gmail.com</a><br>
<a class="moz-txt-link-abbreviated" href="mailto:rcampbell@adralaos.org">rcampbell@adralaos.org</a><br>
<br>
Visit ADRA Lao's Facebook Page at <a
href="www.facebook.com/ADRALaos">facebook.com/ADRALaos</a></div>
On 09/23/2013 01:44 AM, Khaled Hosny wrote:<br>
</div>
<blockquote cite="mid:20130922184437.GA5884@khaled-laptop"
type="cite">
<pre wrap="">On Sun, Sep 22, 2013 at 07:46:17PM +0200, Mojca Miklavec wrote:
</pre>
<blockquote type="cite">
<pre wrap="">One thing I don't know is whether ICU only breaks between words or
also between syllables. (If it only breaks between words you would
still need hyphenation patterns. If it breaks between syllables as
well, you don't need anything else.)
</pre>
</blockquote>
<pre wrap="">
We only used the word breaking from ICU, so unless syllable boundaries
are valid word breaks, hyphenation will still be needed.
Regards,
Khaled
</pre>
</blockquote>
<br>
</body>
</html>