[tex-hyphen] US English patterns in hyph-en-us.pat.txt are buggy
roozbeh at google.com
Fri Jun 9 22:16:54 CEST 2017
First post to the list, reporting a bug. Please point me to the bug tracker
if there is one.
Debugging an Android user report, I found that Android was hyphenating the
words "democrat" and "democrats" incorrectly, as:
While Merriam Webster was recommending:
And Plain TeX was hyphenating as:
Digging deeper, the source of the problem seems to be the following pattern
That pattern seems to not exist in Plain TeX's pattern file for US English.
The other patterns applying to those words, all existing in Plain TeX, are:
I think the source of the problem is that the authors of the extended
pattern file derived the modified patterns based on TUGboat's exception
list, they created that "5moc1ra1t" pattern based on the word
"de-moc-ra-tism" and didn't notice that adding it would cause "democrat"
and "democrats" to be hyphenated incorrectly.
I guess these two words would not be the only exceptions, and there should
be tens of other words that are affected by a similar problem of
over-weighing the exception list.
I believe as a temporary solution, the US English hyphenation patterns
should be restored to the Plain TeX version, and the exception list should
get extended to include everything in
I can prepare a patch, if that's useful.
-------------- next part --------------
An HTML attachment was scrubbed...
More information about the tex-hyphen