[tex-hyphen] Accuracy of the hyphenation algorithm
yuri at rawbw.com
Wed Jul 29 01:36:38 CEST 2015
When I look at the algorithm's results, I keep seeing a lot of
The original hyphen.tex has some test cases at the end that supposedly
show the correct hyphenation points:
But when I run the algorithm with patterns from hyphen.tex, I get these
Available correct answers from the Merriam-Webster dictionary:
Additionally, the produced "gen·uine" split isn't correct
(it should be "gen·u·ine"), the word "toothache" isn't split at all, and
the "p·neu·mo·ni·a" result is wrong too (it should be "pneu·mo·nia").
(https://github.com/mnater/hyphenator) with the pattern set from hyphen.tex,
reviewed the algorithm there in detail, and it seems correct. I didn't
try the TeX implementation.
Franklin Liang's paper says that this algorithm almost always produces
So how can these discrepancies be explained? Why aren't even the test
cases from hyphen.tex reproducible? Is the algorithm implementation
incorrect? Is something missing?
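
To make the comparison concrete, here is a minimal sketch of the pattern-matching
step as I understand it from Liang's description: every pattern that occurs in
the dot-padded word contributes its digits, the maximum digit wins at each gap,
and odd values mark allowed break points. The function names and the toy
patterns are made up for illustration and are not taken from hyphen.tex. The
left_min and right_min parameters are an assumption modeled on TeX's
\lefthyphenmin and \righthyphenmin (2 and 3 in plain TeX); an implementation
that skips such limits may report breaks next to a single leading or trailing
letter.

```python
import re


def parse_pattern(pat):
    """Split a Liang-style pattern such as 'b2c' into ('bc', [0, 2, 0])."""
    letters = re.sub(r"\d", "", pat)
    weights = [0] * (len(letters) + 1)
    pos = 0
    for ch in pat:
        if ch.isdigit():
            weights[pos] = int(ch)  # digit applies to the gap before letters[pos]
        else:
            pos += 1
    return letters, weights


def hyphenate(word, patterns, left_min=2, right_min=3):
    """Return the pieces of `word` split at the gaps with odd scores.

    left_min / right_min are hypothetical parameters: the minimum number of
    letters kept before the first break and after the last one.
    """
    table = dict(parse_pattern(p) for p in patterns)
    padded = "." + word.lower() + "."
    # scores[i] is the score of the gap just before padded[i]
    scores = [0] * (len(padded) + 1)
    for start in range(len(padded)):
        for end in range(start + 1, len(padded) + 1):
            weights = table.get(padded[start:end])
            if weights:
                for k, w in enumerate(weights):
                    scores[start + k] = max(scores[start + k], w)
    pieces, last = [], 0
    # the gap before word[j] is the gap before padded[j + 1]
    for j in range(left_min, len(word) - right_min + 1):
        if scores[j + 1] % 2 == 1:
            pieces.append(word[last:j])
            last = j
    pieces.append(word[last:])
    return pieces


toy_patterns = ["1b", "1c", "b2c"]  # invented patterns, not from hyphen.tex
print(hyphenate("abcabc", toy_patterns, left_min=1, right_min=1))
print(hyphenate("abcabc", toy_patterns))
```

With the limits relaxed to 1 and 1 the toy word is split after its first
letter; with the defaults of 2 and 3 the same patterns produce no break at
all, so the two settings can disagree on the same word and pattern set.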