[tex-hyphen] Hyphenation patterns for classical Latin

Keno Wehr wehr at abgol.de
Wed Jul 3 16:50:35 CEST 2019


The current hyphenation patterns for classical Latin have some deficiencies:

- Some simple words are known to be hyphenated erroneously.
- There is only very limited support of compound words.
- Only u spellings are supported, no v spellings ("uiuere" is supported, 
but not "vivere"), even if there are editions of classical texts using 
the v spelling.
- There is no support of diacritical marks (accents, macrons, breves, 
ties), which are often used in dictionaries, grammars, and text books.
- In some cases, Roman numerals are hyphenated.

Furthermore, the hyphenation rules for classical Latin are also suitable 
for modern Latin if the traditional German or Slavic pronunciation is 
used. The patterns for modern Latin are not suitable for this purpose as 
they are based on Italian pronunciation. So support for some modern 
spelling variants (j spellings, ae/oe ligatures) is desirable for the 
classical patterns.

For these reasons, new patterns have been developed with the aid of 
patgen, based on a list of about 7500 Latin words. These patterns are 
attached. The main version is UTF-8 encoded and intended for xetex and 
luatex. Additionally, an EC version has been generated based on the same 
word list but with a reduced set of spelling variants. This version is 
suitable for pdftex and ptex.

The process of pattern generation is described here: 
https://github.com/gregorio-project/hyphen-la/blob/master/patterns/generation/README.md

The hyphenation rules are documented here: 
https://github.com/gregorio-project/hyphen-la/blob/master/doc/classical-hyphenation.md

A comparison of the different sets of hyphenation patterns for Latin can 
be found here: 
https://github.com/gregorio-project/hyphen-la/blob/master/doc/README.md

Keno
-------------- next part --------------
A non-text attachment was scrubbed...
Name: hyph-la-x-classic.tex
Type: text/x-tex
Size: 218825 bytes
Desc: not available
URL: <https://tug.org/pipermail/tex-hyphen/attachments/20190703/9f1172bf/attachment-0002.bin>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: hyph-la-x-classic.ec.tex
Type: text/x-tex
Size: 104059 bytes
Desc: not available
URL: <https://tug.org/pipermail/tex-hyphen/attachments/20190703/9f1172bf/attachment-0003.bin>


More information about the tex-hyphen mailing list