[tex-hyphen] tug.ctan.org upload: Hyphenation Patterns in UTF-8

Mojca Miklavec mojca.miklavec.lists at gmail.com
Sat Jun 28 21:38:04 CEST 2008


On Sat, Jun 28, 2008 at 7:52 PM, Norbert Preining wrote:
>
> Ok, here is again a diff between what we have and what you have.
>        From ours To yours
>
> First:
>        hyphen-sanskrit.tlpsrc is new, so I guess we add it to
>        collection-langindic.tlpsrc.

Oh, I forgot to mention that in differences. Yes, that file is
completely new. I have no idea where you put what, but
collection-langindic probably makes sense.

> --- /src/TeX/texlive-svn/Master/tlpkg/tlpsrc/hyphen-ancientgreek.tlpsrc 2008-06-17 16:10:41.000000000 +0200
> +++ hyphen-ancientgreek.tlpsrc  2008-06-28 17:43:03.000000000 +0200
> @@ -2,4 +2,5 @@
>  category TLCore
>  execute AddHyphen name=ancientgreek lefthyphenmin=1 righthyphenmin=1 file=loadhyph-grc.tex
>  execute AddHyphen name=ibycus lefthyphenmin=2 righthyphenmin=2 file=ibyhyph.tex
> +runpattern f texmf/tex/generic/hyphen/grmhyph5.tex
>  runpattern f texmf/tex/generic/hyphen/ibyhyph.tex
>
> grmhyph5.tex is contained in hyphen-greek.tlpsrc and cannot be included
> in 2 packages, that is how it works. So we have to remove it here.
>
> Can you hack this special case into it?

Thanks Norbert. We are extremely lucky to have you :)
We were accidentally loading modern monotonic instead of ancient
Greek. (not "we" - it was my fault!)
I doubt that anyone would ever notice that until the release if we
didn't have you.

Thanks a lot for spotting that :)

> --- /src/TeX/texlive-svn/Master/tlpkg/tlpsrc/hyphen-bulgarian.tlpsrc    2008-06-19 12:06:30.000000000 +0200
> +++ hyphen-bulgarian.tlpsrc     2008-06-28 17:43:03.000000000 +0200
> @@ -1,4 +1,3 @@
>  name hyphen-bulgarian
>  category TLCore
> -depend bghyphen
>  execute AddHyphen name=bulgarian lefthyphenmin=2 righthyphenmin=2 file=loadhyph-bg.tex
>
> Is the bghyphen package still necessary? It contains the files
>  texmf-dist/tex/generic/bghyphen/bghyphen.tex
>  texmf-dist/tex/generic/bghyphen/bghyphsi.tex
>  texmf-dist/tex/generic/bghyphen/catmik.tex
>  texmf-dist/tex/generic/bghyphen/mik2t2.tex
>
> Can we delete the whole package?

We do not depend on it, but I would prefer to contact the author
before removing it. bghyphen contains some mechanism to support
different encodings, though TeX Live only contains support for t2
(it's written - do not modify if you don't understand what this is).

We can remove dependency safely, but before removing the package -
Arthur, do you volunteer to write to the author and ask him for
advice?

> --- /src/TeX/texlive-svn/Master/tlpkg/tlpsrc/hyphen-danish.tlpsrc       2008-06-17 15:52:00.000000000 +0200
> +++ hyphen-danish.tlpsrc        2008-06-28 17:43:03.000000000 +0200
> @@ -1,6 +1,3 @@
>  name hyphen-danish
>  category TLCore
>  execute AddHyphen name=danish lefthyphenmin=2 righthyphenmin=2 file=loadhyph-da.tex
> -runpattern f texmf/tex/generic/hyphen/dkcommon.tex
> -runpattern f texmf/tex/generic/hyphen/dkhyph.tex
> -runpattern f texmf/tex/generic/hyphen/dkspecial.tex
>
> I guess that refers to what you wrote below, that you got rid of the
> danish patterns. So we can remove these 3 files and the patterns. Right?

Right. (I hope so, at least.)

> --- /src/TeX/texlive-svn/Master/tlpkg/tlpsrc/hyphen-french.tlpsrc       2008-06-17 15:54:26.000000000 +0200
> +++ hyphen-french.tlpsrc        2008-06-28 17:43:03.000000000 +0200
> @@ -1,4 +1,3 @@
>  name hyphen-french
>  category TLCore
>  execute AddHyphen name=french synonyms=patois,francais lefthyphenmin= righthyphenmin= file=loadhyph-fr.tex
> -runpattern f texmf/tex/generic/hyphen/frhyph.tex
>
> Same with french?

Yes, same with French.

> --- /src/TeX/texlive-svn/Master/tlpkg/tlpsrc/hyphen-german.tlpsrc       2008-06-19 12:11:24.000000000 +0200
> +++ hyphen-german.tlpsrc        2008-06-28 17:43:03.000000000 +0200
> @@ -3,6 +3,4 @@
>  execute AddHyphen name=german lefthyphenmin=2 righthyphenmin=2 file=loadhyph-de-1901.tex
>  execute AddHyphen name=ngerman lefthyphenmin=2 righthyphenmin=2 file=loadhyph-de-1996.tex
>  runpattern f texmf/tex/generic/hyphen/dehypht.tex
> -runpattern f texmf/tex/generic/hyphen/dehyphtex.tex
> -runpattern f texmf/tex/generic/hyphen/ghyphen.README
>  runpattern f texmf/tex/generic/hyphen/dehyphn.tex
>
> ??? I guess the exceptions should stay, and the README, too. What do you
> say?

Werner, there are currently 4 files with german patterns. Is there any
chance to include that readme in either of the other four files?

Any comments about the other file? Is it still needed with the new
patterns (just for info)?
I will add both to the list for now, but contents of the readme could
be moved; the copyrights in headers of other patterns are a bit
"messy" anyway, so if someone could take a look, it wouldn't hurt.

If nohing else, the name is weird (it should be dehyp*, not ghyp*).

> If you agree please add this to your tlpsrc generation code if possible.

Done.

> --- /src/TeX/texlive-svn/Master/tlpkg/tlpsrc/hyphen-greek.tlpsrc        2008-06-17 15:36:14.000000000 +0200
> +++ hyphen-greek.tlpsrc 2008-06-28 17:43:03.000000000 +0200
> @@ -2,5 +2,5 @@
>  category TLCore
>  execute AddHyphen name=monogreek lefthyphenmin=1 righthyphenmin=1 file=loadhyph-el-monoton.tex
>  execute AddHyphen name=greek synonyms=polygreek lefthyphenmin=1 righthyphenmin=1 file=loadhyph-el-polyton.tex
> -runpattern f texmf/doc/generic/elhyphen/*
> -runpattern f texmf/tex/generic/hyphen/gr*hyph*.tex
> +runpattern f texmf/tex/generic/hyphen/grmhyph5.tex
> +runpattern f texmf/tex/generic/hyphen/grphyph5.tex
>
> Ok, fine with me.

grahyph is part of ancientgreek, so it's probably better to list two files.

> But what is with all the files in
>        texmf/doc/generic/elhyphen/*
> there are quite a lot:
>  texmf/doc/generic/elhyphen/README
>  texmf/doc/generic/elhyphen/anc-test.ltx
>  texmf/doc/generic/elhyphen/anc-test.pdf
>  texmf/doc/generic/elhyphen/ancient.pdf
>  texmf/doc/generic/elhyphen/compound.ltx
>  texmf/doc/generic/elhyphen/compound.pdf
>  texmf/doc/generic/elhyphen/copyrite.txt
>  texmf/doc/generic/elhyphen/modern.pdf
>
> Are they obsolete with gr*5* ??

It's simply that I didn't consider any docs when generating the
tlpsrc. I added them, though I suspect that there might be some
gr*hyp* documentation as well. If nothing else, one could regenerate
documentation with outline fonts. The content is "all Greek to me".
Dimitrios?

> --- /src/TeX/texlive-svn/Master/tlpkg/tlpsrc/hyphen-hungarian.tlpsrc    2008-06-24 22:17:49.000000000 +0200
> +++ hyphen-hungarian.tlpsrc     2008-06-28 17:43:03.000000000 +0200
> @@ -1,4 +1,3 @@
>  name hyphen-hungarian
>  category TLCore
>  execute AddHyphen name=hungarian lefthyphenmin=2 righthyphenmin=2 file=loadhyph-hu.tex
> -docpattern f texmf/doc/generic/huhyphen/*

runpattern or docpattern??? (you have filepattern in other languages)

> Same with these files:
>  texmf/doc/generic/huhyphen/huhyphn.pdf
>  texmf/doc/generic/huhyphen/hyph_hu.dic
>  texmf/doc/generic/huhyphen/searchforerrors.rb
>  texmf/doc/generic/huhyphen/testhyphenation.rb
>
> Still useful/necesary? Or remove them and the pattern?

Docs are never too much. I have added it, but it looks like I need to
contact the author anyway (is NAGY Bence a male or a female name?)

> --- /src/TeX/texlive-svn/Master/tlpkg/tlpsrc/hyphen-latin.tlpsrc        2008-06-17 15:57:47.000000000 +0200
> +++ hyphen-latin.tlpsrc 2008-06-28 17:43:03.000000000 +0200
> @@ -1,4 +1,3 @@
>  name hyphen-latin
>  category TLCore
>  execute AddHyphen name=latin lefthyphenmin=2 righthyphenmin=2 file=loadhyph-la.tex
> -runpattern f texmf/tex/generic/hyphen/lahyph.tex
>
> Ok, you mentioned that, so remove the file and the runpattern line?

Yes. And then hope for the best.

> --- /src/TeX/texlive-svn/Master/tlpkg/tlpsrc/hyphen-polish.tlpsrc       2008-06-24 22:51:36.000000000 +0200
> +++ hyphen-polish.tlpsrc        2008-06-28 17:43:03.000000000 +0200
> @@ -1,4 +1,3 @@
>  name hyphen-polish
>  category TLCore
>  execute AddHyphen name=polish lefthyphenmin=2 righthyphenmin=2 file=loadhyph-pl.tex
> -runpattern f texmf/tex/generic/hyphen/plhyph.tex
>
> I guess that has to stay due to mex etc. Can you somehow add that magic
> to your tlpsrc generation code?

What about adding that magic to mex.tlpsrc? Only as a suggestion.

name mex
category Package
depend pl
depend hyphen-polish
runpattern +!d texmf-dist/tex/mex/utf8mex
docpattern +!d texmf-dist/doc/mex/utf8mex


> --- /src/TeX/texlive-svn/Master/tlpkg/tlpsrc/hyphen-russian.tlpsrc      2008-06-17 16:01:04.000000000 +0200
> +++ hyphen-russian.tlpsrc       2008-06-28 17:43:03.000000000 +0200
> @@ -1,4 +1,3 @@
>  name hyphen-russian
>  category TLCore
> -depend ruhyphen
>  execute AddHyphen name=russian lefthyphenmin=2 righthyphenmin=2 file=loadhyph-ru.tex
>
> What about that? The package ruhyphen ships loads of files:
> srcfiles size=15
>  texmf-dist/source/generic/ruhyphen/BUGS
>  texmf-dist/source/generic/ruhyphen/Makefile
>  texmf-dist/source/generic/ruhyphen/README
>  texmf-dist/source/generic/ruhyphen/README.ruhyphal
>  texmf-dist/source/generic/ruhyphen/hyphen.rules
>  texmf-dist/source/generic/ruhyphen/mkcyryo
>  texmf-dist/source/generic/ruhyphen/reduce-patt
>  texmf-dist/source/generic/ruhyphen/sorthyph
>  texmf-dist/source/generic/ruhyphen/sortkoi8
>  texmf-dist/source/generic/ruhyphen/trans
> runfiles size=62
>  texmf-dist/tex/generic/ruhyphen/catkoi.tex
>  texmf-dist/tex/generic/ruhyphen/cyryoal.tex
>  texmf-dist/tex/generic/ruhyphen/cyryoas.tex
>  texmf-dist/tex/generic/ruhyphen/cyryoct.tex
>  texmf-dist/tex/generic/ruhyphen/cyryodv.tex
>  texmf-dist/tex/generic/ruhyphen/cyryomg.tex
>  texmf-dist/tex/generic/ruhyphen/cyryovl.tex
>  texmf-dist/tex/generic/ruhyphen/cyryozn.tex
>  texmf-dist/tex/generic/ruhyphen/enrhm2.tex
>  texmf-dist/tex/generic/ruhyphen/hypht2.tex
>  texmf-dist/tex/generic/ruhyphen/koi2koi.tex
>  texmf-dist/tex/generic/ruhyphen/koi2lcy.tex
>  texmf-dist/tex/generic/ruhyphen/koi2ot2.tex
>  texmf-dist/tex/generic/ruhyphen/koi2t2a.tex
>  texmf-dist/tex/generic/ruhyphen/koi2ucy.tex
>  texmf-dist/tex/generic/ruhyphen/ruenhyph.tex
>  texmf-dist/tex/generic/ruhyphen/ruhyphal.tex
>  texmf-dist/tex/generic/ruhyphen/ruhyphas.tex
>  texmf-dist/tex/generic/ruhyphen/ruhyphct.tex
>  texmf-dist/tex/generic/ruhyphen/ruhyphdv.tex
>  texmf-dist/tex/generic/ruhyphen/ruhyphen.tex
>  texmf-dist/tex/generic/ruhyphen/ruhyphmg.tex
>  texmf-dist/tex/generic/ruhyphen/ruhyphvl.tex
>  texmf-dist/tex/generic/ruhyphen/ruhyphzn.tex
>
> Can we delete the whole package? Are these files still necessary?
>
> I.e., are all these files *ONLY* for the patterns, and not for actual
> processing later on?

I'm not sure. I need to take a look. We do not depend on these files,
but maybe there are some useful macros inside, and I suspect that user
is able to modify a file to specify which encoding to use for
patterns. There is one default, but we do not provide the same
mechanisms for switching the encoding. It's similar to Bulgarian
(which only ships with support for a single encoding by default), but
it might be that people need other files at runtime as well. Arthur
did the conversion, but ... I would leave the package there. I'll add
the dependency.

> Thanks Mojca for all your work, and sorry that I am asking soooo many
> questions. But I don't want to mess it up now, not completely.

There are never too many questions :)
It has clarified a lot.

Mojca


More information about the tex-hyphen mailing list