Descender frequency question

Paulo Ney de Souza pauloney at gmail.com
Sun May 5 05:10:05 CEST 2024


Barbara brings a very good point. My numbers apply ONLY to TimesRoman
Regular

[image: Screenshot from 2024-05-04 20-05-08.png]

where the "f" does not have descenders. So, yes, this is not the case for
TimesRoman Italic and the numbers will be slightly different.

In any case the count is trivial with:

   tr -cd 'f' < ODE-words.txt | wc -m

Paulo Ney




On Sat, May 4, 2024 at 4:45 PM barbara beeton <bnb at tug.org> wrote:

> On Sat, 4 May 2024, Paulo Ney de Souza wrote:
>
> > From one of the simplest scripts in the world using "tr" and "wc" to
> count
> > the amount of gjpqy in the texts, and then running on a gazillion of
> texts
> > to see the distribution and frequencies.
> > It is no secret that distribution of letters in a language are very
> uniform.
> >
> > Paulo Ney
> >
> > On Sat, May 4, 2024, 3:48 PM Norbert Preining <norbert at preining.info>
> wrote:
> >       Nice answer, where did you get these numbers from?
>
> But, may I remind you, that the italic alphabet differs from the roman
> form, and in math texts, there is a lot of italic, so at least "f"
> should be included.  That would likely kick the frequency up a bit for
> math texts.
>                                                 -- bb
>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://tug.org/pipermail/texhax/attachments/20240504/2d9bc973/attachment-0001.htm>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: Screenshot from 2024-05-04 20-05-08.png
Type: image/png
Size: 22980 bytes
Desc: not available
URL: <https://tug.org/pipermail/texhax/attachments/20240504/2d9bc973/attachment-0001.png>


More information about the texhax mailing list.