<div dir="ltr"><div>Barbara brings a very good point. My numbers apply ONLY to TimesRoman Regular</div><div><br></div><img src="cid:ii_lvsyap5a0" alt="Screenshot from 2024-05-04 20-05-08.png" width="562" height="109"><div><br></div><div>where the "f" does not have descenders. So, yes, this is not the case for TimesRoman Italic and the numbers will be slightly different. </div><div><br></div><div>In any case the count is trivial with:</div><div><br></div><div>   tr -cd 'f' < ODE-words.txt | wc -m</div><div><br></div><div>Paulo Ney</div><div><br></div><div><br><div><br></div></div></div><br><div class="gmail_quote"><div dir="ltr" class="gmail_attr">On Sat, May 4, 2024 at 4:45 PM barbara beeton <<a href="mailto:bnb@tug.org">bnb@tug.org</a>> wrote:<br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">On Sat, 4 May 2024, Paulo Ney de Souza wrote:<br>

<br>

> From one of the simplest scripts in the world using "tr" and "wc" to count<br>

> the amount of gjpqy in the texts, and then running on a gazillion of texts<br>

> to see the distribution and frequencies.<br>

> It is no secret that distribution of letters in a language are very uniform.<br>

> <br>

> Paulo Ney<br>

> <br>

> On Sat, May 4, 2024, 3:48 PM Norbert Preining <<a href="mailto:norbert@preining.info" target="_blank">norbert@preining.info</a>> wrote:<br>

>       Nice answer, where did you get these numbers from?<br>

<br>

But, may I remind you, that the italic alphabet differs from the roman<br>

form, and in math texts, there is a lot of italic, so at least "f"<br>

should be included.  That would likely kick the frequency up a bit for<br>

math texts.<br>

                                                -- bb<br>

<br>

</blockquote></div>