<div dir="ltr"><div>Thanks Mojca and Arthur,</div><div><br></div><div>The community of Macedonian TeX/LaTeX users is generally growing during the years and some of those people occasionally tex some things in Macedonian.</div><div><br></div><div>Here are some examples I am aware of:</div><div><br></div><div>1) A friend of mine recently re-texed a university Physics textbook in Macedonian and he mentioned it made it working with some hassle for the hyphenation.</div><div>2) This year, the leader of the macedonian team for an international mathematics competition


 typed the student exam paper for the first time in tex: <a href="https://artofproblemsolving.com/contests/cmc">https://artofproblemsolving.com/contests/cmc</a> (In the past, many of them either opted for MS Word or used Serbian/Bulgarian babel support.) </div><div>I cc-ed the former for a reference.<br></div><div><br></div><div>Their default choice is (still) an 8-bit engine. I think hyphenation for 8-bit will definitely be useful in the proposed form until they start using unicode-aware engines (LuaTeX, XeTeX etc.).</div><div><br></div><div>Regards,</div><div>Stojan</div><br><div class="gmail_quote"><div dir="ltr" class="gmail_attr">On Wed, 30 Dec 2020 at 22:52, Mojca Miklavec <<a href="mailto:mojca.miklavec.lists@gmail.com">mojca.miklavec.lists@gmail.com</a>> wrote:<br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">On Wed, 30 Dec 2020 at 22:48, Arthur Reutenauer wrote:<br>

> It does sound like T2A is the best choice for Macedonian,<br>

<br>

While I agree that it might be the least problematic 8-bit encoding, I<br>

don't agree that it's the best choice in 2021.<br>

<br>

By supporting T2A you are actively educating the users to stick with<br>

T2A for NEW documents without them noticing tons of issues:<br>

- accents on those two letters will likely be misplaced (and as a<br>

consequence more ugly)<br>

- you don't get any kerning around those characters<br>

- words containing any of those two letters won't be hyphenated at all<br>

... along with an extremely limited choice of available fonts, extreme<br>

difficulties to mix the document with other languages, with all other<br>

limitations of the 8-bit engines, ...<br>

<br>

I'm saying this because I've seen tons of hardcover books on<br>

bookshelves in my native language that you can easily recognise as<br>

being typeset using the wrong (OT1) encoding, with the caron on ccaron<br>

"heavily" misplaced, esp. when bold is used. (I was among those who<br>

kept using the wrong encoding for many years as well, and after I've<br>

learned about it, I probably haven't seen anyone using the correct<br>

encoding in the sources either.)<br>

<br>

The biggest problem is when things work just enough to make an<br>

impression of being ok, and then users don't feel the need to go one<br>

extra mile and learn how to make them perfect, even though they would<br>

prefer the second option if they were ever aware about it. Are users<br>

actually requesting 8-bit support, or is the addition some kind of a<br>

personal wish to satisfy everyone?<br>

<br>

That said, if there is real urgency to support 8-bit encodings ...<br>

we'll do it, of course.<br>

<br>

> Mojca will try to make an upload to CTAN by tomorrow<br>

> (Thursday) evening, otherwise we’ll work on it some time next week.<br>

<br>

This was the initial plan, but I can no longer promise to stick with<br>

it after noticing that what's in the repository at the moment is<br>

actually wrong.<br>

<br>

The initial patterns from Vasil that we found online were claiming to<br>

be using the T2A encoding, but the actual encoding was different.<br>

Arthur "reversed-engineered" the contents (with the help of the<br>

comments) by creating his own custom mapping that he provisionally<br>

called "Macedonian" and using it to convert the original into the<br>

final UTF-8 file. We were assuming that the patterns were actually<br>

used in the original form, but unless there was a custom font<br>

available somewhere, they were likely not working correctly with T2A.<br>

I looked at those original patterns again and it seems that the<br>

original patterns were using the cp-1251 encoding, so that they looked<br>

OK inside a text editor on Windows. Or at least those 7 letters that I<br>

checked seem to match cp-1251 exactly.<br>

<br>

If we really do need to support 8-bit encodings, I would keep the<br>

original UTF-8 patterns intact and just remove the incompatible<br>

patterns from the 8-bit version. But that 8-bit version needs to be<br>

done from scratch (using the existing scripts). The file currently in<br>

the master branch is apparently using cp-1251 and would not work as<br>

desired.<br>

<br>

Mojca<br>

</blockquote></div></div>