[luatex] Why does LuaTeX show the middle dot instead of ano teleia?
Heiko Oberdiek
heiko.oberdiek at googlemail.com
Tue Aug 29 20:59:33 CEST 2023
Hello,
On 2023-08-29 20:29, Joseph Wright wrote:
> On 29/08/2023 19:27, Heiko Oberdiek via luatex wrote:
>>
>> using LuaTeX to review the glyphs of a font, I discovered an oddity
>> about U+0387 ANO TELEIA. LuaTeX shows U+00B7 MIDDLE DOT instead.
>> \symbol{"00B7}% MIDDLE DOT
>> \symbol{"0387}% ANO TELEIA
> From UnicodeData.txt:
>
> 0387;GREEK ANO TELEIA;Po;0;ON;00B7;;;;N;;;;;
>
> so it looks like it's a simple normalisation.
Start of the UnicodeData.txt format description
(https://www.unicode.org/reports/tr44/#UnicodeData.txt):
[0] Code value
[1] Character name
[2] General category
[3] Canonical combining classes
[4] Bidirectional category
[5] Character decomposition
...
In the LuaTeX manual, I found:
| Normalization of the Unicode input is on purpose not built-in and
| can be handled by a macro package during callback processing.
| We have made some practical choices and the user has to
| live with those.
The TeX input above, however, is plain ASCII. Therefore, any
normalization of the file contents should not matter.
Of course, I do not want to have any decomposition that replaces
the glyph with a different character. That would make reviewing
the original glyph impossible.
Yours sincerely
Heiko
More information about the luatex
mailing list.