[XeTeX] XeTeX/xdvipdfmx: PDF text copying with double encoded fonts

Jiang Jiang gzjjgod at gmail.com
Mon Jun 12 18:53:55 CEST 2017


Hi Akira,

Let me quote the originator:

> I have tried some PDF viewers. With \XeTeXgenerateactualtext=1,
> Adobe Reader DC, Adobe Acrobat DC will give the correct result, while SumatraPDF v3.1.2, Windows Reader App (阅读器), Microsoft Edge and Microsoft Word 2016 will not.

> My OS is Windows 10 1607 (Build 14393.1198).

So there might still be value to fix the ToUnicode map, don't you think?

I have an experimental patch at
https://github.com/jjgod/texlive/commit/f01557d549aaf27584f624fa540f6b4b05349bf3
in case you would like to build a w32tex binary for him to test.
(Actually you can ignore rest of the change and only take the
is_PUA_or_presentation() change.)

On Mon, Jun 12, 2017 at 6:59 AM, Jiang Jiang <gzjjgod at gmail.com> wrote:
>
> On Mon, Jun 12, 2017 at 12:38 AM Akira Kakuto <kakuto at fuk.kindai.ac.jp>
> wrote:
>>
>> Dear Jiang,
>>
>> > There has been a report
>> >
>> > https://github.com/CTeX-org/ctex-kit/issues/286
>>
>> % XeTeX
>> \XeTeXgenerateactualtext=1
>> \font\1="Source Han Sans SC"
>> \font\2="Source Han Serif SC"
>> \font\3="Microsoft YaHei"
>>
>> {\1 孤立子 ABC} \par
>> {\2 孤立子 ABC} \par
>> {\3 孤立子 ABC} \par
>>
>> \bye
>>
>> I tested the above example on Windows 7.
>> Used fonts are
>> (1) SourceHanSansSC-Regular.otf
>> (2) SourceHanSerifSC-Regular.otf
>> (3) msyh.ttf
>>
>> I'm afraid that I don't understand the problem correctly.
>> My Results:
>>
>> In the case of \XeTeXgenerateactualtext=0, copy&paste
>> is wrong for "立" in fonts (1) and (2), in Adobe Reader DC.
>> In the case of \XeTeXgenerateactualtext=1, copy&pase
>> is OK in Adobe Reader DC for all fonts (1), (2) and (3).
>
>
> Thank you for trying this, that's what I thought, do you mind sending me the
> PDF file generated with \XeTeXgenerateactualtext=1? Chances are the
> originator didn't use it properly.
>
> - Jiang
>
>>
>>
>> Best,
>> Akira
>>
>



More information about the XeTeX mailing list