[tex-live] Problems with non-7bit characters in filename

Reinhard Kotucha reinhard.kotucha at web.de
Sun Jul 6 02:50:12 CEST 2014


On 2014-07-05 at 13:31:58 +0200, Ulrike Fischer wrote:

 > Regardless if I'm in the command line, in a command line with 
 > chcp 65001, in a msys bash (with \\ there):  miktex works, texlive
 > not.

Does MiKTeX allow to create a file (\openout) whose mame contains
characters which are not supported by the 8-bit codepage currently
in use, e.g. '日本語.foo'?  If this works then MiKTeX accesses the
filesystem directly and doesn't consult any codepages.

I tried chcp 65001 some time ago on Windows7 but it had no effect.
Subversion stores comments in UTF-8.  So when I ran

  svn log

on the command line, non-ASCII characters were not displayed properly.
I expected that chcp 65001 solves the problem but it didn't.
Does anybody know more?

 > > Problems occur when user interfaces are involved which are not
 > > aware of Unicode.  Sure, if you know which encoding is used you
 > > can convert any filenames to Unicode.  But what about \openout?
 > > You can convert any 8-bit character encoding to Unicode but not
 > > vice versa.
 > > 
 > > What do you expect to happen if you create a file
 > > 
 > >   "Äöü-Русский язык-日本語"
 > 
 > You don't need not convince me of the benefits of utf8. I would be
 > happy if there was a button to switch to it in windows.

Sure, but what I meant is that you cannot convert Unicode to ISO-8859.
This is not a problem when reading a file.  The problem occurs when
you create a file.  Hence it's not sufficient to consider reading
only.  

BTW, what I prefer is to be able to use UTF-8 on Windows *without* the
need to press a button.  There is no such button on Linux and I don't
miss it.  It's nice that everything works out-of-the-box.

 > > I also don't hesitate to use non-ASCII characters in file names.
 > 
 > Also in file names that you want to \input in a pdflatex file? I
 > doubt this ;-). 

Well, I'm using pdftex for existing files only or for testing TeX
macros which don't require Lua or OpenType fonts.  (pdftex is a bit
faster).

 > I don't hesitate to use non-ASCII in files that are be handled only
 > by me and one application. As soon as such files should go to other
 > persons and be processed by more than one application (e.g. latex +
 > biber) I avoid it.

Yes, when I give files away I'm careful too, especially because I
don't know anything about the recipient's environment.  And I expect
problems on Windows.  But I don't worry about names of files which I
download from the internet.  It's simply too much work to rename them
all.

Regards,
  Reinhard

-- 
------------------------------------------------------------------
Reinhard Kotucha                            Phone: +49-511-3373112
Marschnerstr. 25
D-30167 Hannover                    mailto:reinhard.kotucha at web.de
------------------------------------------------------------------




More information about the tex-live mailing list