[luatex] problem with slnunicode's find
mailing_list at arcor.de
Tue Mar 2 18:58:32 CET 2010
Am 02.03.2010 16:11, schrieb Patrick Gundlach:
>> is a bit misleading, since just unicode.utf8.find is again not
> This is not true. It just returns the position in bytes. What would
> you suggest the following statement returns?
> str="aö" unicode.utf8.find(str,"\182") -- (ö's utf8 values are 195
> and 182)
Nil, or even better error out, since the second argument is invalid. Do
you think 3 is a sensible result?
This can lead to subtle errors, e.g., when strings are represented in
UTF-8 in a programme, but a user accidentally inputs a pattern in, say,
latin1. A wrong match might be undiscovered for a long time, while a
non-match might catch the user's attention instantly.
Is there a function in slnunicode that checks a string for UTF-8 compliance?
More information about the luatex