> is a bit misleading, since just unicode.utf8.find is again not Unicode-aware.

This is not true. It just returns the position in bytes.
What would you suggest the following statement returns?

str="aö"
unicode.utf8.find(str,"\182") -- (ö's utf8 values are 195 and 182)

Patrick
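
To make the byte-level view concrete, here is a minimal sketch that uses only the standard string library. It deliberately does not call unicode.utf8.find, since that function's behaviour is the point under discussion. The helper is_continuation is a hypothetical name introduced for illustration; it is not part of any library.

```lua
-- "aö" written with explicit byte escapes so the source-file encoding does not matter.
local str = "a\195\182"   -- bytes: 97 ("a"), 195 and 182 ("ö")

-- Plain (non-pattern) byte search with the standard string library.
local first, last = string.find(str, "\182", 1, true)
print(#str)          -- length of str in bytes
print(first, last)   -- byte positions of the match

-- Hypothetical helper: true if the byte at position i is a UTF-8
-- continuation byte (bit pattern 10xxxxxx, i.e. 128..191).
local function is_continuation(s, i)
  local b = string.byte(s, i)
  return b ~= nil and b >= 128 and b < 192
end

print(is_continuation(str, first))
```

A byte-oriented search finds "\182" in the middle of the encoded "ö", so any character-aware find has to decide whether to report such a match, reject it, or translate the position into characters.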