- Subject: Re: is string.gmatch(), string.upper() 7-bit ascii only?
- From: Roberto Ierusalimschy <roberto@...>
- Date: Thu, 7 Apr 2016 10:27:57 -0300
> I am trying to manipulate text with umlauts. string.upper() does not produce the upper-case versions of umlauts such as ä, ö, ü, etc.
>
> Also, the %g pattern, when used in string.gmatch(), does not match these umlauts.
>
> Is there anything that can be done about it? Or, am I making a stupid mistake?
http://www.lua.org/manual/5.3/manual.html#6.4
[...]
The string library assumes one-byte character encodings.
If you are using an 8-bit encoding (e.g., Latin-1), then these functions
should work, given a proper locale. Otherwise (e.g., UTF-8), you will need
an external library.
-- Roberto
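
To illustrate the point about one-byte encodings, here is a minimal sketch, assuming Lua 5.3 or later (for the standard utf8 library) and UTF-8 text. It shows that the string library sees "ä" as two bytes, so string.upper() and %g do not treat it as a letter in the default "C" locale. The helper upper_umlauts and its mapping table are hypothetical names made up for this example; they cover only the three umlauts from the question and are not a substitute for a real Unicode library. The locale name "de_DE.ISO-8859-1" is also an assumption, since locale names and availability depend on the platform.

```lua
-- Assumes Lua 5.3+ and UTF-8 encoded text.
os.setlocale("C")  -- make the byte-wise behaviour predictable

local a_uml = utf8.char(0xE4)   -- "ä", two bytes (0xC3 0xA4) in UTF-8

print(#a_uml)                   -- length in bytes, not in characters
print(utf8.len(a_uml))          -- length in code points
print(a_uml:upper() == a_uml)   -- string.upper works byte by byte
print(a_uml:find("%g"))         -- %g tests each byte on its own

-- Hypothetical lookup table: lower-case umlaut -> upper-case umlaut.
local upper_map = {
  [utf8.char(0xE4)] = utf8.char(0xC4),  -- ä -> Ä
  [utf8.char(0xF6)] = utf8.char(0xD6),  -- ö -> Ö
  [utf8.char(0xFC)] = utf8.char(0xDC),  -- ü -> Ü
}

-- Hypothetical helper: walk the string one UTF-8 sequence at a time,
-- map the known umlauts, and let string.upper handle the ASCII letters.
local function upper_umlauts(s)
  return (s:gsub(utf8.charpattern, function(ch)
    return upper_map[ch] or ch:upper()
  end))
end

print(upper_umlauts("gr" .. utf8.char(0xFC) .. "n"))

-- With an 8-bit encoding, a matching locale may be enough.
-- os.setlocale returns nil when the requested locale is not installed.
local previous = os.setlocale(nil, "ctype")
if os.setlocale("de_DE.ISO-8859-1", "ctype") then
  print(("\228"):upper() == "\196")  -- Latin-1 bytes for ä and Ä
  os.setlocale(previous, "ctype")    -- restore the earlier setting
end
```

For anything beyond a handful of known characters (full case mapping, character classes, normalization), an external Unicode library is the more robust choice, as the reply above suggests.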