- Subject: Re: is string.gmatch(), string.upper() 7-bit ascii only?
- From: Roberto Ierusalimschy <roberto@...>
- Date: Thu, 7 Apr 2016 10:27:57 -0300
> I am trying to manipulate text with umlauts. string.upper() does not produce the upper-case versions of umlauts like ä, ö, ü, etc.
> Also, the %g pattern, when used in string.gmatch(), does not match these umlauts.
> Is there anything that can be done about it? Or am I making a stupid mistake?
The string library assumes one-byte character encodings.
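
A small sketch of what "one-byte" means in practice, assuming Lua 5.3 or later (for the utf8 library) and the default "C" locale. The variable name s is only for illustration, and the string is written with byte escapes so the example does not depend on how the file is saved:

```lua
-- Sketch, assuming Lua 5.3+ (for the utf8 library) and the "C" locale.
os.setlocale("C")

-- "ä" encoded as UTF-8 is two bytes: 0xC3 0xA4 (decimal 195, 164).
local s = "\195\164"

print(#s)              -- byte count, as seen by the string library
print(utf8.len(s))     -- character count, as seen by the utf8 library
print(s:upper() == s)  -- upper() converts each byte separately
print(s:find("%g"))    -- %g is tested against each byte, not the character
```

The first two lines print 2 and 1: the string library sees two unrelated bytes where a UTF-8 reader sees one character, so neither string.upper() nor %g can treat "ä" as a letter.
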
If you are using an 8-bit encoding (e.g., Latin-1), then string.upper()
and %g should work, given a proper locale. Otherwise (e.g., UTF-8), you will need an