- Subject: Re: UTF-8 identifiers [was: Re: Lua t-shirts]
- From: Jerome Vuarand <jerome.vuarand@...>
- Date: Mon, 5 Oct 2009 11:40:26 +0200
2009/10/5 Ico <lua@zevv.nl>:
>
> * On 2009-10-05 David Given <dg@cowlark.com> wrote :
>
>> I'm still pushing for my proposal to allow all characters with codes
>> in the range 128-255 to be considered valid in names. This would
>> allow identifiers to contain UTF-8 sequences.
>
> Interesting idea. What's the catch, if there is one ?
The Lua lexer uses the C isalpha function, so depending on the locale
some of the characters in 128-255 are already valid. You could change
it to accept every character in 128-255 in addition to the 'alpha' class
and underscore, but that would be a hack, IMHO.
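To make the proposed rule concrete, here is a minimal sketch of what such a
check could look like. It is not taken from the Lua sources: the function
names is_name_start, is_name_byte and is_valid_name are made up for
illustration, and the real lexer is structured differently. The only rule
added on top of the usual ctype tests is "any byte >= 128 is accepted".

```c
#include <ctype.h>

/* Hypothetical helper (not from the Lua sources): may byte c start a name?
   Accepts the locale's 'alpha' class, underscore, and any byte >= 128. */
static int is_name_start(int c) {
    unsigned char uc = (unsigned char)c;  /* ctype functions need 0..UCHAR_MAX */
    return isalpha(uc) || uc == '_' || uc >= 128;
}

/* Hypothetical helper: may byte c appear after the first byte of a name?
   Same as above, but digits are allowed too. */
static int is_name_byte(int c) {
    unsigned char uc = (unsigned char)c;
    return isalnum(uc) || uc == '_' || uc >= 128;
}

/* Returns 1 if the NUL-terminated string s is a valid name under the
   relaxed rule, 0 otherwise. No UTF-8 validation is attempted. */
static int is_valid_name(const char *s) {
    if (*s == '\0' || !is_name_start(*s))
        return 0;
    for (s++; *s != '\0'; s++) {
        if (!is_name_byte(*s))
            return 0;
    }
    return 1;
}
```

With this sketch, is_valid_name("caf\xc3\xa9") accepts the UTF-8 encoding of
"café", but so does any other run of high bytes, including malformed UTF-8
or Latin-1 text. That is why it is a hack: the check never looks at what the
bytes mean.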
On the other hand, as far as I can tell, making the lexer Unicode-aware
(so that you can use iswalpha) would require a major overhaul,
especially since Lua strings are not strings of characters but strings
of bytes (and are often used to hold non-textual binary data).
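The bytes-versus-characters gap can be shown with a small sketch. The helper
utf8_codepoints below is a made-up name, assumes its input is well-formed
UTF-8, and does no validation; it only counts the bytes that are not
continuation bytes (10xxxxxx).

```c
#include <stddef.h>

/* Hypothetical helper: counts code points in a NUL-terminated string that
   is assumed to be well-formed UTF-8. Continuation bytes have the bit
   pattern 10xxxxxx, so every other byte starts a new code point. */
static size_t utf8_codepoints(const char *s) {
    size_t n = 0;
    for (; *s != '\0'; s++) {
        if (((unsigned char)*s & 0xC0) != 0x80)
            n++;
    }
    return n;
}
```

For the string "caf\xc3\xa9", strlen reports 5 bytes while utf8_codepoints
reports 4 characters. A byte-oriented lexer only ever sees the 5 bytes, so
calling iswalpha would first require decoding them into code points.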