Re: Will Lua kernel use Unicode in the future?

lua-l archive

[Date Prev][Date Next][Thread Prev][Thread Next] [Date Index] [Thread Index]

Subject: Re: Will Lua kernel use Unicode in the future?
From: Chris Marrin <chris@...>
Date: Sat, 31 Dec 2005 07:24:24 -0800

Jens Alfke wrote:

(I've replied to a bunch of messages here rather than sending out sixseparate replies...)
On 29 Dec '05, at 9:17 AM, Chris Marrin wrote:
It allows you to add "incidental" characters without the need for afully functional editor for that language. For instance, when Iworked for Sony we had the need to add a few characters of Kanji onoccasion. It's not easy to get a Kanji editor setup for a westernkeyboard, so adding direct unicode was more convenient. There arealso some oddball symbols in the upper registers for math andchemistry and such that are easier to add using escapes.
Also, in some projects there are guidelines that discourage the use ofnon-ascii characters in source files (due to problems with editors,source control systems, or other tools. In these situations it'sconvenient to be able to use inline escapes to specify non-asciicharacters that commonly occur in human-readable text ... exampleswould include ellipses, curly-quotes, emdashes, bullets, currencysymbols, as well as accented letters of course.

But we're moving into an era where support of English text (the onlylanguage in existance that fits into ASCII) is not good enough. I usedto work at Sony, where this issue is magnified about a million timescompared to "western" languages. The notion of "optional" support fornon-ascii characters was never acceptable.

whisper@oz.net wrote:
IMO, with globalization, languages that don't support Unicode won'tmake the cut in the long run.
I find it ironic that the three non-Unicode-savvy languages I use (PHP,Ruby, Lua) all come from countries whose native languages use non-asciicharacters :)

But I think Lua mostly does a great job of supporting Unicode with it'sagnostic approach, because it allows UTF8 sequences to pass throughunchallenged... mostly. The few rough edges being discussed here arereally minor. Escaping unicode would simply make it more practical tohandle the full unicode range. Making it easier to tell the underlyingclib to use UTF8 for collating avoids putting platform specific codeoutside Lua. And allowing non-ascii character identifiers takes thisrestriction away from non-English speakers. Small changes just totighten up the i18n support.


--
chris marrin              ,""$, "As a general rule,don't solve puzzles
chris@marrin.com        b`    $  that open portals to Hell" ,,.
        ,.`           ,b`    ,`                            , 1$'
     ,|`             mP    ,`                              :$$'     ,mm
   ,b"              b"   ,`            ,mm      m$$    ,m         ,`P$$
  m$`             ,b`  .` ,mm        ,'|$P   ,|"1$`  ,b$P       ,`  :$1
 b$`             ,$: :,`` |$$      ,`   $$` ,|` ,$$,,`"$$     .`    :$|
b$|            _m$`,:`    :$1   ,`     ,$Pm|`    `    :$$,..;"'     |$:
P$b,      _;b$$b$1"       |$$ ,`      ,$$"             ``'          $$
 ```"```'"    `"`         `""`        ""`                          ,P`

References:
- Re: Will Lua kernel use Unicode in the future?, Roberto Ierusalimschy
- Re: Will Lua kernel use Unicode in the future?, Chris Marrin
- Re: Will Lua kernel use Unicode in the future?, Jens Alfke

Prev by Date: Re: lua_lock/lua_unlock
Previous by thread: Re: Will Lua kernel use Unicode in the future?
Next by thread: Re: Will Lua kernel use Unicode in the future?
Index(es):
- Date
- Thread