Re: What do you miss most in Lua

lua-l archive

[Date Prev][Date Next][Thread Prev][Thread Next] [Date Index] [Thread Index]

Subject: Re: What do you miss most in Lua
From: David Given <dg@...>
Date: Tue, 07 Feb 2012 11:13:13 +0000

HyperHacker wrote:
[...]
> I do think a simple UTF-8 library would be quite a good thing to have
> - basically just have all of Lua's string methods, but operating on
> characters instead of bytes.

What do you mean by a 'character'? A Unicode code point? A grapheme
cluster?

If you split the string on code points you'll end up breaking grapheme
clusters in the middle, which will break any combining characters. If
you split the string on grapheme clusters you'll preserve the ability to
do random access into the string, but your string manipulation library
now becomes hideously heavyweight: grapheme clusters can be *any length*
(although there seems to be a promise that normalised Unicode won't have
any grapheme clusters longer than 32 code points).

The standard intuition that strings are made up of an array of
characters is, unfortunately, not really true in Unicode. It's basically
not possible to do random access into a Unicode string without jumping
through painful hoops.

-- 
┌─── ｄｇ＠ｃｏｗｌａｒｋ．ｃｏｍ ───── http://www.cowlark.com ─────
│ "I have always wished for my computer to be as easy to use as my
│ telephone; my wish has come true because I can no longer figure out
│ how to use my telephone." --- Bjarne Stroustrup

Attachment: signature.asc
Description: OpenPGP digital signature

Follow-Ups:
- Re: What do you miss most in Lua, Tim Mensch

References:
- Re: What do you miss most in Lua (was: Why isn't Lua more widely used?), sergei karhof
- Re: What do you miss most in Lua (was: Why isn't Lua more widely used?), Jay Carlson
- Re: What do you miss most in Lua, Miles Bader
- Re: What do you miss most in Lua, Jay Carlson
- Re: What do you miss most in Lua, Miles Bader
- Re: What do you miss most in Lua, HyperHacker

Prev by Date: Re: Avoiding FFI- allocations + using SSE-vectors
Next by Date: Re: luasocket udp multicast working example?
Previous by thread: Re: What do you miss most in Lua
Next by thread: Re: What do you miss most in Lua
Index(es):
- Date
- Thread