Re: unicode support in lua

lua-l archive

[Date Prev][Date Next][Thread Prev][Thread Next] [Date Index] [Thread Index]

Subject: Re: unicode support in lua
From: David Jones <drj@...>
Date: Thu, 26 Apr 2007 14:38:12 +0100


On 26 Apr 2007, at 13:35, David Kastrup wrote:


It may also be considered somewhat counterintuitive that the call

unicode.utf8.byte(unicode.utf8.char(5000))

returns 5000, something which naive people like myself would not
exactly choose to call a "byte".

string.char and string.byte are inverses, and it seems sensible toextend this inverse into the unicode.utf8 domain.

When I implemented Lua in Java, strings were implemented usingjava.lang.String (so using Java's 16-bit unsigned char type). I tooka similar position, string.byte returned an integer between 0 and 65535.

string.byte should probably be named string.code to avoid anyemotional attachment to byte.

Whilst almost all bytes are 8-bit (octets), byte does have othermeanings apart from 8-bit number. Google for "14-bit byte", etc.


David Jones

Follow-Ups:
- Re: unicode support in lua, David Given
- Re: unicode support in lua, David Kastrup

References:
- unicode support in lua, Bertrand Mansion
- Re: unicode support in lua, David Kastrup
- Re: unicode support in lua, Klaus Ripke
- Re: unicode support in lua, David Kastrup
- Re: unicode support in lua, Klaus Ripke
- Re: unicode support in lua, David Kastrup
- Re: unicode support in lua, Klaus Ripke
- Re: unicode support in lua, David Kastrup

Prev by Date: Re: Lua string
Next by Date: Re: Lua string
Previous by thread: Re: unicode support in lua
Next by thread: Re: unicode support in lua
Index(es):
- Date
- Thread