Sorry, I didn't realize you were discussing the UTF-8 encoding.
My bad.

Regards,
Jim


> -----Original Message-----
> From: owner-lua-l@tecgraf.puc-rio.br 
> [mailto:owner-lua-l@tecgraf.puc-rio.br] On Behalf Of 
> lua+Steven.Murdoch@cl.cam.ac.uk
> Sent: Monday, December 02, 2002 11:57 AM
> To: Multiple recipients of list
> Subject: Re: lua for unicode 
> 
> 
> > > The null character ('\0' in C) is represented in Unicode as a
> > > single zero byte.
> > 
> > I believe it's a null word, not byte.
> 
> In the UTF-8 encoding it is a null byte/octet. All the ASCII characters
> (0-127) are represented as one byte, hence maintaining backwards
> compatibility with ASCII.
> 
> The null byte occurs in no other situation: no multi-byte character
> contains it in its encoding.
> 
> Steven Murdoch.
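
The claim in the quoted reply (a zero byte appears in UTF-8 only as the encoding of U+0000) can be checked with a short C sketch. The names utf8_encode and count_zero_bytes_outside_nul are hypothetical helpers written for this illustration, not part of Lua or any library, and the encoder does not reject surrogate code points, so treat it as a rough check rather than a validating implementation.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical helper: encode one code point as UTF-8.
 * Returns the number of bytes written (1-4), or 0 if cp > 0x10FFFF.
 * Assumption: surrogates (U+D800..U+DFFF) are not rejected here. */
size_t utf8_encode(uint32_t cp, unsigned char out[4])
{
    assert(out != NULL);
    if (cp < 0x80) {                      /* ASCII: one byte, unchanged */
        out[0] = (unsigned char)cp;
        return 1;
    }
    if (cp < 0x800) {                     /* two bytes: 110xxxxx 10xxxxxx */
        out[0] = (unsigned char)(0xC0 | (cp >> 6));
        out[1] = (unsigned char)(0x80 | (cp & 0x3F));
        return 2;
    }
    if (cp < 0x10000) {                   /* three bytes */
        out[0] = (unsigned char)(0xE0 | (cp >> 12));
        out[1] = (unsigned char)(0x80 | ((cp >> 6) & 0x3F));
        out[2] = (unsigned char)(0x80 | (cp & 0x3F));
        return 3;
    }
    if (cp <= 0x10FFFF) {                 /* four bytes */
        out[0] = (unsigned char)(0xF0 | (cp >> 18));
        out[1] = (unsigned char)(0x80 | ((cp >> 12) & 0x3F));
        out[2] = (unsigned char)(0x80 | ((cp >> 6) & 0x3F));
        out[3] = (unsigned char)(0x80 | (cp & 0x3F));
        return 4;
    }
    return 0;
}

/* Count the code points other than U+0000 whose UTF-8 encoding
 * contains a zero byte. */
unsigned long count_zero_bytes_outside_nul(void)
{
    unsigned long hits = 0;
    for (uint32_t cp = 1; cp <= 0x10FFFF; cp++) {
        unsigned char buf[4];
        size_t n = utf8_encode(cp, buf);
        for (size_t i = 0; i < n; i++) {
            if (buf[i] == 0) {
                hits++;
                break;
            }
        }
    }
    return hits;
}
```

Every lead byte of a multi-byte sequence has its top two bits set and every continuation byte has its top bit set, so count_zero_bytes_outside_nul() finds no code point other than U+0000 with a zero byte in its encoding. That is why UTF-8 text can be stored in an ordinary NUL-terminated C string.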