lua-users home
lua-l archive

[Date Prev][Date Next][Thread Prev][Thread Next] [Date Index] [Thread Index]

On Fri, Jan 7, 2011 at 3:55 PM, Javier Guerra Giraldez <> wrote:
Unicode is defined at 32bit at most (i think), but UTF-8 needs more
that 4 octets to encode 32 bits.  UTF-8 is defined up to 6 octets (5
'trailing' bytes on this snippet)

Not in the latest standard. It's limited to 4 octets per character.