On Fri, Jan 7, 2011 at 3:55 PM, Javier Guerra Giraldez <javier@guerrag.com> wrote:
> Unicode is defined at 32 bits at most (I think), but UTF-8 needs more
> than 4 octets to encode 32 bits. UTF-8 is defined up to 6 octets (5
> 'trailing' bytes on this snippet)
Not in the latest standard. RFC 3629 limits UTF-8 to 4 octets per
character, because Unicode code points end at U+10FFFF (21 bits).
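
For illustration, here is a minimal Python sketch of the 4-octet limit,
assuming the RFC 3629 definition of UTF-8. The function names
utf8_length and trailing_bytes are made up for this example and do not
come from the snippet under discussion.

```python
def utf8_length(code_point: int) -> int:
    """Return how many octets UTF-8 (per RFC 3629) needs for one code point."""
    if not 0 <= code_point <= 0x10FFFF:
        raise ValueError("outside the Unicode range U+0000..U+10FFFF")
    if 0xD800 <= code_point <= 0xDFFF:
        raise ValueError("surrogate code points are not encodable in UTF-8")
    if code_point < 0x80:
        return 1   # 0xxxxxxx
    if code_point < 0x800:
        return 2   # 110xxxxx 10xxxxxx
    if code_point < 0x10000:
        return 3   # 1110xxxx 10xxxxxx 10xxxxxx
    return 4       # 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx


def trailing_bytes(lead: int) -> int:
    """Return the number of continuation bytes that follow a lead byte."""
    if lead < 0x80:
        return 0
    if 0xC2 <= lead <= 0xDF:
        return 1
    if 0xE0 <= lead <= 0xEF:
        return 2
    if 0xF0 <= lead <= 0xF4:
        return 3
    # 0x80..0xBF are continuation bytes, 0xC0/0xC1 would be overlong,
    # and 0xF5..0xFF (including the old 5- and 6-octet lead bytes
    # 0xF8..0xFD) are not valid under RFC 3629.
    raise ValueError("not a valid UTF-8 lead byte")


if __name__ == "__main__":
    for cp in (0x41, 0xE9, 0x20AC, 0x10FFFF):
        # Cross-check against Python's own encoder.
        assert utf8_length(cp) == len(chr(cp).encode("utf-8"))
        print(f"U+{cp:04X}: {utf8_length(cp)} octet(s)")
```

So a decoder written against the current standard never has to expect
more than 3 trailing bytes. The 5- and 6-octet forms only exist in the
older definitions.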