[Date Prev][Date Next][Thread Prev][Thread Next]
[Date Index]
[Thread Index]
- Subject: Re: UTF-8 patterns in Lua 5.3
- From: Tim Hill <drtimhill@...>
- Date: Fri, 18 Apr 2014 01:02:12 -0700
On Apr 17, 2014, at 7:25 PM, William Ahern <william@25thandClement.com> wrote:
>
> This hasn't been helped by Windows' and Java's early adoption of UTF-16,
> because they engender the idea that Unicode text can be indexed just like
> ASCII as long as your datatype is wide enough. Yes, many people understand
> that UTF-16 won't fit all the codepoints, but they erroneously think all one
> needs to do is upgrade to a wider character type, et voila, you've mastered
> I18N.
>
To be fair to MS, they designed Windows for UCS2 way before Unicode moved to UTF-16 and overflowed the 16-bit codespace. I put the blame squarely on the Unicode people, who have yet again shown that when you try to be everything to everyone you end up being nothing to no-one.