lua-users home
lua-l archive

[Date Prev][Date Next][Thread Prev][Thread Next] [Date Index] [Thread Index]


David Given <dg@cowlark.com> writes:
> Which is why, of course, people should not be using UCS-2 or UTF-16 for
> anything. In fact, I'd suggest not using UCS-4 either --- it encourages
> shortcuts in handling Unicode that aren't actually valid, like assuming
> you can split strings anywhere. UTF-8 FTW.

Totally agree, which is why the unicode string constant notation in
C++0x is a bit of a shame:  the most "obvious" notation, u"xxx", yields
a _UTF-16_ wide-character string constant; one must write u8"yyy" to get
a proper UTF-8 constant string.  This is particularly awkward given the
long historical emphasis on, and generally much better handling of,
char* strings in C/C++.

I imagine MS lobbied for this though, and nobody else cared enough to
resist...

Oh well.

-miles

-- 
"She looks like the wax version of herself."
     	   	    		   [Comment under a Paris Hilton fashion pic]