[Date Prev][Date Next][Thread Prev][Thread Next]
[Date Index]
[Thread Index]
- Subject: Re: Plea for the support of unicode escape sequences
- From: Miles Bader <miles@...>
- Date: Fri, 01 Jul 2011 13:55:35 +0900
David Given <dg@cowlark.com> writes:
> Which is why, of course, people should not be using UCS-2 or UTF-16 for
> anything. In fact, I'd suggest not using UCS-4 either --- it encourages
> shortcuts in handling Unicode that aren't actually valid, like assuming
> you can split strings anywhere. UTF-8 FTW.
Totally agree, which is why the unicode string constant notation in
C++0x is a bit of a shame: the most "obvious" notation, u"xxx", yields
a _UTF-16_ wide-character string constant; one must write u8"yyy" to get
a proper UTF-8 constant string. This is particularly awkward given the
long historical emphasis on, and generally much better handling of,
char* strings in C/C++.
I imagine MS lobbied for this though, and nobody else cared enough to
resist...
Oh well.
-miles
--
"She looks like the wax version of herself."
[Comment under a Paris Hilton fashion pic]