[Date Prev][Date Next][Thread Prev][Thread Next]
- Subject: Re: Plea for the support of unicode escape sequences
- From: Dirk Laurie <dpl@...>
- Date: Wed, 29 Jun 2011 21:09:48 +0200
On Wed, Jun 29, 2011 at 08:33:28PM +0200, Petite Abeille wrote:
> On Jun 29, 2011, at 7:16 PM, Mike Pall wrote:
> > Deciding not to support UTF-8 escapes in 2011 is
> > rather anachronistic.
> What's wrong with hex sequences?
> print( '\xE2\x86\x92' )
> Or literals?
> print( '←' )
> Plus, the OP requested unicode sequences, e.g. \u2192 instead of \xE2\x86\x92 for →. Doesn't seem worth the trouble.
Yes, I don't see the point of supporting UTF-8 sequences while e.g
the pattern "[ａ-ｚ]*" doesn't do what you expect.
Just for clarification: is it true that UTF-8 is locale-independent
if you take care not to use any one-byte characters above '\x7f'?