- Subject: Re: Plea for the support of unicode escape sequences
- From: Mike Pall <mikelu-1106@...>
- Date: Wed, 29 Jun 2011 19:16:15 +0200
David Kolf wrote:
> A half-baked solution (just escapes, not patterns) should be avoided in
> my opinion.
Adding an external UTF-8 pattern matching library is easy enough,
but it won't be able to extend the syntax for strings. Asking
users to insert only literal glyphs or to pre-process all UTF-8
strings is a half-baked solution.
So one either has to bite the bullet and add those 176 bytes or
one has to make the lexer extensible. The latter is not easy to
get right and would certainly need a lot more code.
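For a sense of what such an escape has to do inside the lexer, here is a
minimal sketch in C of the core step: turning one code point into its
UTF-8 bytes. This is not the code from the patch under discussion. The
name utf8_encode, its signature, and the rejection of surrogates are
assumptions made for illustration only.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical helper (not taken from any actual Lua patch):
 * encode one Unicode code point as UTF-8 into buf, which must have
 * room for at least 4 bytes. Returns the number of bytes written,
 * or 0 if cp is a surrogate or lies above U+10FFFF. */
size_t utf8_encode(uint32_t cp, unsigned char *buf)
{
  assert(buf != NULL);
  if (cp < 0x80) {                      /* 1 byte: 0xxxxxxx */
    buf[0] = (unsigned char)cp;
    return 1;
  }
  if (cp < 0x800) {                     /* 2 bytes: 110xxxxx 10xxxxxx */
    buf[0] = (unsigned char)(0xC0 | (cp >> 6));
    buf[1] = (unsigned char)(0x80 | (cp & 0x3F));
    return 2;
  }
  if (cp < 0x10000) {                   /* 3 bytes: 1110xxxx 10xxxxxx 10xxxxxx */
    if (cp >= 0xD800 && cp <= 0xDFFF)
      return 0;                         /* surrogates are not valid scalar values */
    buf[0] = (unsigned char)(0xE0 | (cp >> 12));
    buf[1] = (unsigned char)(0x80 | ((cp >> 6) & 0x3F));
    buf[2] = (unsigned char)(0x80 | (cp & 0x3F));
    return 3;
  }
  if (cp <= 0x10FFFF) {                 /* 4 bytes: 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx */
    buf[0] = (unsigned char)(0xF0 | (cp >> 18));
    buf[1] = (unsigned char)(0x80 | ((cp >> 12) & 0x3F));
    buf[2] = (unsigned char)(0x80 | ((cp >> 6) & 0x3F));
    buf[3] = (unsigned char)(0x80 | (cp & 0x3F));
    return 4;
  }
  return 0;                             /* out of Unicode range */
}
```

A lexer would call something like this after parsing the hex digits of
the escape and append the resulting bytes to the string buffer. The
escape syntax itself and the error handling are left out here.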
Ignoring the problem and doing nothing is not an option. Unicode
is here to stay. Deciding not to support UTF-8 escapes in 2011 is