[Date Prev][Date Next][Thread Prev][Thread Next]
[Date Index]
[Thread Index]
- Subject: Re: LPeg support for utf-8
- From: Tony Finch <dot@...>
- Date: Fri, 1 Apr 2011 19:08:17 +0100
Roberto Ierusalimschy <roberto@inf.puc-rio.br> wrote:
> A quick survey, for those who care:
> - should LPeg support utf-8?
> - If so, what would that mean?
An alternative to lpeg.P(N) which matches N UTF-8 encoded code points
instead of octets. Similarly, alternatives to lpeg.R and lpeg.S that deal
with code points instead of octets. Maybe lpeg.uP and .uS and .uR ?
Perhaps there should be a .uB as well. I would prefer this to a "unicode
mode" which changes the behaviour of the existing funcctions.
Tony.
--
f.anthony.n.finch <dot@dotat.at> http://dotat.at/
Bailey: Southwest 5 to 7, veering west 4 or 5. Rough or very rough. Showers.
Moderate or good.