Roberto Ierusalimschy <roberto@inf.puc-rio.br> wrote:

> A quick survey, for those who care:
> - should LPeg support utf-8?
> - If so, what would that mean?

I would like an alternative to lpeg.P(N) that matches N UTF-8 encoded code
points instead of N octets, and similarly alternatives to lpeg.R and lpeg.S
that deal with code points instead of octets. Maybe lpeg.uP, .uS and .uR?
Perhaps there should be a .uB as well. I would prefer this to a "unicode
mode" which changes the behaviour of the existing functions.
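
To make the idea concrete, here is a rough sketch of how two of these
functions might be built on top of the existing octet-level primitives.
The names uP and uR are the hypothetical ones suggested above and are not
part of LPeg; decode is a helper invented for this example. The utf8char
pattern is simplified: it checks only the lead and continuation byte
ranges, so it does not reject overlong encodings or surrogates.

```lua
local lpeg = require("lpeg")
local P, R, C, Cmt = lpeg.P, lpeg.R, lpeg.C, lpeg.Cmt

-- One UTF-8 encoded code point, described with byte ranges only.
-- Simplified: overlong forms and surrogates are NOT rejected.
local cont = R("\128\191")
local utf8char = R("\0\127")
               + R("\194\223") * cont
               + R("\224\239") * cont * cont
               + R("\240\244") * cont * cont * cont

-- Hypothetical uP(n): match exactly n code points instead of n octets.
local function uP(n)
  local p = P(true)
  for _ = 1, n do p = p * utf8char end
  return p
end

-- Helper (invented for this sketch): decode one sequence that
-- utf8char has already matched into its code point number.
local function decode(s)
  local b1, b2, b3, b4 = s:byte(1, -1)
  if not b2 then return b1 end
  if not b3 then return (b1 - 192) * 64 + (b2 - 128) end
  if not b4 then
    return (b1 - 224) * 4096 + (b2 - 128) * 64 + (b3 - 128)
  end
  return (b1 - 240) * 262144 + (b2 - 128) * 4096
       + (b3 - 128) * 64 + (b4 - 128)
end

-- Hypothetical uR(lo, hi): match one code point in [lo, hi],
-- with the bounds given as numbers rather than as a two-octet string.
local function uR(lo, hi)
  return Cmt(C(utf8char), function(_, _, ch)
    local cp = decode(ch)
    return cp >= lo and cp <= hi
  end)
end

-- "h" plus U+00E9 is two code points but three octets.
assert(uP(2):match("h\195\169llo") == 4)
-- U+03BB (Greek small lambda) lies within U+03B1..U+03C9.
assert(uR(0x3B1, 0x3C9):match("\206\187") == 3)
```

A uS could be done the same way with a set lookup in place of the range
test. Using a match-time capture keeps the sketch short, but a built-in
version would presumably compile the ranges directly rather than call
back into Lua for every character.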

Tony.
-- 
f.anthony.n.finch  <dot@dotat.at>  http://dotat.at/
Bailey: Southwest 5 to 7, veering west 4 or 5. Rough or very rough. Showers.
Moderate or good.