[Date Prev][Date Next][Thread Prev][Thread Next]
[Date Index]
[Thread Index]
- Subject: Re: LPEG and indentation based syntax
- From: roberto@... (Roberto Ierusalimschy)
- Date: Mon, 16 Apr 2007 10:19:06 -0300
> Does anyone have any bright ideas on how to parse indentation-based
> syntaxes with LPEG. [...]
Conventional backreferences (indexed from the beginning of the pattern)
is not very meaningful in LPeg, but I guess that a backreference indexed
from the beginning of the rule could solve your problem.
Assume that lpeg.Bs(patt) stores a backreference to the substring that
matched 'patt', and that lpeg.Br(n) matches the 'n'-th stored substring
in the current rule. Then I think the next pattern would do your trick:
-- Not tested (as lpeg does not support Bs-Br...)
S = lpeg.P" "
line = -S * (1 - lpeg.P"\n")^0 * "\n"
block = { -- defines an idented block
lpeg.Bs(S^0) * line -- capture identation of first line
* ( lpeg.Br(1) * line -- block may contain lines with the same identation
+ #(lpeg.Br(1) * S^1) * lpeg.V(1) -- or idented blocks
)^0
}
I am not much in favor of adding backreferences to LPeg. But, if that
code is correct, it would be the second example of a "useful" application
for them (the first being long strings in Lua).
-- Roberto