Re: bottom-up processing

lua-l archive

[Date Prev][Date Next][Thread Prev][Thread Next] [Date Index] [Thread Index]

Subject: Re: bottom-up processing
From: Richard Hundt <richardhundt@...>
Date: Fri, 06 Nov 2009 15:27:49 +0100

spir wrote:

Hello,

I'm new to Lua and the list. Jump on the opportunity to comment and ask about the following:

Roberto Ierusalimschy wrote:

Lua does not build an AST; it generates code on the fly.


I'm rather surprised to read this. Thought, actually, that I was the only one to use such an approach for big jobs like that. I have never read about the topic, but anyway my knowledge in computer science lies at the zeroth level ;-)

I'd like to know how the method used in Lua code generation compares to the following:
* Each pattern possibly assign a "post-parse node action" (or several) to each node they will generate.
* This action transforms the node (or rather its data), including possibly collapsing whole branches into what becomes a simple leaf.
* That way, each node is responsible to pass itself in the most convenient form to higher levels.

This method is incredibly easy and powerful. Usually, it will not only help converting or restructuring nodes or branches, instead directly produce the desired result, be it returning a computed value or performing an action such as writing xhtml from wiki lang. I call that "bottom-up processing", mirroring "top-down parsing".
In a personal parsing tool (in python), I use it intensively to generate parsers. The meta-grammar specifies such actions so that the meta-parser does all the job -- actually the generator itself is a func of ~ 15 lines mainly instantiating a result parser with proper params.


[...]

I think what you're describing is essentially an "attribute grammar", soyou may find [1] interesting, or just google around for the term.

Before using this method, I long tried the hard way of walking parse trees, from outside and from the top.
Well, sorry if all of this is well-known to the ones of you possibly interested in the topic. The opportunity was too tempting. I'd really love pointers if any. But above all, I'm really curious about the method used in Lua to generate code while parsing.

If the language is fairly simple, you can do everything in one pass. Luadoesn't do any hoisting (e.g. in JavaScript you can call a functionbefore it is declared), and the register allocation is more stack-like(so no graph colouring or linear scan algorithms, or similar, whichrequire later analysis of variable lifetimes).


[DISCLAIMER]

I haven't studied Lua's compiler code, so YMMV, but I'm going for thegeneral principle here.

The basic strategy is to emit instructions when you've collected enoughinformation during parsing to do so, simple statements and expressionslike: local a = 41; a = a + 1, can have code generated for them at theend of the statement/expression (1 token lookahead is nice to havehere), others require recursing down and then generating the opcodes abit later.

I can imagine that there is also a certain amount of back patching ofinstructions and meta-data (variable life-time, upvalues, etc.) in thegenerated bytecode stream besides the usual forward jumps stuff forconditional branching.

It does mean that the parser and code generator are very tightlycoupled, though, but it also makes for fast, small memory footprintcompilation.



Cheers,
Richard Hundt

[1] http://www.cs.uiowa.edu/~fleck/agExample.pdf

Follow-Ups:
- Re: bottom-up processing, Roberto Ierusalimschy

References:
- bottom-up processing, spir

Prev by Date: Re: mochalua heap memory issues?
Next by Date: Re: help to (slightly) modify Lua interpreter
Previous by thread: bottom-up processing
Next by thread: Re: bottom-up processing
Index(es):
- Date
- Thread