Re: Extending C++ classes under Lua

lua-l archive

[Date Prev][Date Next][Thread Prev][Thread Next] [Date Index] [Thread Index]

Subject: Re: Extending C++ classes under Lua
From: Don Hopkins <dhopkins@...>
Date: Fri, 15 Sep 2006 21:51:37 -0700

mark gossage wrote:

Hi folks,
We all seem to be looking at the same kind of thing. Basically extending a C++ object within Lua, and being able to call it within either C++ or Lua.  I think with a bit of effort, putting our heads together we should be able to crack this one.

Here is a quick summary of the current state of SWIG-LUA:
When SWIG wrappers a class as a full userdata. The userdata holds a pointer to the object, a pointer to the SWIG_TYPE (an internal type structure) and a flag to point out if this object should be GC'ed.
It then adds a metadata to the object, which holds all the methods, as well as a bunch of functions to read/write the attributes (which is why you can access the attributes naturally). This metatable is held in the registry and actually shared between all instances of the object.

Thanks for describing how SWIG-Lua works. It's a great piece of work,and I'm glad to be using it.

The various SWIG back-ends have different approaches and abilities,depending on the target language. Lua and Python share a lot in thingsin common. I've used the SWIG-Python, and I'm just learning Lua andSWIG-Lua. The Python back-end is pretty complicated (python.cxx is127067 bytes) compawred to the current Lua back-end (37,054). But with alanguage like Lua, small and simple is a good thing, and I hope we cankeep it that way!

One of the abilities that the SWIG-Python back-end supports is togenerate Python wrapper classes ("shadow classes") around the lowerlevel procedural entry points, in such a way that you can subclass themfrom Python.

Python only has one standard object system (unless you countZope/CMF/Plone, which have about six and a half between them ;-), butLua lets you roll your own object system, so SWIG can't make as manyassumptions about the kind of Lua object wrappers to generate, and ithas to be more flexible.

One cool thing about SWIG is that a lot of it is written with typemaplibraries. You can hook into typemaps and and extend them, and it'sflexible enough that the back-end can define new kinds of typemaps thatlet the user hook into the wrapper generation in language specific ways.It would be great if the SWIG-Lua back-end would generate Lua wrapperfiles, so you could write typemaps to tailor the wrappers for whateverkind of Lua object system you're using. Of course it should included alibrary of typemaps to support the best practices (whatever those are --is there any consensus?).

I like Ariel's idea of having a function to setup the inheritence properly, I think this is a simple, but usable solution.
Because of the was the current SWIG wrappers, it has to remain a userdata with a metatable.
But I reckon, that the metatable should not be shared, and it could be used to hold the new attributes and new functions.
I am not so clear on what we do with the C++ (one problem at a time), but keeping a ref to the metatable seems like a good idea. Don, I seem to remember that you suggested having a ref from the C++ back to the userdata, that might work. But I think it might mess up the GC as the C++ has a ref to the userdata, so the userdata cannot be GC'ed.

I've taken a look at how tolua++ works, and it's quite nice and wellintegrated with Lua, and solves lots of these problems. It has a"setpeer" function that in Lua 5.1 uses the "setfenv" call to attach anenvironment table (i.e. the Lua class) to the userdata object, withoutstomping on its metatable. But that means the metatable has to be in onthe conspiracy, and know about delegating to the environment (peerobject) as well as cutting the C++ object in on its part of the action.We could modify the SWIG back-end to have a hook in the wrapper functionSWIG_Lua_class_get, so you can write a delegation typemap that tries theenvironment table. Like tolua_event.c's class_index_event does:


       lua_getfenv(L,1);
       if (!lua_rawequal(L, -1, TOLUA_NOPEER)) {
           lua_pushvalue(L, 2); /* key */

lua_gettable(L, -2); /* on lua 5.1, we trade the"tolua_peers" lookup for a gettable call */

           if (!lua_isnil(L, -1))
               return 1;
       };

I'm still trying to figure this out, and writing it down helps work itout, so please tell me if I've got anything wrong or missed anything.

tolua++ uses a weak value table to map from lua wrapper (peer) objectsto the corresponding userdata's (which can be gc'ed). In Lua 5.1, theenv slot on the userdata directly and efficiently links the other way.

I haven't been able to figure out by reading the code if or how lua++makes sure there are never two userdata's referring to the same C++object. I think it's worth the extra effort it takes to ensure thatthere's a 1:1 mapping between C++ objects and wrappers (interninguserdata), because that enables you to hang extra properties off of thewrapper objects, and they won't go away or get confused with multiplewrappers around the same object.

I have commited in the the SWIG CVS a new file which helps for writing callbacks:
http://swig.cvs.sourceforge.net/swig/SWIG/Lib/lua/lua_fnptr.i?revision=1.1&view=markup
(there is also an example of using it in CVS)

This might be of you for some of you.

Can some of you help me with some more idea's on how to address the rest of the issues?

Thanks,
Mark

Thanks for posting the typemaps -- they're so cool! Your SWIGLUA_REFtypemap is similar to the one I wrote to handle references. But I justassumed there was one global interpreter named "L", and stored plaininteger Lua object reference ids in my C++ members (making a "typedefint LuaRef" to let SWIG know what I meant). And I had to call theequivalent to your swiglua_ref_clear function in my C++ object'sdestructor to make sure the corresponding ref's get cleaned up.

The primary LuaRef I was using for each C++ object was a reference toitself in Lua-land (the peer object), so it was easy for the C++ objectto pass the peer object back to the handlers as the "self" argument, andto access Lua properties attached to the peer object (like instancevariables and callback methods), and stuff like that.

An alternative more gc-friendly approach would be to use a weak tablelike tolua++ is doing. My stupid objects are owned by the application,created by a factory and destroyed through helper functions, and I don'tcurrently intend for the Lua programmer to create and destroy themdirectly with new and delete (at least at this stage) or for the Lua GCto collect them. Of course that would be nice once we figure out thebest way to do that, but it would be best to support both approaches toobject ownership (the application owns, creates and destroys theobjects, versus the Lua programmer can call the class's new and object'sdelete methods, and the Lua GC controls their lifetimes.

I'm still learning Lua and trying to get my head around it, especiallyhow the meta-object programming stuff works, and how metatables,userdata and native code extensions interact.

One question I have about Lua OOP in general is: why (in the examples onthe Wiki) is there both a metatable, and also a separate method table(which the metatable points to with its __index attribute)? This seemskind of wastefully JavaScripty, and not as minimally Selfish as I'dexpect. Is there any reason not to put methods and class variablesdirectly into the metatable, and dispense with the extra method table inthe metatable's __index attribute?

Lua is a lot like Self. But one difference is that Self lets you markany slot as inheritable, so you can implement multiple inheritance byhaving multiple parent slots (with different names of course). Like theuserdata has both a metatable and an env slot (but the env slot isgeneral purpose and not automatically inherited from, so the metatablehas to know how to delegate to the env for "dual inheritence" to work).What we're trying to do by binding together Lua "peer" object tables(the Lua object) with Lua userdata objects (the C++ object), C++metatables (the C++ behavior) and Lua scripted classes (the Luabehavior), seems a lot like multiple inheritance, where we first checkthe "peer" table for user defined attributes/methods, then check the Luaclass for scripted attributes/methods, then the C++ metatable for nativecode attributes/methods.


userdata:

object => pointer to C++ object, represents object binary data andnative codemetatable => C++ metatable behavior, dispatches attributes andmethods into binary data and native code

   env => Lua peer table, represents scripted side of object

peer's properties: scripter defined instance variables,callbacks, etc.

      peer's metatable => Lua class behavior
         class's properties: scripter defined class variables and methods
         class's metatable => Lua superclass behavior, etc...

global weak value dictionary:
 lua peer table (strong) => userdata (weak)

Then the C++ object needs a way to get to the corresponding userdata.That could be:


global weak value dictionary:
 integer address of C++ object => userdata (weak)

Or it we could use the (more efficient?) approach of putting an integerreference to the userdata in the C++ object, like I was doing withLuaRef's. But that may leak memory (and it imposes on the design of theC++ object, making it harder to wrap uncooperative code, i.e. librarieswritten by other people), so it might be better to use a weak dictionaryto map from C++ object addresses to userdata's. (Or does Lua internuserdata's automatically? That'd be nice!)


   -Don

Follow-Ups:
- Re: Extending C++ classes under Lua, Root
- Re: Extending C++ classes under Lua, Javier Guerra

Prev by Date: Inheritance with Closure-based Classes
Next by Date: Re: Extending C++ classes under Lua
Previous by thread: Inheritance with Closure-based Classes
Next by thread: Re: Extending C++ classes under Lua
Index(es):
- Date
- Thread