Re: Reloadable Lua "Modules"

lua-l archive

[Date Prev][Date Next][Thread Prev][Thread Next] [Date Index] [Thread Index]

Subject: Re: Reloadable Lua "Modules"
From: Rici Lake <lua@...>
Date: Tue, 13 Feb 2007 12:55:18 -0500

On 13-Feb-07, at 10:46 AM, Jerome Vuarand wrote:

David Given wrote:

However, security issues are much harder. Lua basically
doesn't do security.
It's possible to very easily set up sandboxes where you can
deny access to, say, loadfile() and require(), but ensuring
that you haven't accidentally left holes in your sandbox that
allow privileged escalation is infeasibly hard. So you can
make it *difficult* for your programmers to break things, but
it's very hard to make it *impossible*. Frankly, if your
programmers aren't going to be actively malicious, I'd be
inclined not to bother --- if you enforce decent coding
standards and bite people's heads off if they access things
they shouldn't, they should get the message.

I don't agree with your statement that it's impossible to make a lua

state completely secure. If your users have only the possibility toload

Lua code, you can execute their code in a sandbox, where each access to
globals goes through proxies which ensure you're not trying to do
malicious things. This means you can even let access to most of the
basic Lua API. You can even prevent your user scripts from entering
infinite loop by adding instruction count hooks. Overall I think
ensuring total security in Lua is easy.

I'd be happy to provide code examples if you can give me a situation
that seems problematic.

It seems to me that there are a couple of problematic issues, mostlyhaving to do with the module system.

The key issue is library tables. Untrusted code can modify values inlibrary tables; this is not necessarily malicious so one doesn't reallywant to block it (they might simply be adding more string methods, forexample), but it can also interfere with other code.

One solution is to provide each sandbox with its own instance of everylibrary table. That's straightforward for most standard libraries; anefficient solution (which unfortunately prevents sandboxed code from

iterative introspection on the contents of a library table) is:

  function addlib(sandbox, lib)
    sandbox[lib] = setmetatable({}, {
      __metatable = false,
      __index = require(lib)
    }
  end

  function addglobal(sandbox, glob)
    sandbox[glob] = getfenv()[glob]
  end

  local function words(s) return s:gmatch"%S+" end
  function Sandbox()
    local sandbox = {}
    -- The ... here is not literal :)
    for w in words[[math string table coroutine ...]] do
      addlib(sandbox, w)
    end
    for w in words[[ipairs next pairs ...]] do
      addglobal(sandbox, w)
    end
    return sandbox
  end

There are some obvious improvements that could be made, but that'sclose enough for a demonstration.

Unfortunately, that's not good enough.

First, we've carefully protected the global "string" from beingpermanently altered. However, while the sandbox can still modify theirlocal "string" table, such changes don't add OO-style calls to strings,which require the additional methods to be added to the original stringtable, which is available as getmetatable"".__index. That is availablebecause we haven't protected the string metatable; we can do so byadding a __metatable key to it, but that only prevents damage; itdoesn't give the sandbox the flexibility of adding string methods.[Note 1]

So, are our only options to cripple the sandbox or to leave it open? Ifthat's true, Rings are starting to look more attractive.

We could certainly come up with a solution for the particular case ofthe string library table, at least if we're prepared to run sandboxesin their own lua_thread (although this is complicated for callbacks),by replacing the string __index metamethod with a function whichreferences the current thread's "global" string table. Unfortunately,that would severely slow down OO-style string method calls;furthermore, it's hard to see how to generalize that.

Up to now, I've only mentioned the built-in libraries, which arerelatively amenable to special case fixes. But what about Lua orextension modules loaded with require()?

Here we confront two issues. First, require() doesn't actually use thepackage.loaded table to look up cached modules; it actually uses atable stored in the Registry. package.loaded is initialized to thattable, but redefining package.loaded does not change the behaviour ofrequire(). So in order to allow sandboxes to use require(), we're goingto have to provide a wrapper around require(). But that's going tocreate some other issues.

First, not all modules can realistically be loaded more than once. Thatmight be considered a design flaw in such modules -- it usually has todo with the module maintaining a global state -- but it will definitelyrequire module-by-module analysis in order to get sandboxing to workcorrectly. On the whole, it should be possible for a sandbox to atleast use require to load Lua modules from some protected filespace;otherwise, we're adding a significant level of complexity to sandboxedcode (the inability to be split into modules, for example).

This is complicated by the fact that many modules are created using thebuilt-in function module() with the package.seeall facility. That willleak globals unless the require() is called inside the sandbox. [Note2]

A sandboxing issue unrelated to the package system is the fact thathooks are relative to lua_threads (coroutines), not to the globalstate. If a sandbox's execution time is limited by running it with acount hook, for example, the sandbox could avoid that simply bycreating a new coroutine and running the resource-intensive code in thecoroutine. One wouldn't want to prevent the use of coroutines bysandboxed code -- coroutines are far too useful to simply throw away --but it may be necessary to provide an alternative implementation of thevarious coroutine methods in order for newly created coroutines toinherit hooks. [Note 3]

The bottom line is that it is very easy to create a fully securesandbox in Lua, as long as you're prepared to severely restrict the useof Lua in the sandbox. If you wish to sandbox while preserving as manyuseful features of the language as possible, it may well be thatseparate states are a better solution, even though that inhibits datatransfer.

---- Notes

1) The problem with the string metatable could possibly be fixed bymaking the vector of basic-type metatables part of the (thread)lua_State rather than the global lua_State. But that won't help with,for example, the metatable for files created by io.open(). Of course,the io library is likely to be replaced in any sandbox, but a similarissue exists for any unprotected metatable for any object-type -- say atype based on tables, and created by some extension library (either Luaor C).

One might wish that such libraries uniformly protected metatables, butthe reality is that few do so; to save module-by-module inspection, theeasy solution is to simply generate a new instance of the extensionlibrary for each sandbox, but as mentioned above that may conflict withsingleton constraints.

2) A solution proposed by David Manura in the LuaDesignPatterns page onthe Wiki involves creating a new package table by copying keys from thepackage environment table created by module(). This leaves the closuresin the package able to access the outer globals, while preventingleakage to the package consumer. That will work well in many cases, butwill fail if package state is maintained as a key in the package table-- something which should be avoided in any case, but which doeshappen.

3) This is one of the reasons I believe that Lua needs a lighter-weightcoroutine implementation, which differs from the existing threadimplementation by not having as much state (eg., no hooks) and whichcan be implemented directly in the Lua VM rather than through arecursive call into the Lua VM.

Follow-Ups:
- RE: Reloadable Lua "Modules", Jerome Vuarand

References:
- RE: Reloadable Lua "Modules", Jerome Vuarand

Prev by Date: RE: Problems linking LuaFileSystem module on Windows.
Next by Date: RE: Reloadable Lua "Modules"
Previous by thread: RE: Reloadable Lua "Modules"
Next by thread: RE: Reloadable Lua "Modules"
Index(es):
- Date
- Thread