Re: Lua bytecode doesn't compress well (with LZMA)

lua-l archive

[Date Prev][Date Next][Thread Prev][Thread Next] [Date Index] [Thread Index]

Subject: Re: Lua bytecode doesn't compress well (with LZMA)
From: KHMan <keinhong@...>
Date: Tue, 01 Dec 2009 17:46:34 +0800

Juris Kalnins wrote:

Squish actually uses this as its main output filter,  (there's more
than one, such as the one that replaces Lua keywords with single bytes
> 128) - and it works really well.


And it's very simple to modify llex.c to parse such alternative tokens,
so that reduced scripts can be used directly (btw, Lua also has 20+ unused
bytes from the 0..31 control character range).

I have one query for Matthew, the author of squish. Want to findout (the lazy way) a few data points (always love data points) toget a feel of things:

Does anyone have a set of data points for squish, where we comparedata-compressed sizes of sources, where:

(1) keyword token replacement filter, versus
(2) no keyword token replacement

Now, in (2), LZ coding would zap most keywords into a slidingdictionary match code, whereas in (1), the initial size of thefiltered sources will be smaller but there is more variation inthe symbol frequencies of the source code (token symbols added)and less chance to make sliding dictionary matches.

So, would there be a big difference when we compare compressedsources? Say, we tabulate results as:

(1) original
(2) token filtered
(3) original, compressed
(4) token filtered, compressed

--
Cheers,
Kein-Hong Man (esq.)
Kuala Lumpur, Malaysia

Follow-Ups:
- Re: Lua bytecode doesn't compress well (with LZMA), Matthew Wild

References:
- Re: Lua bytecode doesn't compress well (with LZMA), Matthew Wild
- Re: Lua bytecode doesn't compress well (with LZMA), Juris Kalnins

Prev by Date: Re: "dynamic" closures
Next by Date: Re: Portably iterate over a directory
Previous by thread: Re: Lua bytecode doesn't compress well (with LZMA)
Next by thread: Re: Lua bytecode doesn't compress well (with LZMA)
Index(es):
- Date
- Thread