Re: Unpooled strings?

lua-l archive

[Date Prev][Date Next][Thread Prev][Thread Next] [Date Index] [Thread Index]

Subject: Re: Unpooled strings?
From: "Alex Davies" <alex.mania@...>
Date: Tue, 29 Apr 2008 15:42:59 +0800

----- Original Message -----From: "Jerome Vuarand"

Sent: Tuesday, April 29, 2008 11:41 AM
Subject: Re: Unpooled strings?

One last note (because I thought of that afterward): if that's the
cost of hashing the strings that is bothering you (rather than the
memory usage of copying your long strings), the hash is not computed
on the whole string but just on a few tens of characters at its
beginning (I don't remember exactly how many), so the cost of hashing
a 100kB string is the same as for a 100B string.

You should also check Mike Palls faster string hashing algorithm. Itgenerates a better hash from 3 ints in the string, but unfortunately isn'tANSI C (due to requiring unaligned loads).


http://lua-users.org/lists/lua-l/2007-12/msg00238.html

Jerome is right though. Any function in the string library except forstring.len is likely to [greatly] exceed the time taken to intern thestring, making the optimization moot.

Also there's quite a bit of complexitity there and features lost. Eg withoutmodifying the core, you could never get equality to work correctly betweenthe strings (which, due to not being interned, would always require a fullmemory compare anyway), or table hashing for that matter. Mind you, this maynot be an issue for very-likely-to-be-unique multi hundred k strings though.

One thing that puzzles me though, is if they're constant... why not justpush them at the start of the program? If it's an embedded processor I cansee the problem there, but else wise memcpy isn't -that- slow. If they'renot constant, I'm intrigued as to how you can guarantee their safety in thecase that they aren't "forgotten". The only real use I can see fornon-interned strings is to make them fully mutable, which would bring withit large performance increases for some applications.

If you're still keen though, checkhttp://lua-users.org/wiki/SpeedingUpStrings by Rici Lake. It providessomething similar to what you want, but modifies internals instead of usinguserdata (ie, should be faster). Do note though, that the project was leftas the results were not as high as expected.

- Alex

Follow-Ups:
- Re: Unpooled strings?, Chip Salzenberg

References:
- Unpooled strings?, Chip Salzenberg
- Re: Unpooled strings?, Jerome Vuarand

Prev by Date: Re: Can LuaInterface use an existing lua installation ?
Next by Date: Re: How to distribute Boyer-Moore code?
Previous by thread: Re: Unpooled strings?
Next by thread: Re: Unpooled strings?
Index(es):
- Date
- Thread