Re: Res: Script speed improvement advice

lua-l archive

[Date Prev][Date Next][Thread Prev][Thread Next] [Date Index] [Thread Index]

Subject: Re: Res: Script speed improvement advice
From: Nathaniel Trevivian <nathanielt@...>
Date: Fri, 13 Feb 2009 17:44:16 +0000

Hi - first post here!
I could really do with some advice on how to speed up a script I'vewritten.
Basically, the script does a string.find on a large piece of text(500 words-ish - sometimes more) for occurrences of any word from alist of over 40,000.At the moment, I'm going through a table which contains the 40,000"check words" and performing this string.find on each one.
Something similar to:
===========================
local #d = getTagValues(); --returns 40,000 entry table of stringvalues
for i=1,#d do
m, document = checkForItem(d[i], document); --check to see ifdocument contains word from table
if m == true then
if ll == "FULL" then
logln(os.date("%Y/%m/%d %H:%M:%S")..": Found item: "..d[i]);
end
end
end
===========================
This is taking anything up to 15 seconds or more. That soundsfairly quick, but this is actually a bottle neck in an extremelyquick system.Is there a quicker way to do what I'm doing here? Ideally it woulddo all 40k checks in a second - at the mo I would be grateful for 3seconds!
I realise that if there isn't then I should perhaps consider adifferent methodology altogether - which I am doing, but we'veinvested quite a lot in this script and it would be great if itcould really perform.
Thanks for any help.

Nathan Trevivian
___________________________________________________________________________
The information contained in this message is for the intendedaddressee onlyand may contain confidential and/or privilegedinformation. If you are not the intended addressee, please deletethis message and notify the sender; do not copy or distribute thismessage or disclose its contents to anyone.
Any views or opinions expressed in this message are those of theauthor and do not necessarily represent those of GateWest New MediaLtd.
____________________________________________________________________________
I think you could do some preprocessing, as putting the words
in a different array according its first letter, and then checkingonly
the words which have the same initial letter. This should decrease the
number of checking/iterations.

Sérgio

Thanks, Sérgio.

Unfortunately, I need to check the document for occurrences of allwords/phrases in the list.


Is there perhaps a more efficient loop I could be using?

Once again -thanks for your help. Much appreciated.

Follow-Ups:
- Re: Res: Script speed improvement advice, Norman Ramsey

References:
- Script speed improvement advice, Nathaniel Trevivian
- Res: Script speed improvement advice, Sérgio Medeiros

Prev by Date: Re: Script speed improvement advice
Next by Date: Re: Script speed improvement advice
Previous by thread: Re: Res: Script speed improvement advice
Next by thread: Re: Res: Script speed improvement advice
Index(es):
- Date
- Thread