Re: Poor man's benchmarking issue

lua-l archive

[Date Prev][Date Next][Thread Prev][Thread Next] [Date Index] [Thread Index]

Subject: Re: Poor man's benchmarking issue - help needed
From: Lorenzo Donati <lorenzodonatibz@...>
Date: Wed, 31 Aug 2011 13:46:39 +0200

On 31/08/2011 11.32, TNHarris wrote:

Lorenzo Donati<lorenzodonatibz@interfree.it>  wrote:


According to the following benchmarks, built-in math.floor and
math.ceil
are slower than their counterpart rewritten using the core % operator!

Partly because you're measuring table lookups. Alias math.floor and math.ceil to locals and the times I get go from 40% slower to just 25% slower.


Ouch! Silly me! I knew that!

My code is "littered" by local foo = lib.foo statements! :-)

My original benchmark did have locals. I did that mistake when pullingit out of context in order to post it :-(


Also, that ce2 function is wrong when x%1 equals 0.


Uh! Right. I was too hasty.

Let's try again:

----------------------------------------------
local StatTime = require 'ld.debug'.StatTime

local floor = math.floor
local ceil = math.ceil

local function flo( x ) return floor( x ) end
local function flo2( x ) return x - x%1 end

local function ce( x ) return ceil( x ) end
local function ce2( x )
   local fp = x%1
   if fp == 0 then return x end
   return x - fp + 1
end


print( StatTime( 1e8, { flo, flo2 }, 5.5 ) )
  --> 2.275e-007	1	0.675
print( StatTime( 1e8, { ce, ce2 }, 5.5 ) )
  --> 2.3765e-007	1	0.745
----------------------------------------------


Still quite slower.


BTW I was contacted by PM and I got this objection:

[...]

Your function "flo2" calculates the
floor of a number. But the function "flo" calls another function which
then calculates the floor. That is adding an extra function call to
"flo", which distorts the benchmark. You should compare the two
functions which calculate floor directly, i.e, math.floor on the one
hand and your function flo2 on the other.

Instead of this
    print(StatTime(1e8,{ flo, flo2 }, 5.5))
    print(StatTime(1e8,{ ce, ce2 }, 5.5))

Try this
    print(StatTime(1e8,{ math.floor, flo2 }, 5.5))
    print(StatTime(1e8,{ math.ceil, ce2 }, 5.5))

and you will see that the bult in functions are indeed faster.

[...]

But I don't think this objection is correct, since I did it on purposeto wrap floor and ceil in flo and ce, in order to compare them on parwith flo2 and ce2, because I wanted to compare them against the idioms

using %, not to an hypothetical reimplementation of ceil and floor using %.

And so it seems that my conclusion is valid: there is no advantage inusing floor and ceil instead of using % directly, except whenreadability is important (admittedly most of the times when one is usingLua and not LuaJIT).

Anyway this is a big gotcha, IMHO: one wouldn't expect that a libraryfunction would be slower than a do-it-yourself alternative.

But the question about "useless fat" in Lua remains: why then log10 wasdropped and ceil and floor remain? It's only a matter of statisticalrelevance? (I suppose that most people use floor and ceil much moreoften than log10). I remember Lua team, replying to people complainingabout the missing log10 that they could roll their own. Why floor andceil are so special?



thanks!
-- Lorenzo

P.S.: has anyone given a look at my benchmarking function? I hope Ididn't do something too stupid there (especially with the collectgarbagesequence etc.)

References:
- Poor man's benchmarking issue - help needed, Lorenzo Donati
- Re: Poor man's benchmarking issue - help needed, TNHarris

Prev by Date: Re: Game Developing for Android
Next by Date: Re: Acceptable indicies
Previous by thread: Re: Poor man's benchmarking issue - help needed
Index(es):
- Date
- Thread