lua-users home
lua-l archive

[Date Prev][Date Next][Thread Prev][Thread Next] [Date Index] [Thread Index]

On Fri, Aug 7, 2015 at 10:59 AM, Coda Highland <> wrote:
> On Fri, Aug 7, 2015 at 10:56 AM, Javier Guerra Giraldez
> <> wrote:
>> On Fri, Aug 7, 2015 at 12:48 PM, Coda Highland <> wrote:
>>> Oh, there quite is:
>> ah, ok.  yes, this is avoidable in some cases, like CAS and simple
>> atomics.  still, the time penalty is the invalidation request that
>> must be propagated to interested cores. (fortunately, this is only
>> sent to those other cores that actually hold the relevant address in
>> cache)
>> the numbers i've shared are totally "lock- and cas-free"  :-)
> Unfortunately, the cores that hold the affected region in cache are
> your own other threads -- in other words, the very things you're most
> concerned about the performance of.
> /s/ Adam

That is to say, that's why you should minimize the number of locks you
have to use, which is exactly what batching stuff up and signaling
does, which is exactly what you're doing. This statement was meant in
support of your results, not opposition to it. :P

/s/ Adam