Re: checking for Not a Number?

lua-l archive

[Date Prev][Date Next][Thread Prev][Thread Next] [Date Index] [Thread Index]

Subject: Re: checking for Not a Number?
From: Rici Lake <lua@...>
Date: Wed, 27 Dec 2006 16:09:19 -0500


On 26-Dec-06, at 11:51 PM, Joe Smith wrote:

"Luiz Henrique de Figueiredo" <lhf@tecgraf.puc-rio.br> wrote inmessage 20061221072046.A23167@lua.tecgraf.puc-rio.br">news:20061221072046.A23167@lua.tecgraf.puc-rio.br...
Directly compairing floating point numbers for equality is *always*a bad
idea.
Only if the numbers are supposed to be the same number but are theresult
of two different computations.
For example, you can find that a floating point value which is not a
NaN is not equal to itself.
I find this very surprising. And possibily wrong, ie against IEEE 754.
The explanation that followed made no sense to me. It would makesense forfloating point values held in *different* variables, but not for the*same*variable. It seems to me that the gcc docs is spreading unnecessaryFUD.
But I must be missing something...
I've heard this same explanation elsewere too. (Somebody saw thisproblem in production code).Apparently, the registers used in the FPU are oversized, and thuscalculations performed there-in have morie bits than the datatypeshould.Thus when compairing a value in the register to the truncated value inthe memory, they can be non-equal.I lack access to the IEEE 754 standard, and cannot make heads or tailsof much of the ISO C standard when it talks about floating pointnumbers, so I'm not certain if this behavior violates either standard,but I've seen this documented more than once.
On the other hand, it is highly unlikely that the code listed wouldever be affected by this, but it is still troubling.

I can't see how (x == x) could yield the incorrect answer. The warningthat a number may not be equal to "itself" should probably beinterpreted as meaning that two different computations of the samevalue, even if they are textually identical, may result in differentanswers. Consequently, the code:


double q;
q = 3.0/7.0
if (q == 3.0/7.0) printf("Equal!\n");
else printf("Not equal!\n");

might print either "Equal!" or "Not equal!". However, note that q isnot compared to itself, but rather compared to a similar computation.(As the reference below mentions, the result may be different withdifferent compiler settings; unless optimization is disabled, most Ccompilers would compute the value 3.0/7.0 at compile time.)

The example was taken from<http://cch.loria.fr/documentation/IEEE754/ACM/addendum.html>, and thatlink in turn comes from<http://www.vinc17.org/research/extended.en.html>. Those are both worthreading, and can be found in various places with various URLs.

To cut to the chase, though, the particular behaviour (use of 80-bitinternal precision on x86) is part of the floating point status; it canbe set by (non-standard) library functions. Different OSs providedifferent default floating point modes. While it would be nice ifstandard numerical libraries worked regardless of the floating pointmode, and deviation from this could well be considered a bug, the factremains that many library implementations will fail in various ways ifthe floating point mode is not as they expect it to be.

Regardless, the use of floating point to store integers is not at allproblematic, except possibly for the lack of an integer divideprimitive. As long as a computation (and all intermediate computations)can be represented exactly in 53 bits, the result will be entirelypredictable regardless of floating point mode.

It is also worth mentioning that integer arithmetic is subject tosimilar inconsistencies. In particular, it's not at all uncommon to seecode like this:


Foo *make_foo_vector(size_t n) {
  if (n * sizeof(Foo) > MAX_ALLOC)
    return NULL; /* Or issue an error */
  return malloc(n * sizeof(Foo));
}

This is, of course, incorrect and may result in the successful returnof a vector of less than n Foo's, which may in turn lead to abuffer-overrun. On machines where size_t is <= 32 bits, the code couldbe corrected by doing the computation at line 2 in double-precisionfloating point, although I suppose the correct solution is to divideinstead of multiplying.

As another illustration of the perils of integer arithmetic, considerthe following incorrect code for verifying that f(n) == g(n) for allvalues of n:


int check_all_values(int (*f)(int), int (*g)(int)) {
  int n;
  for (n = INT_MIN; n <= INT_MAX; n++) {
    if (f(n) != g(n)) return 0;
  }
  return 1;
}

Follow-Ups:
- Re: checking for Not a Number?, Glenn Maynard
- Re: checking for Not a Number?, Joe Smith

References:
- checking for Not a Number?, Tom Bates
- Re: checking for Not a Number?, Luiz Henrique de Figueiredo
- Re: checking for Not a Number?, Joe Smith
- Re: checking for Not a Number?, Luiz Henrique de Figueiredo
- Re: checking for Not a Number?, Joe Smith

Prev by Date: Re: Lua FFI/ffcall Library
Next by Date: Re: Re[2]: Lua's opportunity
Previous by thread: Re: checking for Not a Number?
Next by thread: Re: checking for Not a Number?
Index(es):
- Date
- Thread