Re: Setting Float Precision in Lua.c

lua-l archive

[Date Prev][Date Next][Thread Prev][Thread Next] [Date Index] [Thread Index]

Subject: Re: Setting Float Precision in Lua.c
From: KHMan <keinhong@...>
Date: Thu, 7 Jun 2018 10:04:59 +0800

On 6/6/2018 8:53 PM, Albert Chan wrote:

Why don't you compile your binaries for SSE2 only? Even easier, just compile to 64-bit binaries? Surprising you mentioned Windows uses extended precision by default when there is x64 on every 64-bit capable Intel/AMD/other chip... and has been so for many, many years already.


I already picked 53-bits roundings.
I use my own laptop behavior just as a example, same with fsum.lua

David Gay's dtoa.c strtod maybe a better example.
With 53-bits roundings, it optimized away common cases [1]

strtod("123456789e-20", NULL)
= 123456789 / 1e20   -- both numbers exactly represented in double
= 1.23456789e-012    -- division guaranteed correct rounding


Here is a different approach (the long story approach):
=======================================================

(It was bubbling in my brain so I had to type it out. If you don'tunderstand this, then I really cannot help any further.)


Say, all values are on a line.

A float double actually represents a number that lies anywhere ona segment on that line. It may be exactly the value of therepresentation, but it can also be a little more, or a littleless. All those values in a segment need to be shoehorned into onebinary representation. It's a single binary representation, yetthe values can all be different. It's an approximation.

The examples you keep offering imply exact numbers, that is, theyare points on the line. Then in the examples, the arithmeticoperation is performed, and the FPU should round and hit anotherpoint on the line. There is an expectation of mathematicalperfection or mathematical elegance.

When we work with actual numbers instead of ideal examples, wealways understand that when operations are performed, the resultvalues hardly ever hit the exact points on the line that equal abinary representation. Instead, the result value is close, withinthe segment which has that binary representation. So there iserror, and error usually accumulates.

Since a binary representation really means a segment of possiblevalues on the value line, when we do arithmetic with two segments,we end up with a bigger segment. We can have many combinations ofoperands and result within those segments and they are all validfor the binary representation. But how correct are those values?Normally we know the quality of our inputs and they are much lessthan 16 digits of precision, so we often successfully manageerrors in calculations.

But some people are of the notion that when arithmetic is done ontwo points on the value line, the result should hit an exact pointwhen such a situation arises. It appears that some people have thefirst mental model (segments), others have the second mental model(points). But if we keep thinking about all those exact points onthe line, then the problem is that values next to those pointscannot be shoehorned into beautifully exact and artificialmathematical examples.

If we want exact calculations all the time, just use floats asintegers. We can assume the integers are exact, as points on thevalue line. We also need to do things that don't mess up thismodel. But once the result has a fraction, for example when adivision is done, that value is most likely no longer exactlyrepresentable. It's an approximation.

For non-mathematicians, we work with regular numbers or data allthe time and they get processed and the end value is approximatedby the resulting binary representation. Those values do not hitthe points on the value line that are exactly the value of thebinary representations. But we have 16 decimal digits to workwith, so we format the result properly for user consumption byrounding to much less than 16 digits of precision. This is why Imentioned the concepts of engineering compromises versusmathematical perfection.

So it's no problem for most of us. But if mathematicians keepthinking about ideal situations and keep trying to hit exactpoints on the value line, then they should keep on doing so andnot bother the rest of us about it.

[snip snip snip]



--
Cheers,
Kein-Hong Man (esq.)
Selangor, Malaysia

Follow-Ups:
- Re: Setting Float Precision in Lua.c, Dirk Laurie

References:
- Setting Float Precision in Lua.c, Albert Chan
- Re: Setting Float Precision in Lua.c, KHMan
- Re: Setting Float Precision in Lua.c, Roberto Ierusalimschy
- Re: Setting Float Precision in Lua.c, Albert Chan
- Re: Setting Float Precision in Lua.c, KHMan
- Re: Setting Float Precision in Lua.c, Albert Chan

Prev by Date: Re: Lua code formatter
Next by Date: Re: Setting Float Precision in Lua.c
Previous by thread: Re: Setting Float Precision in Lua.c
Next by thread: Re: Setting Float Precision in Lua.c
Index(es):
- Date
- Thread