Different behavior of sqrt when compiled with 64 or 32 bits


I'm using the sqrt() function from the math library. When I build for 64-bit using -m64 I get the correct result, but when I build for 32-bit the behaviour is very inconsistent.

For example, on 64-bit:

double dx = 0x1.fffffffffffffp+1023;
sqrt(dx); // => 0x1.fffffffffffffp+511
sqrt(0x1.fffffffffffffp+1023);// => 0x1.fffffffffffffp+511

(which I believe is the correctly rounded result, verified with mpfr)

But on 32-bit, the same input value behaves differently:

double dx = 0x1.fffffffffffffp+1023;
sqrt(dx); // => 0x1.0p+512
sqrt(0x1.fffffffffffffp+1023); // => 0x1.fffffffffffffp+511

When the same value is passed in a variable, I get the wrong result. I checked the rounding mode before and after each call, and it is always set to round to nearest. What is the reason? I'm using gcc 4.6 on a 64-bit machine, and the options are -mfpmath=sse and -march=pentium for both the x86 and x64 cases.

You haven't said which compiler or architecture you're using, but assuming gcc on x86 / x86-64, the difference is likely down to the fact that by default gcc uses 387 floating point instructions on 32-bit x86, whereas it uses SSE instructions on x86-64.

The 387 floating point registers are 80 bits wide, whereas double is 64 bits wide. This means that intermediate results can have higher precision using the 387 instructions, which can result in a slightly different answer after rounding. (The SSE2 instructions operate on packed 64 bit doubles).

There are a few ways to change the way the compiler operates, depending on what you want:

If you use the -ffloat-store option on x86 builds, the compiler will discard extra precision whenever you store a value in a double variable.
If you use the -mfpmath=sse option on x86 builds, along with -msse2 or an -march= switch that specifies an SSE2-supporting architecture, the compiler will use SSE instructions for floating point just as on x86-64. The code will only run on CPUs that support SSE2, though (Pentium-M / Pentium 4 and later).
If you use the -mfpmath=387 option on x86-64 builds, the compiler will use 387 instructions for floating point just as on x86. This isn't recommended, though - the x86-64 ABI specifies that floating point values are passed in SSE registers, so the compiler has to do a lot of shuffling between 387 and SSE registers with this option.
