acer


MaplePrimes Activity


These are replies submitted by acer

@stanto As you mention, Maple considers the names to be possibly nonreal here, unless qualified under assumptions.

Compare what you are now asking with the results from,

simplify(e,radical) assuming a::real, Db::real;

and

simplify(e,radical) assuming real;
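
For instance, with a small made-up expression (not the e from your worksheet), the effect of those assumptions can be seen directly:

e := sqrt(a^2*Db^2):
simplify(e, radical);                                # a and Db treated as possibly nonreal
simplify(e, radical) assuming a::real, Db::real;     # may reduce further, e.g. in terms of abs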

@majd This is an interesting question, and sorry that I could not respond earlier because of vacation and the recent security lock-out of the site (my valid cookie was expired/clobbered). Presumably this is not something where pdsolve/numeric could be used instead of ArrayInterpolation. It might be possible to compute the finite differences (which fdiff does) oneself, and to apply this in a Compiled procedure after first using ArrayInterpolation to quickly generate the needed subgrids of evaluation points in a fast vectorized manner. Easier for 1D than for the 2D case. I'm afraid that I'll likely be too busy to give it a try, for some weeks.

Getting a smoother (i.e. not piecewise flat) 2nd derivative from ArrayInterpolation might mean using method=spline with degree=4 or higher. The higher the spline degree, though, the worse the 2nd derivative approximations may become for points farther from the boundary.
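
For what it's worth, a minimal 1D sketch of that option combination (with made-up data, not your actual PDE solution) might look something like:

xdata := Vector([seq(0.1*i, i=0..50)], datatype=float[8]):
ydata := map(sin, xdata):
newx := Vector([seq(0.05*i, i=0..100)], datatype=float[8]):
# quartic spline interpolation on a finer grid; a degree-4 spline's
# 2nd derivative is piecewise quadratic rather than piecewise linear
newy := CurveFitting:-ArrayInterpolation(xdata, ydata, newx, method=spline, degree=4):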

@Markiyan Hirnyk The suggestion to use rhs(sol[1]) instead of eval(k[1],sol) is not reliable in general, because in older versions of Maple (or in recent Maple if launched with --setsort=0) using rhs(sol[1]) may produce an incorrect answer.

The reason is that in some older Maple versions (or with --setsort=0) the elements of the set sol could be in a session-dependent order. The equation k[2]=... could end up as sol[1], so sol[1] is not a reliable way to extract the value from the equation k[1]=...

E.g., in Maple 11.02 (or Maple 16.0 launched with --setsort=0) I see,

> restart:
> k[2]: k[2]=666.0: k[1]=5.0:
> sol:={k[1]=5.0,k[2]=666.0};
                       sol := {k[2] = 666.0, k[1] = 5.0}

> rhs(sol[1]);
                                     666.0

> restart:
> sol:={k[1]=5.0,k[2]=666.0};
                       sol := {k[1] = 5.0, k[2] = 666.0}

> rhs(sol[1]);
                                      5.0
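
In contrast, eval picks out the value by name and so is immune to that ordering difference. In either of those sessions,

> eval(k[1], sol);
                                       5.0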

@dj_gssst You are using sol[1] and sol[2] inappropriately.

Look at what sol[1] and sol[2] are.

sol[1];

         k[1] = 0.001637935477698018732276851530308612938004131156280124580147547622858728899732152932156471141157968415

sol[2];

         k[2] = 0.8738880449255668655493761487129800277295707224229803653538821365150932040118518225052273427259289080

Those are both equations, not names or scalar values.

If you want to use the name k[1] then just use it, not sol[1]. If you want to use the value in that equation returned by fsolve then use eval(k[1],sol), and not sol[1].
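
In other words (using the sol returned by fsolve above),

eval(k[1], sol);               # just the floating-point value
eval(k[1]^2 + k[2], sol);      # eval also works for expressions involving those names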

It's somewhat irritating to wait for this site to load all the images of 2D Math that you've inlined into your posts. It would be quicker and nicer (in future) if you could please just type your comments here, along with a link to your uploaded worksheets that are heavy with 2D Math.

@jimmyinhmb There's a lot to consider here. I'll have to leave parallelization of it until another day, hope that's ok.

Your elementwise approach is mostly getting bogged down by memory management, I think. Consider the statement,

        G1[Mid..Last] := G1[Mid..Last] + exp~( c +~ H[Mid..Last] ):

I count at least five overt Vector workspaces being created there. Two to extract H[Mid..Last] and G1[Mid..Last] on the RHS. One more to do elementwise addition of `c`, and another to do elementwise exponentiation. And then another to do the final addition (prior to recopying into G1).

The second method below cuts that down to just two overt Vector workspaces, and gets some improvement. Further improvement comes from getting rid of all iterated temp workspace creation, and acting purely inplace. (A re-usable workspace or two, created just once outside the loop, can really help performance.) It seems natural to conclude that the improvements are largely due to avoiding repeated Vector creation and collection as garbage: i.e. avoidable memory management. And, indeed, this aspect of the performance improvement seems to match the reduction in kernelopts(bytesused), which is one measure of the amount of memory management.

It might be worth noting that simply putting the whole job into a single procedure that acts inplace on G, and merely Compiling that, is almost the fastest. And yet it is one of the easiest to code.

The following output is from a run with Maple 16.01 on a Core2 Duo running Windows XP. I saw some variation with regard to the relative performance of methods 3-to-5 below when run on Windows 7 or Linux 64bit.

 

restart:

(N,maxiter):=10^5, 10^3:

(Mid, Last):=floor(N/2),N; c:=1.3e-7:

50000, 100000

G:=LinearAlgebra:-RandomVector[row](N,generator=0.0..0.1e-6,outputoptions=[datatype=float[8]]):
# Make a bunch of these, to test the various results, since the job acts inplace.
G1:=copy(G):G2:=copy(G):G3:=copy(G):G4:=copy(G):G5:=copy(G):
H:=LinearAlgebra:-RandomVector[row](N,generator=0.0..0.1e-6,outputoptions=[datatype=float[8]]):

# usage: DAXPY(num,c,x,offsetx,incx,y,offsety,incy)
DAXPY:=ExternalCalling:-DefineExternal('hw_f06ecf',
                                       ExternalCalling:-ExternalLibraryName("linalg")):

EXP:=proc(start::integer[4],finish::integer[4],R::Vector(datatype=float[8]))
   local i::integer[4]:
   for i from start to finish do R[i]:=exp(R[i]); end do:
   NULL;
end proc:
cEXP:=Compiler:-Compile(EXP):

ALL:=proc(start::integer[4],finish::integer[4],R1::Vector(datatype=float[8]),
                                               R2::Vector(datatype=float[8]),
                                               c::float[8])
   local i::integer[4]:
   for i from start to finish do R1[i]:=R1[i]+exp(c+R2[i]); end do:
   NULL;
end proc:
cALL:=Compiler:-Compile(ALL):

run:=[true,true,true,true,true,true]:

if run[1] then
st,str:=time(),time[real](): bu:=kernelopts(bytesused):
   for p from 1 to maxiter do
      # This creates at least five temporary Vectors, at each iteration.
      G1[Mid..Last] := G1[Mid..Last] + exp~( c +~ H[Mid..Last] ):
   end do:
print(time()-st,time[real]()-str); print(kernelopts(bytesused)-bu);
end if:

12.359, 12.406

2001770640

if run[2] then
st,str:=time(),time[real](): bu:=kernelopts(bytesused):
   for p from 1 to maxiter do
      # This creates two temporary Vectors, at each iteration.
      G2[Mid..Last] := LinearAlgebra:-VectorAdd( G2[Mid..Last],
                                                 map[evalhf,inplace](exp, H[Mid..Last] ),
                                                 1, evalhf(exp(c)), inplace ):
   end do:
print(time()-st,time[real]()-str); print(kernelopts(bytesused)-bu);
end if:

5.875, 5.906

806437012

if run[3] then
C3:=Vector[row](N,datatype=float[8]): # We'll re-use this, after Aliasing to right length.
st,str:=time(),time[real](): bu:=kernelopts(bytesused):
   for p from 1 to maxiter do
      # This creates zero temporary Vectors, at each iteration.
      C3midlast:=ArrayTools:-Alias(C3,0,[1..Last-Mid+1]):
      ArrayTools:-Copy(Last-Mid+1,H,Mid-1,1,C3midlast):
      map[evalhf,inplace](exp,C3midlast):
      DAXPY(Last-Mid+1,evalhf(exp(c)),C3,0,1,G3,Mid-1,1);
   end do:
print(time()-st,time[real]()-str); print(kernelopts(bytesused)-bu);
end if:

3.031, 3.031

365548

if run[4] then
C4:=Vector[row](N,datatype=float[8]): # We'll re-use this, and no need to Alias it.
st,str:=time(),time[real](): bu:=kernelopts(bytesused):
   for p from 1 to maxiter do
      # This creates zero temporary Vectors, at each iteration.
      ArrayTools:-Copy(Last-Mid+1,H,Mid-1,1,C4):
      cEXP(1,Last-Mid+1,C4):
      DAXPY(Last-Mid+1,evalhf(exp(c)),C4,0,1,G4,Mid-1,1);
   end do:
print(time()-st,time[real]()-str); print(kernelopts(bytesused)-bu);
end if:

3.484, 3.500

167212

if run[5] then
st,str:=time(),time[real](): bu:=kernelopts(bytesused):
   for p from 1 to maxiter do
      # This creates zero temporary Vectors, at each iteration.
      cALL(Mid,Last,G5,H,c);
   end do:
print(time()-st,time[real]()-str); print(kernelopts(bytesused)-bu);
end if:

3.485, 3.484

31212

# Discrepancies based on using exp(c+h)=exp(c)*exp(h) may or may not
# appear if you are doing something other than just repeatedly adding
# terms involving exp of H. Your computation may do something different,
# and you'd want to check for correctness at high iteration counts. My crude
# example is for timing, and understandably for constant c the accuracy
# is affected by maxiter, the number of iterations.

LinearAlgebra:-Norm(zip(`/`,(G1-G2),G1));
LinearAlgebra:-Norm(zip(`/`,(G1-G3),G1));
LinearAlgebra:-Norm(zip(`/`,(G1-G4),G1));
LinearAlgebra:-Norm(zip(`/`,(G1-G5),G1));

0.570707844023472596e-13

0.555928563343654250e-13

0.555928563343654250e-13

0.

 

 

Download maptry3.mw
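
As an aside, the manual time(), time[real](), and kernelopts(bytesused) bookkeeping above could also be done with CodeTools:-Usage (in sufficiently recent Maple), which reports cpu time, real time, and memory figures in one call. A minimal sketch, wrapping the compiled variant in a small driver procedure (the name runall and its argument list are just for illustration):

runall := proc(V::Vector(datatype=float[8]), W::Vector(datatype=float[8]),
               Mid::integer, Last::integer, maxiter::integer, c::float)
   local p;
   for p from 1 to maxiter do
      cALL(Mid, Last, V, W, c);
   end do;
   NULL;
end proc:

CodeTools:-Usage( runall(copy(G), H, Mid, Last, maxiter, c) );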

@Alejandro Jakubi By "computing with identifiers" I meant just using them as names, not parsing them. Clearly, typeset names which look the same would also have to be the same (internally), or else mixing them arithmetically and otherwise would not be so useful.

Thanks, about the brackets.

By trying a few tricks I got some hints that Typesetting:-Typeset may not get called at all when one does right-click conversion to Atomic Identifier. That makes getting the exact same structures problematic. It'd be easier to use the same basic mechanism than to fix up the results from something else.

@AliKhan Is this what you mean?

A:=LinearAlgebra:-RandomMatrix(3,outputoptions=[shape=diagonal]);

                                  [27   0    0]
                                  [           ]
                             A := [ 0  -4    0]
                                  [           ]
                                  [ 0   0  -74]

B:=Matrix(3,3):

for i from 1 to 3 do B[i,i]:=A; end do:

B;

                [[27   0    0]                              ]
                [[           ]                              ]
                [[ 0  -4    0]        0              0      ]
                [[           ]                              ]
                [[ 0   0  -74]                              ]
                [                                           ]
                [               [27   0    0]               ]
                [               [           ]               ]
                [      0        [ 0  -4    0]        0      ]
                [               [           ]               ]
                [               [ 0   0  -74]               ]
                [                                           ]
                [                              [27   0    0]]
                [                              [           ]]
                [      0              0        [ 0  -4    0]]
                [                              [           ]]
                [                              [ 0   0  -74]]

B[2,2];

                                [27   0    0]
                                [           ]
                                [ 0  -4    0]
                                [           ]
                                [ 0   0  -74]

Most of regular LinearAlgebra will not be able to handle B, although arithmetic and some simpler things might work. (You might be able to teach LinearAlgebra[Generic] to do more...)
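
For example, the individual numeric entries remain reachable by chaining the indexing into the inner Matrix:

B[2,2][3,3];

                                     -74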

Apparently I have trouble remembering my own advice.

A better double loop in the Compilable BlockAdd procedure above would walk the entries by column (and not by row) in the innermost loop, because my example uses Fortran_order Matrices by default. I.e.,

   for j from sxj to exj do   
      for i from sxi to exi do
         x[i,j]:=x[i,j]+y[i+syi-sxi,j+syj-sxj];
      end do;
   end do;

With that change, and by reducing some of the indexing integer arithmetic, the Compiled version gets about 30% faster (though this varies with platform). It is still not as fast as the block-copy & daxpy approach. But it also might be threaded, after splitting the action by column (which I have not tried).
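
For concreteness, here is a sketch of how such a Compilable procedure might be written in full (the parameter names follow the loop fragment above; your original BlockAdd's argument list may well differ):

BlockAdd := proc(x::Matrix(datatype=float[8]), y::Matrix(datatype=float[8]),
                 sxi::integer[4], exi::integer[4],
                 sxj::integer[4], exj::integer[4],
                 syi::integer[4], syj::integer[4])
   local i::integer[4], j::integer[4];
   # innermost loop runs down a column, to match Fortran_order storage
   for j from sxj to exj do
      for i from sxi to exi do
         x[i,j] := x[i,j] + y[i+syi-sxi, j+syj-sxj];
      end do;
   end do;
   NULL;
end proc:
cBlockAdd := Compiler:-Compile(BlockAdd):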
