summaryrefslogtreecommitdiff
path: root/lib/freebl/mpi/doc/timing.txt
blob: 58f37c9dff12562e40f2777eb83da705bed70b4f (plain)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
MPI Library Timing Tests

Hardware/OS
(A) SGI O2 1 x MIPS R10000 250MHz IRIX 6.5.3
(B) IBM RS/6000 43P-240 1 x PowerPC 603e 223MHz AIX 4.3
(C) Dell GX1/L+ 1 x Pentium III 550MHz Linux 2.2.12-20
(D) PowerBook G3 1 x PowerPC 750 266MHz LinuxPPC 2.2.6-15apmac
(E) PowerBook G3 1 x PowerPC 750 266MHz MacOS 8.5.1
(F) PowerBook G3 1 x PowerPC 750 400MHz MacOS 9.0.2

Compiler
(1) MIPSpro C 7.2.1 -O3 optimizations
(2) GCC 2.95.1 -O3 optimizations
(3) IBM AIX xlc -O3 optimizations (version unknown)
(4) EGCS 2.91.66 -O3 optimizations
(5) Metrowerks CodeWarrior 5.0 C, all optimizations
(6) MIPSpro C 7.30 -O3 optimizations
(7) same as (6), with optimized libmalloc.so

Timings are given in seconds, computed using the C library's clock()
function.  The first column gives the hardware and compiler
configuration used for the test. The second column indicates the
number of tests that were aggregated to get the statistics for that
size.  These were compiled using 16 bit digits.

Source data were generated randomly using a fixed seed, so they should
be internally consistent, but may vary on different systems depending
on the C library.  Also, since the resolution of the timer accessed by
clock() varies, there may be some variance in the precision of these
measurements.

Prime Generation (primegen)

128 bits:
A1      200     min=0.03, avg=0.19, max=0.72, sum=38.46
A2      200     min=0.02, avg=0.16, max=0.62, sum=32.55
B3      200     min=0.01, avg=0.07, max=0.22, sum=13.29
C4      200     min=0.00, avg=0.03, max=0.20, sum=6.14
D4      200     min=0.00, avg=0.05, max=0.33, sum=9.70
A6      200     min=0.01, avg=0.09, max=0.36, sum=17.48
A7      200     min=0.00, avg=0.05, max=0.24, sum=10.07

192 bits:
A1      200     min=0.05, avg=0.45, max=3.13, sum=89.96
A2      200     min=0.04, avg=0.39, max=2.61, sum=77.55
B3      200     min=0.02, avg=0.18, max=1.25, sum=36.97
C4      200     min=0.01, avg=0.09, max=0.33, sum=18.24
D4      200     min=0.02, avg=0.15, max=0.54, sum=29.63
A6      200     min=0.02, avg=0.24, max=1.70, sum=47.84
A7      200     min=0.01, avg=0.15, max=1.05, sum=30.88

256 bits:
A1      200     min=0.08, avg=0.92, max=6.13, sum=184.79
A2      200     min=0.06, avg=0.76, max=5.03, sum=151.11
B3      200     min=0.04, avg=0.41, max=2.68, sum=82.35
C4      200     min=0.02, avg=0.19, max=0.69, sum=37.91
D4      200     min=0.03, avg=0.31, max=1.15, sum=63.00
A6      200     min=0.04, avg=0.48, max=3.13, sum=95.46
A7      200     min=0.03, avg=0.37, max=2.36, sum=73.60

320 bits:
A1      200     min=0.11, avg=1.59, max=6.14, sum=318.81
A2      200     min=0.09, avg=1.27, max=4.93, sum=254.03
B3      200     min=0.07, avg=0.82, max=3.13, sum=163.80
C4      200     min=0.04, avg=0.44, max=1.91, sum=87.59
D4      200     min=0.06, avg=0.73, max=3.22, sum=146.73
A6      200     min=0.07, avg=0.93, max=3.50, sum=185.01
A7      200     min=0.05, avg=0.76, max=2.94, sum=151.78

384 bits:
A1      200     min=0.16, avg=2.69, max=11.41, sum=537.89
A2      200     min=0.13, avg=2.15, max=9.03, sum=429.14
B3      200     min=0.11, avg=1.54, max=6.49, sum=307.78
C4      200     min=0.06, avg=0.81, max=4.84, sum=161.13
D4      200     min=0.10, avg=1.38, max=8.31, sum=276.81
A6      200     min=0.11, avg=1.73, max=7.36, sum=345.55
A7      200     min=0.09, avg=1.46, max=6.12, sum=292.02

448 bits:
A1      200     min=0.23, avg=3.36, max=15.92, sum=672.63
A2      200     min=0.17, avg=2.61, max=12.25, sum=522.86
B3      200     min=0.16, avg=2.10, max=9.83, sum=420.86
C4      200     min=0.09, avg=1.44, max=7.64, sum=288.36
D4      200     min=0.16, avg=2.50, max=13.29, sum=500.17
A6      200     min=0.15, avg=2.31, max=10.81, sum=461.58
A7      200     min=0.14, avg=2.03, max=9.53, sum=405.16

512 bits:
A1      200     min=0.30, avg=6.12, max=22.18, sum=1223.35
A2      200     min=0.25, avg=4.67, max=16.90, sum=933.18
B3      200     min=0.23, avg=4.13, max=14.94, sum=825.45
C4      200     min=0.13, avg=2.08, max=9.75, sum=415.22
D4      200     min=0.24, avg=4.04, max=20.18, sum=808.11
A6      200     min=0.22, avg=4.47, max=16.19, sum=893.83
A7      200     min=0.20, avg=4.03, max=14.65, sum=806.02

Modular Exponentation (metime)

The following results are aggregated from 200 pseudo-randomly
generated tests, based on a fixed seed. 

                      base, exponent, and modulus size (bits)
P/C       128   192   256   320   384   448   512   640   768   896  1024
------- -----------------------------------------------------------------
A1      0.015 0.027 0.047 0.069 0.098 0.133 0.176 0.294 0.458 0.680 1.040
A2      0.013 0.024 0.037 0.053 0.077 0.102 0.133 0.214 0.326 0.476 0.668
B3      0.005 0.011 0.021 0.036 0.056 0.084 0.121 0.222 0.370 0.573 0.840
C4      0.002 0.006 0.011 0.020 0.032 0.048 0.069 0.129 0.223 0.344 0.507
D4      0.004 0.010 0.019 0.034 0.056 0.085 0.123 0.232 0.390 0.609 0.899
E5      0.007 0.015 0.031 0.055 0.088 0.133 0.183 0.342 0.574 0.893 1.317
A6      0.008 0.016 0.038 0.042 0.064 0.093 0.133 0.239 0.393 0.604 0.880
A7      0.005 0.011 0.020 0.036 0.056 0.083 0.121 0.223 0.374 0.583 0.855

Multiplication and Squaring tests, (mulsqr)

The following results are aggregated from 500000 pseudo-randomly
generated tests, based on a per-run wall-clock seed.  Times are given
in seconds, except where indicated in microseconds (us).

(A1)

bits    multiply    square  ad  percent time/mult   time/square
64      9.33        9.15    >   1.9     18.7us      18.3us
128     10.88       10.44   >   4.0     21.8us      20.9us
192     13.30       11.89   >   10.6    26.7us      23.8us
256     14.88       12.64   >   15.1    29.8us      25.3us
320     18.64       15.01   >   19.5    37.3us      30.0us
384     23.11       17.70   >   23.4    46.2us      35.4us
448     28.28       20.88   >   26.2    56.6us      41.8us
512     34.09       24.51   >   28.1    68.2us      49.0us
640     47.86       33.25   >   30.5    95.7us      66.5us
768     64.91       43.54   >   32.9    129.8us     87.1us
896     84.49       55.48   >   34.3    169.0us     111.0us
1024    107.25      69.21   >   35.5    214.5us     138.4us
1536    227.97      141.91  >   37.8    456.0us     283.8us
2048    394.05      242.15  >   38.5    788.1us     484.3us

(A2)

bits    multiply    square  ad  percent time/mult   time/square
64      7.87        7.95    <   1.0     15.7us      15.9us
128     9.40        9.19    >   2.2     18.8us      18.4us
192     11.15       10.59   >   5.0     22.3us      21.2us
256     12.02       11.16   >   7.2     24.0us      22.3us
320     14.62       13.43   >   8.1     29.2us      26.9us
384     17.72       15.80   >   10.8    35.4us      31.6us
448     21.24       18.51   >   12.9    42.5us      37.0us
512     25.36       21.78   >   14.1    50.7us      43.6us
640     34.57       29.00   >   16.1    69.1us      58.0us
768     46.10       37.60   >   18.4    92.2us      75.2us
896     58.94       47.72   >   19.0    117.9us     95.4us
1024    73.76       59.12   >   19.8    147.5us     118.2us
1536    152.00      118.80  >   21.8    304.0us     237.6us
2048    259.41      199.57  >   23.1    518.8us     399.1us

(B3)

bits    multiply    square  ad  percent time/mult   time/square
64      2.60        2.47    >   5.0     5.20us      4.94us
128     4.43        4.06    >   8.4     8.86us      8.12us
192     7.03        6.10    >   13.2    14.1us      12.2us
256     10.44       8.59    >   17.7    20.9us      17.2us
320     14.44       11.64   >   19.4    28.9us      23.3us
384     19.12       15.08   >   21.1    38.2us      30.2us
448     24.55       19.09   >   22.2    49.1us      38.2us
512     31.03       23.53   >   24.2    62.1us      47.1us
640     45.05       33.80   >   25.0    90.1us      67.6us
768     63.02       46.05   >   26.9    126.0us     92.1us
896     83.74       60.29   >   28.0    167.5us     120.6us
1024    106.73      76.65   >   28.2    213.5us     153.3us
1536    228.94      160.98  >   29.7    457.9us     322.0us
2048    398.08      275.93  >   30.7    796.2us     551.9us

(C4)

bits    multiply    square  ad  percent time/mult   time/square
64      1.34        1.28    >   4.5     2.68us      2.56us
128     2.76        2.59    >   6.2     5.52us      5.18us
192     4.52        4.16    >   8.0     9.04us      8.32us
256     6.64        5.99    >   9.8     13.3us      12.0us
320     9.20        8.13    >   11.6    18.4us      16.3us
384     12.01       10.58   >   11.9    24.0us      21.2us
448     15.24       13.33   >   12.5    30.5us      26.7us
512     19.02       16.46   >   13.5    38.0us      32.9us
640     27.56       23.54   >   14.6    55.1us      47.1us
768     37.89       31.78   >   16.1    75.8us      63.6us
896     49.24       41.42   >   15.9    98.5us      82.8us
1024    62.59       52.18   >   16.6    125.2us     104.3us
1536    131.66      107.72  >   18.2    263.3us     215.4us
2048    226.45      182.95  >   19.2    453.0us     365.9us

(A7)

bits    multiply    square  ad  percent time/mult   time/square
64      1.74        1.71    >   1.7     3.48us      3.42us
128     3.48        2.96    >   14.9    6.96us      5.92us
192     5.74        4.60    >   19.9    11.5us      9.20us
256     8.75        6.61    >   24.5    17.5us      13.2us
320     12.5        8.99    >   28.1    25.0us      18.0us
384     16.9        11.9    >   29.6    33.8us      23.8us
448     22.2        15.2    >   31.7    44.4us      30.4us
512     28.3        19.0    >   32.7    56.6us      38.0us
640     42.4        28.0    >   34.0    84.8us      56.0us
768     59.4        38.5    >   35.2    118.8us     77.0us
896     79.5        51.2    >   35.6    159.0us     102.4us
1024    102.6	    65.5    >	36.2	205.2us	    131.0us
1536    224.3	    140.6   >	37.3	448.6us	    281.2us
2048    393.4	    244.3   >	37.9	786.8us	    488.6us

------------------------------------------------------------------
 This Source Code Form is subject to the terms of the Mozilla Public
 # License, v. 2.0. If a copy of the MPL was not distributed with this
 # file, You can obtain one at http://mozilla.org/MPL/2.0/.