summaryrefslogtreecommitdiff
diff options
context:
space:
mode:
authortege <tege@gmplib.org>1996-08-08 13:57:06 +0200
committertege <tege@gmplib.org>1996-08-08 13:57:06 +0200
commitaa1e8520d6018b7ea49480b7e56cee20bfabe435 (patch)
tree245b390460e3b83ee9e6b9221eee9b1fa51f14c1
parent0b5af3c971eb8b2fa4d53b0b63a3b8b62f36b48e (diff)
downloadgmp-aa1e8520d6018b7ea49480b7e56cee20bfabe435.tar.gz
*** empty log message ***
-rw-r--r--README18
-rw-r--r--SPEED29
2 files changed, 22 insertions, 25 deletions
diff --git a/README b/README
index 3afa67745..aa449ead2 100644
--- a/README
+++ b/README
@@ -7,7 +7,7 @@ of functions, and the functions have a regular interface.
GNU MP is designed to be as fast as possible, both for small operands and for
huge operands. The speed is achieved by using fullwords as the basic
-arithmetic type, by using fast algorithms, by carefully optimized assembly
+arithmetic type, by using fast algorithms, with carefully optimized assembly
code for the most common inner loops for a lots of CPUs, and by a general
emphasis on speed (instead of simplicity or elegance).
@@ -93,25 +93,25 @@ GNU MP user:
There are five classes of functions in GNU MP.
- 1. Signed integer arithmetic functions, mpz_*. These functions are intended
+ 1. Signed integer arithmetic functions (mpz). These functions are intended
to be easy to use, with their regular interface. The associated type is
`mpz_t'.
- 2. Rational arithmetic functions, mpq_*. For now, just a small set of
+ 2. Rational arithmetic functions (mpq). For now, just a small set of
functions necessary for basic rational arithmetics. The associated type
is `mpq_t'.
- 3. Floating-point arithmetic functions, mpf_*. If the C type `double'
+ 3. Floating-point arithmetic functions (mpf). If the C type `double'
doesn't give enough precision for your application, declare your
variables as `mpf_t' instead, set the precision to any number desired,
and call the functions in the mpf class for the arithmetic operations.
4. Positive-integer, hard-to-use, very low overhead functions are in the
- mpn_* class. No memory management is performed. The caller must ensure
+ mpn class. No memory management is performed. The caller must ensure
enough space is available for the results. The set of functions is not
regular, nor is the calling interface. These functions accept input
arguments in the form of pairs consisting of a pointer to the least
- significant word, and a integral size telling how many limbs (= words)
+ significant word, and an integral size telling how many limbs (= words)
the pointer points to.
Almost all calculations, in the entire package, are made by calling these
@@ -132,6 +132,6 @@ the INSTALL file in this directory.
If you find a bug in the library, please make sure to tell us about it!
-Report bugs and propose modifications and enhancements to
-bug-gmp@prep.ai.mit.edu. What information is needed in a good bug report is
-described in the manual.
+Report bugs to bug-gmp@prep.ai.mit.edu. What information is needed in a good
+bug report is described in the manual. The same address can be used for
+suggesting modifications and enhancements.
diff --git a/SPEED b/SPEED
index e888e17e5..b96452acf 100644
--- a/SPEED
+++ b/SPEED
@@ -18,22 +18,20 @@ accurate.
------------+-----------------------------------------------------------------
DEC/Alpha |
EV4 | 4.75 cycles/64b 7.75 cycles/64b 42 cycles/64b 42 cycles/64b
- 200MHz | 2.7 Gb/s 1.65 Gb/s 20 Gb/s 20 Gb/s
+ 300MHz | 4.04 Gb/s 2.48 Gb/s 29 Gb/s 29 Gb/s
EV5 old code| 4.0 cycles/64b 5.5 cycles/64b 18 cycles/64b 18 cycles/64b
267MHz | 4.27 Gb/s 3.10 Gb/s 61 Gb/s 61 Gb/s
- 417MHz | 6.67 Gb/s 4.85 Gb/s 95 Gb/s 95 Gb/s
+ 500MHz | 8.0 Gb/s 5.82 Gb/s 114 Gb/s 114 Gb/s
EV5 tuned | 3.25 cycles/64b 4.75 cycles/64b
267MHz | 5.25 Gb/s 3.59 Gb/s as above
- 417MHz | 8.21 Gb/s 5.61 Gb/s
+ 500MHz | 9.85 Gb/s 6.74 Gb/s
------------+-----------------------------------------------------------------
Sun/SPARC |
SPARC v7 | 14.0 cycles/32b 8.5 cycles/32b 37-54 cycl/32b 37-54 cycl/32b
-SuperSPARC | 3 cycles/32b 2.5 cycles/32b 8.2 cycles/32b 10.8 cycles/32b
- 50MHz | 0.53 Gb/s 0.64 Gb/s 6.2 Gb/s 4.7 Gb/s
-**SuperSPARC| tuned addmul and submul will take: 9.25 cycles/32b
-MicroSPARC2 | ? 6.65 cycles/32b 30 cycles/32b 31.5 cycles/32b
- 110MHz | ? 0.53 Gb/s 3.75 Gb/s 3.58 Gb/s
-SuperSPARC2 | ? ? ? ?
+SuperSPARC-2| 3.0 cycles/32b 2.5 cycles/32b 8.2 cycles/32b 10.8 cycles/32b
+ 90MHz | 0.96 Gb/s 1.15 Gb/s 11.2 Gb/s 8.53 Gb/s
+MicroSPARC-2| 8.0 cycles/32b 6.5 cycles/32b 30 cycles/32b 31.5 cycles/32b
+ 110MHz | 0.53 Gb/s 3.75 Gb/s 3.58 Gb/s
Ultra/32 (4)| 2.5 cycles/32b 6.5 cycles/32b 13-27 cyc/32b 16-30 cyc/32b
182MHz | 2.33 Gb/s 0.896 Gb/s 14.3-6.9 Gb/s
Ultra/64 (5)| 2.5 cycles/64b 10 cycles/64b 40-70 cyc/64b 46-76 cyc/64b
@@ -55,7 +53,6 @@ R8000/64 | 3 cycles/64b 4.6 cycles/64b 8 cycles/64b 8 cycles/64b
75MHz | 1.6 Gb/s 1.0 Gb/s 38 Gb/s 38 Gb/s
*R10000/64 | 2 cycles/64b 3 cycles/64b 11 cycles/64b 11 cycles/64b
*200MHz | 6.4 Gb/s 4.27 Gb/s 74 Gb/s 74 Gb/s
- *250MHz | 8.0 Gb/s 5.33 Gb/s 93 Gb/s 93 Gb/s
------------+-----------------------------------------------------------------
Motorola |
MC68020 | ? 24 cycles/32b 62 cycles/32b 70 cycles/32b
@@ -73,8 +70,8 @@ PA7100LC | ? ? ? ?
PA7200 (3) | 3 cycles/32b 4 cycles/32b 7 cycles/32b 6.5 cycles/32b
100MHz | 1.07 Gb/s 0.80 14 Gb/s 15.8 Gb/s
PA7300LC | ? ? ? ?
-*PA8000 | 3 cycles/64b 4 cycles/64b 7 cycles/64b 6.5 cycles/64b
- 180MHz | 3.84 Gb/s 2.88 Gb/s 105 Gb/s 113 Gb/s
+*PA8000 | 3 cycles/64b 4 cycles/64b 7 cycles/32b 6.5 cycles/32b
+ *180MHz | 3.84 Gb/s 2.88 Gb/s 26.3 Gb/s 28.4 Gb/s
------------+-----------------------------------------------------------------
Intel/x86 |
386DX | 20 cycles/32b 17 cycles/32b 41-70 cycl/32b 50-79 cycl/32b
@@ -83,7 +80,7 @@ Intel/x86 |
486DX4 | 9.5 cycles/32b 9.25 cycles/32b 17-23 cycl/32b 20-26 cycl/32b
100MHz | 0.34 Gb/s 0.35 Gb/s 6.0-4.5 Gb/s 5.1-3.9 Gb/s
Pentium | 2/6 cycles/32b 2.5 cycles/32b 13 cycles/32b 14 cycles/32b
- 167MHz | 2.7/0.89 Gb/s 2.1 Gb/s 13.1 Gb/s 12.2 Gb/s
+ 200MHz | 3.2/1.07 Gb/s 2.56 Gb/s 15.8 Gb/s 14.6 Gb/s
Pentium Pro | 2.5 cycles/32b 3.5 cycles/32b 6 cycles/32b 9 cycles/32b
200MHz | 2.6 Gb/s 1.8 Gb/s 34 Gb/s 23 Gb/s
------------+-----------------------------------------------------------------
@@ -96,8 +93,8 @@ PPC601 (1) | 3 cycles/32b 6 cycles/32b 11-16 cycl/32b 14-19 cycl/32b
PPC601 (2) | 5 cycles/32b 6 cycles/32b 13-22 cycl/32b 16-25 cycl/32b
67MHz (2) | 0.43 Gb/s 0.36 Gb/s 5.3-3.0 Gb/s 4.3-2.7 Gb/s
PPC603 | ? ? ? ?
-*PPC604 | 2 3 2 3
- *167MHz | 57 Gb/s
+*PPC604 | 2 3 4 4
+ *225MHz | 58 Gb/s 58 Gb/s
PPC620 | ? ? ? ?
------------+-----------------------------------------------------------------
Tege |
@@ -143,7 +140,7 @@ MC88110 | 1.5 cyc/32b (6) 1.5 cycle/32b 1.5 cycles/32b 2.25 cycles/32b
------------+-----------------------------------------------------------------
Intel/x86 |
486DX4 |
-Pentium P5x | 5 cycles/32b 2 cycles/32b 11.5 cycles/32b 13 cycles/32b
+Pentium | 5 cycles/32b 2 cycles/32b 11.5 cycles/32b 13 cycles/32b
Pentium Pro | 2 cycles/32b 3 cycles/32b 4 cycles/32b 6 cycles/32b
------------+-----------------------------------------------------------------
IBM/POWER |