diff options
author | tege <tege@gmplib.org> | 1996-08-08 13:57:06 +0200 |
---|---|---|
committer | tege <tege@gmplib.org> | 1996-08-08 13:57:06 +0200 |
commit | aa1e8520d6018b7ea49480b7e56cee20bfabe435 (patch) | |
tree | 245b390460e3b83ee9e6b9221eee9b1fa51f14c1 | |
parent | 0b5af3c971eb8b2fa4d53b0b63a3b8b62f36b48e (diff) | |
download | gmp-aa1e8520d6018b7ea49480b7e56cee20bfabe435.tar.gz |
*** empty log message ***
-rw-r--r-- | README | 18 | ||||
-rw-r--r-- | SPEED | 29 |
2 files changed, 22 insertions, 25 deletions
@@ -7,7 +7,7 @@ of functions, and the functions have a regular interface. GNU MP is designed to be as fast as possible, both for small operands and for huge operands. The speed is achieved by using fullwords as the basic -arithmetic type, by using fast algorithms, by carefully optimized assembly +arithmetic type, by using fast algorithms, with carefully optimized assembly code for the most common inner loops for a lots of CPUs, and by a general emphasis on speed (instead of simplicity or elegance). @@ -93,25 +93,25 @@ GNU MP user: There are five classes of functions in GNU MP. - 1. Signed integer arithmetic functions, mpz_*. These functions are intended + 1. Signed integer arithmetic functions (mpz). These functions are intended to be easy to use, with their regular interface. The associated type is `mpz_t'. - 2. Rational arithmetic functions, mpq_*. For now, just a small set of + 2. Rational arithmetic functions (mpq). For now, just a small set of functions necessary for basic rational arithmetics. The associated type is `mpq_t'. - 3. Floating-point arithmetic functions, mpf_*. If the C type `double' + 3. Floating-point arithmetic functions (mpf). If the C type `double' doesn't give enough precision for your application, declare your variables as `mpf_t' instead, set the precision to any number desired, and call the functions in the mpf class for the arithmetic operations. 4. Positive-integer, hard-to-use, very low overhead functions are in the - mpn_* class. No memory management is performed. The caller must ensure + mpn class. No memory management is performed. The caller must ensure enough space is available for the results. The set of functions is not regular, nor is the calling interface. These functions accept input arguments in the form of pairs consisting of a pointer to the least - significant word, and a integral size telling how many limbs (= words) + significant word, and an integral size telling how many limbs (= words) the pointer points to. Almost all calculations, in the entire package, are made by calling these @@ -132,6 +132,6 @@ the INSTALL file in this directory. If you find a bug in the library, please make sure to tell us about it! -Report bugs and propose modifications and enhancements to -bug-gmp@prep.ai.mit.edu. What information is needed in a good bug report is -described in the manual. +Report bugs to bug-gmp@prep.ai.mit.edu. What information is needed in a good +bug report is described in the manual. The same address can be used for +suggesting modifications and enhancements. @@ -18,22 +18,20 @@ accurate. ------------+----------------------------------------------------------------- DEC/Alpha | EV4 | 4.75 cycles/64b 7.75 cycles/64b 42 cycles/64b 42 cycles/64b - 200MHz | 2.7 Gb/s 1.65 Gb/s 20 Gb/s 20 Gb/s + 300MHz | 4.04 Gb/s 2.48 Gb/s 29 Gb/s 29 Gb/s EV5 old code| 4.0 cycles/64b 5.5 cycles/64b 18 cycles/64b 18 cycles/64b 267MHz | 4.27 Gb/s 3.10 Gb/s 61 Gb/s 61 Gb/s - 417MHz | 6.67 Gb/s 4.85 Gb/s 95 Gb/s 95 Gb/s + 500MHz | 8.0 Gb/s 5.82 Gb/s 114 Gb/s 114 Gb/s EV5 tuned | 3.25 cycles/64b 4.75 cycles/64b 267MHz | 5.25 Gb/s 3.59 Gb/s as above - 417MHz | 8.21 Gb/s 5.61 Gb/s + 500MHz | 9.85 Gb/s 6.74 Gb/s ------------+----------------------------------------------------------------- Sun/SPARC | SPARC v7 | 14.0 cycles/32b 8.5 cycles/32b 37-54 cycl/32b 37-54 cycl/32b -SuperSPARC | 3 cycles/32b 2.5 cycles/32b 8.2 cycles/32b 10.8 cycles/32b - 50MHz | 0.53 Gb/s 0.64 Gb/s 6.2 Gb/s 4.7 Gb/s -**SuperSPARC| tuned addmul and submul will take: 9.25 cycles/32b -MicroSPARC2 | ? 6.65 cycles/32b 30 cycles/32b 31.5 cycles/32b - 110MHz | ? 0.53 Gb/s 3.75 Gb/s 3.58 Gb/s -SuperSPARC2 | ? ? ? ? +SuperSPARC-2| 3.0 cycles/32b 2.5 cycles/32b 8.2 cycles/32b 10.8 cycles/32b + 90MHz | 0.96 Gb/s 1.15 Gb/s 11.2 Gb/s 8.53 Gb/s +MicroSPARC-2| 8.0 cycles/32b 6.5 cycles/32b 30 cycles/32b 31.5 cycles/32b + 110MHz | 0.53 Gb/s 3.75 Gb/s 3.58 Gb/s Ultra/32 (4)| 2.5 cycles/32b 6.5 cycles/32b 13-27 cyc/32b 16-30 cyc/32b 182MHz | 2.33 Gb/s 0.896 Gb/s 14.3-6.9 Gb/s Ultra/64 (5)| 2.5 cycles/64b 10 cycles/64b 40-70 cyc/64b 46-76 cyc/64b @@ -55,7 +53,6 @@ R8000/64 | 3 cycles/64b 4.6 cycles/64b 8 cycles/64b 8 cycles/64b 75MHz | 1.6 Gb/s 1.0 Gb/s 38 Gb/s 38 Gb/s *R10000/64 | 2 cycles/64b 3 cycles/64b 11 cycles/64b 11 cycles/64b *200MHz | 6.4 Gb/s 4.27 Gb/s 74 Gb/s 74 Gb/s - *250MHz | 8.0 Gb/s 5.33 Gb/s 93 Gb/s 93 Gb/s ------------+----------------------------------------------------------------- Motorola | MC68020 | ? 24 cycles/32b 62 cycles/32b 70 cycles/32b @@ -73,8 +70,8 @@ PA7100LC | ? ? ? ? PA7200 (3) | 3 cycles/32b 4 cycles/32b 7 cycles/32b 6.5 cycles/32b 100MHz | 1.07 Gb/s 0.80 14 Gb/s 15.8 Gb/s PA7300LC | ? ? ? ? -*PA8000 | 3 cycles/64b 4 cycles/64b 7 cycles/64b 6.5 cycles/64b - 180MHz | 3.84 Gb/s 2.88 Gb/s 105 Gb/s 113 Gb/s +*PA8000 | 3 cycles/64b 4 cycles/64b 7 cycles/32b 6.5 cycles/32b + *180MHz | 3.84 Gb/s 2.88 Gb/s 26.3 Gb/s 28.4 Gb/s ------------+----------------------------------------------------------------- Intel/x86 | 386DX | 20 cycles/32b 17 cycles/32b 41-70 cycl/32b 50-79 cycl/32b @@ -83,7 +80,7 @@ Intel/x86 | 486DX4 | 9.5 cycles/32b 9.25 cycles/32b 17-23 cycl/32b 20-26 cycl/32b 100MHz | 0.34 Gb/s 0.35 Gb/s 6.0-4.5 Gb/s 5.1-3.9 Gb/s Pentium | 2/6 cycles/32b 2.5 cycles/32b 13 cycles/32b 14 cycles/32b - 167MHz | 2.7/0.89 Gb/s 2.1 Gb/s 13.1 Gb/s 12.2 Gb/s + 200MHz | 3.2/1.07 Gb/s 2.56 Gb/s 15.8 Gb/s 14.6 Gb/s Pentium Pro | 2.5 cycles/32b 3.5 cycles/32b 6 cycles/32b 9 cycles/32b 200MHz | 2.6 Gb/s 1.8 Gb/s 34 Gb/s 23 Gb/s ------------+----------------------------------------------------------------- @@ -96,8 +93,8 @@ PPC601 (1) | 3 cycles/32b 6 cycles/32b 11-16 cycl/32b 14-19 cycl/32b PPC601 (2) | 5 cycles/32b 6 cycles/32b 13-22 cycl/32b 16-25 cycl/32b 67MHz (2) | 0.43 Gb/s 0.36 Gb/s 5.3-3.0 Gb/s 4.3-2.7 Gb/s PPC603 | ? ? ? ? -*PPC604 | 2 3 2 3 - *167MHz | 57 Gb/s +*PPC604 | 2 3 4 4 + *225MHz | 58 Gb/s 58 Gb/s PPC620 | ? ? ? ? ------------+----------------------------------------------------------------- Tege | @@ -143,7 +140,7 @@ MC88110 | 1.5 cyc/32b (6) 1.5 cycle/32b 1.5 cycles/32b 2.25 cycles/32b ------------+----------------------------------------------------------------- Intel/x86 | 486DX4 | -Pentium P5x | 5 cycles/32b 2 cycles/32b 11.5 cycles/32b 13 cycles/32b +Pentium | 5 cycles/32b 2 cycles/32b 11.5 cycles/32b 13 cycles/32b Pentium Pro | 2 cycles/32b 3 cycles/32b 4 cycles/32b 6 cycles/32b ------------+----------------------------------------------------------------- IBM/POWER | |