delta/nettle.git - git.lysator.liu.se: nettle/nettle.git

	Commit message	Author	Age	Files	Lines
*	arm: Implement sha256_compress_n	Niels Möller	2022-07-26	2	-36/+80
\|
*	Delete the ARM Neon code doing a single block chacha. [delete-1-way-neon]	Niels Möller	2021-01-25	2	-181/+0
\|
*	Delete the ARM Neon code doing a single block salsa20.	Niels Möller	2021-01-25	2	-243/+0
\|
*	arm: Unify neon asm for big- and little-endian modes	Michael Weiser	2021-01-13	5	-89/+95
\|	Switch arm neon assembler routines to endianness-agnostic loads and stores where possible to avoid modifications to the rest of the code. This involves switching to vld1.32 for loading consecutive 32-bit words in host endianness as well as vst1.8 for storing back to memory in little-endian order as required by the caller.
\|	Where necessary, r3 is used to store the precalculated offset into the source vector for the secondary load operations.
\|	vstm is kept for little-endian platforms because it is faster than vst1 on most ARM implementations.
\|	vst1.x (at least on the Allwinner A20 Cortex-A7 implementation) seems to interfere with itself on subsequent calls, slowing it down further. So we reschedule some instructions to do stores as soon as results become available to have some other calculations or loads before the next vst1.x. This reliably saves two additional cycles per block on salsa20 and chacha which would otherwise be incurred.
\|	vld1.x does not seem to suffer from this or at least not to a level where two consecutive vld1.x run slower than an equivalent vldm. Rescheduling them similarly did not improve performance beyond that of vldm.
\|	Signed-off-by: Michael Weiser <michael.weiser@gmx.de>
*	arm: Fix comment typo	Niels Möller	2020-12-21	1	-1/+1
\|	Spotted by Michael Weiser
*	Update ARM mod and redc functions	Niels Möller	2020-10-31	5	-88/+99
\|
*	Use default m4 quote character in asm files, part 2	Niels Möller	2020-09-13	37	-602/+602
\|	Update arm files.
*	Rearrange chacha_crypt and chacha_crypt32, enabling fat builds to use chacha_3core [chacha-3core-neon]	Niels Möller	2020-07-14	1	-0/+36
\|
*	Fix (arm neon) _nettle_chacha_3core32	Niels Möller	2020-07-14	1	-1/+0
\|
*	Keep counter words in registers	Niels Möller	2020-07-13	1	-13/+17
\|
*	Reschedule first permutation	Niels Möller	2020-07-13	1	-12/+10
\|
*	Reschedule second permutation	Niels Möller	2020-07-13	1	-11/+9
\|
*	Reschedule second qround	Niels Möller	2020-07-13	1	-26/+29
\|
*	Reschedule first qround	Niels Möller	2020-07-13	1	-26/+29
\|
*	Three-way interleaving of chacha on Neon	Niels Möller	2020-07-13	1	-0/+242
\|
*	Rearrange salsa20, enabling ARM fat builds to use sala20_2core. [salsa20-2core-neon]	Niels Möller	2020-07-08	1	-0/+36
\|
*	Two-way interleaving of salsa20 on Neon	Niels Möller	2020-07-06	1	-0/+206
\|
*	arm: Fix memxor for non-armv6+ big-endian systems	Michael Weiser	2020-03-12	2	-19/+25
\|	ARM assembly adjustments for big-endian systems contained armv6+-only instructions (rev) in generic arm memxor code. Replace those with an actual conversion of the leftover byte store routines for big-endian systems. This also provides a slight optimisation by removing the additional instruction as well as increased symmetry between little- and big-endian implementations.
\|	Signed-off-by: Michael Weiser <michael.weiser@gmx.de>
*	ecc: rename functions to contain curve names instead of bits	Dmitry Eremin-Solenikov	2020-01-10	5	-15/+15
\|	Rename curve functions to use curve names instead of just bits. Otherwise function names can easily become confusing after adding other curves.
\|	Signed-off-by: Dmitry Eremin-Solenikov <dbaryshkov@gmail.com>
*	ecc: prefix optimized ECC function names with underscore	Dmitry Eremin-Solenikov	2020-01-10	5	-10/+10
\|	There is no need to keep optimized ECC functions in public namespace (nettle_), move them to internal namespace (_nettle_).
\|	Signed-off-by: Dmitry Eremin-Solenikov <dbaryshkov@gmail.com>
*	ecc: rename source files with curves data	Dmitry Eremin-Solenikov	2020-01-10	5	-10/+10
\|	In preparation to adding GOST curves support, rename source files and use curve name as eccdata parameter.
\|	Signed-off-by: Dmitry Eremin-Solenikov <dbaryshkov@gmail.com>
*	Add --enable-fat support for arm neon chacha20	Yuriy M. Kaminskiy	2019-01-19	1	-0/+37
\|	On BCM2837B0 (Cortex-A53) @1.4GHz (Raspberry Pi 3B+),
\|	Before:
\|	`gnutls-cli --benchmark-ciphers`
\|	CHACHA20-POLY1305 (16384) 51.54 MB/sec
\|	`gnutls-cli --benchmark-tls-ciphers`:
\|	ECDHE_RSA_CHACHA20_POLY1305 (payload 1400) 21.31 MB/sec
\|	ECDHE_RSA_CHACHA20_POLY1305 (payload 15360) 24.60 MB/sec
\|	`nettle-benchmark`
\|	chacha encrypt 71.90
\|	chacha decrypt 71.89
\|	chacha_poly1305 encrypt 48.17
\|	chacha_poly1305 decrypt 48.17
\|	chacha_poly1305 update 146.03
\|	After:
\|	`gnutls-cli --benchmark-ciphers`
\|	CHACHA20-POLY1305 (16384) 68.44 MB/sec
\|	`gnutls-cli --benchmark-tls-ciphers`:
\|	ECDHE_RSA_CHACHA20_POLY1305 (payload 1400) 27.25 MB/sec
\|	ECDHE_RSA_CHACHA20_POLY1305 (payload 15360) 32.41 MB/sec
\|	`nettle-benchmark`
\|	chacha encrypt 106.00
\|	chacha decrypt 105.94
\|	chacha_poly1305 encrypt 65.94
\|	chacha_poly1305 decrypt 65.96
\|	chacha_poly1305 update 175.24
*	Fix arm fat setup for nettle_sha1_compress.	Niels Möller	2018-07-12	1	-1/+1
\|
*	abi: explicitly export intended symbols and hide others	Nikos Mavrogiannopoulos	2018-07-07	2	-4/+4
\|	This adds all exported symbols in the map files explicitly under the following rules:
\|	- Symbols mentioned in internal headers go in a section which is valid only for testing, and linking with these symbols will break in library updates.
\|	- Symbols mentioned in installed headers go in the exported sections and are considered part of the ABI.
\|	- All internal symbols move to internal headers.
\|	- The _nettle_md5_compress and _nettle_sha1_compress become exported without the _nettle prefix, due to existing usage.
*	Document arm endianness considerations	Michael Weiser	2018-03-25	1	-1/+68
\|	Extend arm/README to provide some background on considerations to be taken into account when writing assembly routines supposed to work in big and little memory endianness.
*	Support big-endian arm in assembly code	Michael Weiser	2018-03-25	7	-42/+138
\|	Adjust sha1-compress, sha256-compress, umac-nh, chacha-core-internal, salsa20-core-internal and memxor for arm to work in big-endian mode.
*	Make nettle compile with pre-UAL arm assembler.	Marcus Hoffmann	2017-03-09	1	-1/+1
\|	See: http://infocenter.arm.com/help/index.jsp?topic=/com.arm.doc.dui0204j/Cjagjjbc.html
\|	The pre-UAL instruction is also accepted by modern assemblers.
\|	Signed-off-by: Marcus Hoffmann <m.hoffmann@cartelsol.com>
*	ARM Neon implementation of chacha.	Niels Möller	2015-06-03	1	-0/+136
\|
*	Fix syntax error in ARM offset addressing. Spotted by Jukka Ukkonen.	Niels Möller	2015-04-29	1	-2/+2
\|
*	Drop unneeded %-prefix on ARM register.	Niels Möller	2015-04-29	2	-2/+2
\|
*	In arm fat build, hook in sha1 and sha256 armv6 assembly.	Niels Möller	2015-01-25	2	-0/+74
\|
*	arm: Add .arch directives for armv6	Martin Storsjö	2015-01-25	4	-0/+6
\|	This allows building these files as part of a fat build, even if the assembler by default targets a lower architecture version.
*	Fat functions for arm neon.	Niels Möller	2015-01-21	5	-0/+185
\|
*	Initial fat library support for arm.	Niels Möller	2015-01-19	4	-0/+140
\|
*	arm: Moved memxor3 to new file, arm/memxor3.asm.	Niels Möller	2015-01-10	2	-271/+315
\|
*	Comment updates.	Niels Möller	2014-09-22	5	-8/+8
\|
*	Update copyright headers for dual licensing, arm directories.	Niels Möller	2014-04-12	17	-306/+527
\|
*	Merged aes-reorg branch.	Niels Möller	2013-09-28	4	-114/+166
\|\
\| *	arm: Adapted AES assembly to new interface.	Niels Möller	2013-05-22	4	-114/+166
\| \|
* \|	arm: Moved sha1 and sha256 assembly into v6 subdirectory.	Niels Möller	2013-05-20	2	-0/+0
\|/
*	arm: Added AES code for pre-v6 processors.	Niels Möller	2013-05-16	3	-0/+361
\|
*	arm: Renamed AES_FINAL_ROUND.	Niels Möller	2013-05-16	3	-10/+10
\|
*	arm/v6: AES microptimization.	Niels Möller	2013-05-16	3	-9/+4
\|
*	arm: Moved AES round macros from aes.m4 to the files using them.	Niels Möller	2013-05-16	3	-123/+124
\|
*	arm/v6: Alignment tweaks to aes code.	Niels Möller	2013-05-16	2	-4/+8
\|
*	New asm directory arm/v6. Moved aes functions using the uxtb instruction there.	Niels Möller	2013-05-15	2	-0/+0
\|
*	Comment updates in asm files.	Niels Möller	2013-04-26	3	-3/+2
\|
*	Updated some memxor comments.	Niels Möller	2013-04-24	1	-2/+2
\|
*	Added nettle_ prefix to memxor symbols.	Niels Möller	2013-04-24	1	-4/+4
\|
*	Reorganization of ARM assembly.	Niels Möller	2013-04-18	18	-0/+3291
	Renamed directory armv7 to arm. New subdirectory arm/neon, for files using neon instructions. configure.ac hacked to make use of neon configurable.