diff options
author | Monty <xiphmont@xiph.org> | 2002-09-20 00:10:31 +0000 |
---|---|---|
committer | Monty <xiphmont@xiph.org> | 2002-09-20 00:10:31 +0000 |
commit | 1dad9587e30c61a8458fdda12da6248bf130c691 (patch) | |
tree | 028314f1b8071251d480ea836eb1a14e12c9f1f5 /Makefile.am | |
parent | 55b25a7aadfb989f50a32cc7d146910de8d6f6cb (diff) | |
download | tremor-1dad9587e30c61a8458fdda12da6248bf130c691.tar.gz |
Latest improvements from Nicolas Pitre. Reviewed by Monty
From Nicolas's notes:
- Includes my previous patch with interpolation code for correct accuracy
with all block sizes.
- Interlaces sin and cos values in the lookup table to reduce register
pressure since only one pointer is required to walk the table instead of
two. This also accounts for better cache locality.
- Split the lookup table into two tables since half of it (one value every
two) is only used in separate section of the code and only with large
block sizes. Therefore the table size used for the common case is reduced
by 2 accounting for yet better cache usage.
- Abstracted all cross products throughout the code so they can be easily
optimized. First this prevents redundant register reloads on ARM due to
the implicit memory access ordering, next this allowed for the
opportunity to hook some inline assembly to perform the actual operation.
- Fix layout of current assembly in asm_arm.h to match GCC's output (more
enjoyable to read when inspecting the final assembly) plus some
constraint correctness issues.
- Added a memory barrier macro to force the compiler not to cache values
into registers or on the stack in some cases.
- Reordered some code for better ARM assembly generation by
the compiler.
git-svn-id: https://svn.xiph.org/trunk/Tremor@3923 0101bb08-14d6-0310-b084-bc0e0c8e3800
Diffstat (limited to 'Makefile.am')
0 files changed, 0 insertions, 0 deletions