path: root/libavcodec/x86
| Commit message | Author | Date | Files | Lines |
|---|---|---|---|---|
| fft: mark xmm registers as clobbered in ff_imdct_calc_sse | Ramiro Polla | 2010-10-06 | 1 | -0/+1 |
| MMX, MMX2, SSE2 and SSSE3 optimizations for pred16x16/8x8_plane H264 intra | Ronald S. Bultje | 2010-10-05 | 3 | -0/+557 |
| snowdsp: Explicitly state the operand sizes | İsmail Dönmez | 2010-10-04 | 1 | -1/+1 |
| Move static inline function to a macro, so that constant propagation in | Ronald S. Bultje | 2010-09-29 | 1 | -117/+113 |
| Use sse2 variant of put_pixels16() for no_rnd also. Provides a minor speed | Eli Friedman | 2010-09-29 | 1 | -0/+1 |
| Merge b_idx and edge variables, and optimize the ASM to directly load variables | Ronald S. Bultje | 2010-09-29 | 1 | -46/+54 |
| Remove mv_mask variable. Replace the related pand -1/0 instructions by either | Ronald S. Bultje | 2010-09-29 | 1 | -6/+7 |
| Remove d_idx as a variable, and instead load it as a constant in the asm. | Ronald S. Bultje | 2010-09-29 | 1 | -32/+38 |
| Unroll inner bidir loop in h264_loop_filter_strength_mmx2(), which gets rid | Ronald S. Bultje | 2010-09-29 | 1 | -5/+19 |
| Unloop the outer loop in h264_loop_filter_strength_mmx2(), which allows | Ronald S. Bultje | 2010-09-29 | 1 | -25/+29 |
| Add d suffix to movd target register to make it work with nasm. | Reimar Döffinger | 2010-09-26 | 1 | -2/+2 |
| Split and then simplify address generation macro. | Reimar Döffinger | 2010-09-26 | 1 | -20/+22 |
| Remove unused variable. | Ronald S. Bultje | 2010-09-24 | 1 | -1/+0 |
| Unroll loop in h264_idct_add16intra_sse2(). Basically identical to r25171, this | Ronald S. Bultje | 2010-09-24 | 1 | -28/+28 |
| Unroll loop in h264_idct_add8_sse2(). This means we can inline scan8[] in the | Ronald S. Bultje | 2010-09-24 | 1 | -29/+20 |
| x86: disable SSE functions using stack when stack is not aligned | Måns Rullgård | 2010-09-21 | 2 | -2/+4 |
| x86: remove hack disabling sse2 h264 loop filter with 32-bit icc | Måns Rullgård | 2010-09-18 | 1 | -2/+1 |
| Don't access upper 32 bits of a 32-bit int on 64-bit systems. | Ronald S. Bultje | 2010-09-17 | 1 | -1/+1 |
| Properly add HAVE_YASM around yasmified symbols. Should fix compile error | Ronald S. Bultje | 2010-09-17 | 1 | -1/+9 |
| Move hadamard_diff{,16}_{mmx,mmx2,sse2,ssse3}() from inline asm to yasm, | Ronald S. Bultje | 2010-09-17 | 3 | -212/+291 |
| Move sse16_sse2() from inline asm to yasm. It is one of the functions causing | Ronald S. Bultje | 2010-09-17 | 3 | -62/+90 |
| Rename h264_idct_sse2.asm to h264_idct.asm; move inline IDCT asm from | Ronald S. Bultje | 2010-09-14 | 4 | -584/+905 |
| LGPL SSE2 H.264 iDCT | Jason Garrett-Glaser | 2010-09-10 | 3 | -14/+14 |
| Move mm_support() from libavcodec to libavutil, make it a public | Stefano Sabatini | 2010-09-08 | 16 | -167/+26 |
| Use "d" suffix for general-purpose registers used with movd. | Reimar Döffinger | 2010-09-05 | 4 | -30/+30 |
| Rename FF_MM_ symbols related to CPU features flags as AV_CPU_FLAG_ | Stefano Sabatini | 2010-09-04 | 15 | -106/+108 |
| Port latest x264 deblock asm (before they moved to using NV12 as internal | Ronald S. Bultje | 2010-09-03 | 5 | -325/+230 |
| Fix typo in r25019. | Eli Friedman | 2010-09-01 | 1 | -1/+1 |
| Unscrew breakage after my last commit because of symbol prefixes. | Ronald S. Bultje | 2010-09-01 | 1 | -8/+8 |
| Rename h264_weight_sse2.asm to h264_weight.asm; add 16x8/8x16/8x4 non-square | Ronald S. Bultje | 2010-09-01 | 4 | -291/+427 |
| Split h264dsp_mmx.c (which was #included in dsputil_mmx.c) in h264_qpel_mmx.c, | Ronald S. Bultje | 2010-09-01 | 5 | -1319/+1343 |
| Fix vertical align. | Ronald S. Bultje | 2010-08-31 | 1 | -1/+1 |
| Fix compilation failure if yasm is disabled (missing vp3 symbols). | Ronald S. Bultje | 2010-08-30 | 1 | -3/+3 |
| Split intra prediction initialization (i.e. assigning of function pointers) | Ronald S. Bultje | 2010-08-30 | 3 | -83/+103 |
| Move H264 chroma MC from inline asm to yasm. This fixes VP3/5/6 and VC-1 | Ronald S. Bultje | 2010-08-30 | 7 | -718/+754 |
| Move VP3 IDCT functions from inline ASM to YASM. This fixes part of the VP3/5/6 | Ronald S. Bultje | 2010-08-30 | 7 | -701/+637 |
| Put ff_ prefix on non-static {put_signed,put,add}_pixels_clamped_mmx() | Ronald S. Bultje | 2010-08-30 | 8 | -27/+27 |
| cosmetics in imdct_sse | Loren Merritt | 2010-08-28 | 1 | -25/+20 |
| Fix typos when converting inline asm to yasm, fixes MMX-only fate-ea-vp61. | Ronald S. Bultje | 2010-08-26 | 1 | -5/+5 |
| Revert r24931, it broke Win32 and some BSD compiles (yay fate). | Ronald S. Bultje | 2010-08-25 | 1 | -1/+0 |
| Mark xmm6 and xmm7 as clobbered in ff_vp3_idct_sse2(), which is contributing | Ronald S. Bultje | 2010-08-25 | 1 | -0/+1 |
| VP6: fix vp6_filter_diag4_mmx/sse on 64-bit | Måns Rullgård | 2010-08-25 | 1 | -0/+3 |
| Move vp6_filter_diag4() x86 SIMD code from inline ASM to YASM. This should | Ronald S. Bultje | 2010-08-25 | 7 | -270/+178 |
| Move vp6_filter_diag4() from DSPContext to VP56DSPContext. | Ronald S. Bultje | 2010-08-25 | 3 | -12/+46 |
| Remove global mm_flags variable | Måns Rullgård | 2010-08-24 | 10 | -10/+14 |
| Mark xmm registers as clobbered in simple loopfilter. Should fix the last | Ronald S. Bultje | 2010-08-24 | 1 | -11/+11 |
| imdct/x86: Use "s->mdct_size" instead of "1 << s->mdct_bits". | Alex Converse | 2010-08-23 | 2 | -3/+3 |
| Fix segfaults in VP8 SIMD code on Win64 (and FATE/win64 failures). | Ronald S. Bultje | 2010-08-23 | 1 | -14/+14 |
| Convert ff_imdct_half_sse() to yasm. | Alex Converse | 2010-08-22 | 2 | -108/+195 |
| VP5/6/8: ~7% faster arithmetic decoding | Jason Garrett-Glaser | 2010-08-12 | 1 | -1/+1 |