ffmpeg/libavcodec/x86
Christophe GISQUET 2784d18791 SBR DSP x86: implement SSE sbr_hf_g_filt
Unrolling the main loop to process, instead of 4 elements:
- 8: minor gain of 2 cycles (not worth the extra object size)
- 2: loss of 8 cycles.

Assigning STEP to a register is a loss. Output address (Y) is almost always
unaligned.

Timings:
- C (32/64 bits): 117/109 cycles
- SSE: 57 cycles

Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com>
2012-02-23 15:50:09 -08:00
..
ac3dsp_mmx.c
ac3dsp.asm ac3dsp: do not use pshufb in ac3_extract_exponents_ssse3() 2012-02-09 21:04:44 -05:00
cabac.h
cavsdsp_mmx.c
dct32_sse.asm
deinterlace.asm
dnxhd_mmx.c
dsputil_mmx_avg_template.c
dsputil_mmx_qns_template.c
dsputil_mmx_rnd_template.c
dsputil_mmx.c dsputil: Add ff_ prefix to the dsputil*_init* functions 2012-02-15 22:06:34 +02:00
dsputil_mmx.h dsputil: Add ff_ prefix to the dsputil*_init* functions 2012-02-15 22:06:34 +02:00
dsputil_yasm.asm
dsputilenc_mmx.c dsputil: Add ff_ prefix to the dsputil*_init* functions 2012-02-15 22:06:34 +02:00
dsputilenc_yasm.asm
fdct_mmx.c cosmetics: Delete empty lines at end of file. 2012-02-09 12:26:45 +01:00
fft_3dn2.c cosmetics: Delete empty lines at end of file. 2012-02-09 12:26:45 +01:00
fft_3dn.c
fft_mmx.asm
fft_sse.c cosmetics: Delete empty lines at end of file. 2012-02-09 12:26:45 +01:00
fft.c cosmetics: Delete empty lines at end of file. 2012-02-09 12:26:45 +01:00
fft.h
fmtconvert_mmx.c
fmtconvert.asm
h264_chromamc_10bit.asm
h264_chromamc.asm
h264_deblock_10bit.asm
h264_deblock.asm h264: don't use redzone in loopfilter on win64. 2012-02-19 15:31:03 -08:00
h264_i386.h
h264_idct_10bit.asm h264: mark h264_idct_add8_10 with number of XMM registers. 2012-02-07 11:37:13 -08:00
h264_idct.asm h264: manually save/restore XMM registers for functions using INIT_MMX. 2012-02-08 10:31:14 -08:00
h264_intrapred_10bit.asm
h264_intrapred_init.c
h264_intrapred.asm h264: manually save/restore XMM registers for functions using INIT_MMX. 2012-02-08 10:31:14 -08:00
h264_qpel_10bit.asm
h264_qpel_mmx.c
h264_weight_10bit.asm
h264_weight.asm
h264dsp_mmx.c
idct_mmx_xvid.c cosmetics: Delete empty lines at end of file. 2012-02-09 12:26:45 +01:00
idct_mmx.c cosmetics: Delete empty lines at end of file. 2012-02-09 12:26:45 +01:00
idct_sse2_xvid.c
idct_xvid.h
imdct36_sse.asm
lpc_mmx.c
Makefile SBR DSP x86: implement SSE sbr_sum_square_sse 2012-02-23 15:50:06 -08:00
mathops.h
mlpdsp.c
motion_est_mmx.c dsputil: Add ff_ prefix to the dsputil*_init* functions 2012-02-15 22:06:34 +02:00
mpegaudiodec_mmx.c mpegaudio: replace memcpy by SIMD code 2012-02-15 20:11:54 -08:00
mpegvideo_mmx_template.c dsputil: Add ff_ prefix to inv_zigzag_direct16 2012-02-15 22:06:42 +02:00
mpegvideo_mmx.c mpegvideo: Add ff_ prefix to nonstatic functions 2012-02-15 22:07:23 +02:00
pngdsp-init.c
pngdsp.asm pngdsp: swap argument inversion. 2012-02-07 14:32:26 -08:00
proresdsp-init.c
proresdsp.asm
rv34dsp_init.c rv34: change most "int stride" into "ptrdiff_t stride". 2012-02-20 14:58:25 -08:00
rv34dsp.asm
rv40dsp_init.c rv34: change most "int stride" into "ptrdiff_t stride". 2012-02-20 14:58:25 -08:00
rv40dsp.asm
sbrdsp_init.c SBR DSP x86: implement SSE sbr_hf_g_filt 2012-02-23 15:50:09 -08:00
sbrdsp.asm SBR DSP x86: implement SSE sbr_hf_g_filt 2012-02-23 15:50:09 -08:00
simple_idct_mmx.c
snowdsp_mmx.c
vc1dsp_mmx.c
vc1dsp_yasm.asm
vp3dsp.asm
vp8dsp-init.c
vp8dsp.asm
vp56_arith.h
vp56dsp_init.c
vp56dsp.asm
w64xmmtest.c