ffmpeg/libavcodec/x86
Christophe Gisquet 2aef3d66c9 SBR DSP x86: implement SSE sbr_hf_gen
Start and end indices are multiples of 2, which guarantees aligned access.
This also allows generating 4 floats per loop iteration while keeping the
alignment throughout.
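
The alignment argument above can be illustrated with a small plain-C sketch. It is not the code from this commit: `is_aligned16` and `scale_pairs` are hypothetical names, the kernel is a trivial scaling stand-in rather than the real sbr_hf_gen arithmetic, and it assumes each complex sample is a pair of 4-byte floats stored in a buffer whose base address is 16-byte aligned. Under those assumptions, sample i starts at byte offset 8*i, so an even i always lands on a 16-byte boundary, and two complex samples (4 floats) fill exactly one 128-bit SSE register.

```c
#include <assert.h>
#include <stdint.h>

/* Returns 1 if p lies on a 16-byte boundary, which is what aligned
 * 128-bit SSE loads and stores require. */
static int is_aligned16(const void *p)
{
    return ((uintptr_t)p & 15) == 0;
}

/* Hypothetical stand-in kernel (NOT the real sbr_hf_gen math).
 * Buffers hold interleaved complex samples: sample i occupies
 * floats [2*i] (real) and [2*i + 1] (imaginary), i.e. 8 bytes.
 * start and end are sample indices and must be multiples of 2. */
static void scale_pairs(float *dst, const float *src, float gain,
                        int start, int end)
{
    assert(start % 2 == 0 && end % 2 == 0);

    for (int i = start; i < end; i += 2) {
        /* Two complex samples = 4 floats = 16 bytes per iteration.
         * With a 16-byte aligned base and even i, &src[2*i] is itself
         * 16-byte aligned, so a SIMD version could use aligned
         * loads/stores on every iteration. */
        for (int k = 0; k < 4; k++)
            dst[2 * i + k] = src[2 * i + k] * gain;
    }
}
```

Called as `scale_pairs(dst, src, 2.0f, 2, 6)`, the sketch touches samples 2 to 5 (floats 4 to 11) in two iterations of 4 floats each. An odd start index would put the first access at an offset of 8 bytes modulo 16, which is the misaligned case the even-index guarantee rules out.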

Timing:
- 32 bits: 326c -> 172c
- 64 bits: 323c -> 156c

Signed-off-by: Diego Biurrun <diego@biurrun.de>
2012-12-07 11:04:26 +01:00
Makefile x86: h264: Convert 8-bit QPEL inline assembly to YASM 2012-11-25 20:38:35 +01:00
ac3dsp.asm x86: mmx2 ---> mmxext in asm constructs 2012-11-14 00:58:51 +01:00
ac3dsp_init.c x86: mmx2 ---> mmxext in asm constructs 2012-11-14 00:58:51 +01:00
cabac.h
cavsdsp.c
dct32.asm
deinterlace.asm
dnxhdenc.c
dsputil.asm x86: h264: Convert 8-bit QPEL inline assembly to YASM 2012-11-25 20:38:35 +01:00
dsputil_avg_template.c x86: h264: Convert 8-bit QPEL inline assembly to YASM 2012-11-25 20:38:35 +01:00
dsputil_mmx.c x86: fix build without inline asm 2012-11-26 01:50:47 +01:00
dsputil_mmx.h
dsputil_qns_template.c
dsputil_rnd_template.c
dsputilenc.asm x86: dsputilenc: port to cpuflags 2012-11-28 16:05:44 +01:00
dsputilenc_mmx.c x86: mmx2 ---> mmxext in asm constructs 2012-11-14 00:58:51 +01:00
fdct.c
fft.asm
fft.h
fft_init.c
fmtconvert.asm x86: SPLATD: port to cpuflags 2012-11-18 18:34:05 +01:00
fmtconvert_init.c
h264_chromamc.asm x86: h264_chromamc: port to cpuflags 2012-11-25 17:25:10 +01:00
h264_chromamc_10bit.asm x86: mmx2 ---> mmxext in asm constructs 2012-11-14 00:58:51 +01:00
h264_deblock.asm x86: mmx2 ---> mmxext in asm constructs 2012-11-14 00:58:51 +01:00
h264_deblock_10bit.asm x86: mmx2 ---> mmxext in asm constructs 2012-11-14 00:58:51 +01:00
h264_i386.h
h264_idct.asm x86: h264_idct: port to cpuflags 2012-11-28 00:28:09 +01:00
h264_idct_10bit.asm x86: mmx2 ---> mmxext in asm constructs 2012-11-14 00:58:51 +01:00
h264_intrapred.asm x86: h264_intrapred: Fix C function names in comments 2012-11-18 18:34:05 +01:00
h264_intrapred_10bit.asm x86: mmx2 ---> mmxext in asm constructs 2012-11-14 00:58:51 +01:00
h264_intrapred_init.c x86: mmx2 ---> mmxext in asm constructs 2012-11-14 00:58:51 +01:00
h264_qpel.c x86: h264: Convert 8-bit QPEL inline assembly to YASM 2012-11-25 20:38:35 +01:00
h264_qpel_8bit.asm x86: h264 qpel: use the correct number of utilized xmm regs in cglobal 2012-11-25 18:48:43 -05:00
h264_qpel_10bit.asm
h264_weight.asm x86: h264_weight: port to cpuflags 2012-11-27 21:10:38 +01:00
h264_weight_10bit.asm
h264dsp_init.c x86: h264dsp: Fix linking with yasm and optimizations disabled 2012-11-28 14:45:28 +01:00
idct_mmx_xvid.c
idct_sse2_xvid.c
idct_xvid.h x86: mmx2 ---> mmxext in function names 2012-10-31 17:53:57 +01:00
imdct36.asm
lpc.c
mathops.h
mlpdsp.c
motion_est.c
mpegaudiodec.c
mpegvideo.c
mpegvideoenc.c
mpegvideoenc_template.c
pngdsp.asm x86: mmx2 ---> mmxext in asm constructs 2012-11-14 00:58:51 +01:00
pngdsp_init.c x86: mmx2 ---> mmxext in asm constructs 2012-11-14 00:58:51 +01:00
proresdsp.asm x86: yasm: Use complete source path for macro helper %includes 2012-10-31 00:37:42 +01:00
proresdsp_init.c
rv34dsp.asm x86: mmx2 ---> mmxext in asm constructs 2012-11-14 00:58:51 +01:00
rv34dsp_init.c x86: mmx2 ---> mmxext in asm constructs 2012-11-14 00:58:51 +01:00
rv40dsp.asm x86: mmx2 ---> mmxext in asm constructs 2012-11-14 00:58:51 +01:00
rv40dsp_init.c x86: mmx2 ---> mmxext in asm constructs 2012-11-14 00:58:51 +01:00
sbrdsp.asm SBR DSP x86: implement SSE sbr_hf_gen 2012-12-07 11:04:26 +01:00
sbrdsp_init.c SBR DSP x86: implement SSE sbr_hf_gen 2012-12-07 11:04:26 +01:00
simple_idct.c
snowdsp.c
vc1dsp.asm
vc1dsp.h
vc1dsp_init.c x86: mmx2 ---> mmxext in asm constructs 2012-11-14 00:58:51 +01:00
vc1dsp_mmx.c
vp3dsp.asm x86: mmx2 ---> mmxext in asm constructs 2012-11-14 00:58:51 +01:00
vp3dsp_init.c x86: mmx2 ---> mmxext in asm constructs 2012-11-14 00:58:51 +01:00
vp8dsp.asm x86: mmx2 ---> mmxext in asm constructs 2012-11-14 00:58:51 +01:00
vp8dsp_init.c x86: mmx2 ---> mmxext in asm constructs 2012-11-14 00:58:51 +01:00
vp56_arith.h
vp56dsp.asm
vp56dsp_init.c
w64xmmtest.c