ffmpeg/libavutil/x86
Lynne 35080149ef
x86/tx_float: mark AVX2 functions as AVXSLOW
Makes Bulldozer prefer AVX functions rather than AVX2,
which are 64% slower:

AVX:  117653 decicycles in av_tx (fft), 1048535 runs,     41 skips
AVX2: 193385 decicycles in av_tx (fft), 1048561 runs,     15 skips

The only difference between both is that vgatherdpd is used in
the former. We don't want to mark them with the new SLOW_GATHER
flag however, since gathers are still faster on Haswell/Zen 2/3
than plain loads.
2022-01-29 03:08:16 +01:00
..
Makefile x86/tx_float: do not build tx_float_init.c if x86 assembly is disabled 2022-01-27 02:17:46 +01:00
asm.h
bswap.h
cpu.c avutil/cpu: move slow gather checks below in the function 2021-12-21 17:51:17 -03:00
cpu.h
cpuid.asm
emms.asm
emms.h
fixed_dsp.asm
fixed_dsp_init.c
float_dsp.asm
float_dsp_init.c
imgutils.asm
imgutils_init.c
intmath.h
intreadwrite.h
lls.asm
lls_init.c
pixelutils.asm
pixelutils.h
pixelutils_init.c
timer.h
tx_float.asm x86/tx_float: add permute-free FFT versions 2022-01-26 04:13:58 +01:00
tx_float_init.c x86/tx_float: mark AVX2 functions as AVXSLOW 2022-01-29 03:08:16 +01:00
w64xmmtest.h
x86inc.asm
x86util.asm