mirror of https://git.ffmpeg.org/ffmpeg.git
35080149ef
Makes Bulldozer prefer AVX functions rather than AVX2, which are 64% slower: AVX: 117653 decicycles in av_tx (fft), 1048535 runs, 41 skips AVX2: 193385 decicycles in av_tx (fft), 1048561 runs, 15 skips The only difference between both is that vgatherdpd is used in the former. We don't want to mark them with the new SLOW_GATHER flag however, since gathers are still faster on Haswell/Zen 2/3 than plain loads. |
||
---|---|---|
.. | ||
Makefile | ||
asm.h | ||
bswap.h | ||
cpu.c | ||
cpu.h | ||
cpuid.asm | ||
emms.asm | ||
emms.h | ||
fixed_dsp.asm | ||
fixed_dsp_init.c | ||
float_dsp.asm | ||
float_dsp_init.c | ||
imgutils.asm | ||
imgutils_init.c | ||
intmath.h | ||
intreadwrite.h | ||
lls.asm | ||
lls_init.c | ||
pixelutils.asm | ||
pixelutils.h | ||
pixelutils_init.c | ||
timer.h | ||
tx_float.asm | ||
tx_float_init.c | ||
w64xmmtest.h | ||
x86inc.asm | ||
x86util.asm |