ffmpeg/tests/checkasm
John Cox 697533e76d avfilter/vf_bwdif: Add a filter_line3 method for optimisation
Add an optional filter_line3 to the available optimisations.

filter_line3 is equivalent to filter_line, memcpy, filter_line

filter_line shares quite a number of loads and some calculations in
common with its next iteration and testing shows that using aarch64
neon filter_line3s performance is 30% better than two filter_lines
and a memcpy.

Adds a test for vf_bwdif filter_line3 to checkasm

Rounds job start lines down to a multiple of 4. This means that if
filter_line3 exists then filter_line will not sometimes be called
once at the end of a slice depending on thread count. The final slice
may do up to 3 extra lines but filter_edge is faster than filter_line
so it is unlikely to create any noticable thread load variation.

Signed-off-by: John Cox <jc@kynesim.co.uk>
Signed-off-by: Martin Storsjö <martin@martin.st>
2023-07-06 00:21:05 +03:00
..
aarch64
arm
riscv checkasm: RISC-V 64-bit assembler test harness 2022-10-10 02:23:18 +02:00
x86 x86: replace explicit REP_RETs with RETs 2023-02-01 04:23:55 +01:00
.gitignore
Makefile checkasm: add h264chroma tests 2023-05-20 20:07:21 +02:00
aacpsdsp.c
af_afir.c
alacdsp.c
audiodsp.c checkasm/audiodsp: Be strict about MMX 2022-10-11 14:18:54 +02:00
av_tx.c x86/tx_float: add 15xN PFA FFT AVX SIMD 2022-09-23 12:35:27 +02:00
blockdsp.c checkasm/blockdsp: Be strict about MMX 2022-10-11 14:18:54 +02:00
bswapdsp.c
checkasm.c aarch64: Add cpu flags for the dotprod and i8mm extensions 2023-06-06 12:40:42 +03:00
checkasm.h checkasm: add h264chroma tests 2023-05-20 20:07:21 +02:00
exrdsp.c
fixed_dsp.c
flacdsp.c
float_dsp.c
fmtconvert.c avcodec/fmtconvert: Remove unused AVCodecContext parameter 2022-09-21 20:26:40 +02:00
g722dsp.c
h264chroma.c checkasm: add h264chroma tests 2023-05-20 20:07:21 +02:00
h264dsp.c
h264pred.c
h264qpel.c
hevc_add_res.c checkasm: pass context as pointer 2023-04-13 15:17:04 +03:00
hevc_deblock.c checkasm: pass context as pointer 2023-04-13 15:17:04 +03:00
hevc_idct.c checkasm: pass context as pointer 2023-04-13 15:17:04 +03:00
hevc_pel.c
hevc_sao.c checkasm: pass context as pointer 2023-04-13 15:17:04 +03:00
huffyuvdsp.c checkasm: pass context as pointer 2023-04-13 15:17:04 +03:00
idctdsp.c checkasm/idctdsp: Use declare_func_emms only when needed 2022-10-11 14:18:54 +02:00
jpeg2000dsp.c
llviddsp.c checkasm: pass context as pointer 2023-04-13 15:17:04 +03:00
llviddspenc.c checkasm/llviddspenc: Use declare_func_emms only when needed 2022-10-11 14:18:54 +02:00
lpc.c tests/checkasm/lpc: correct arithmetic when randomizing buffers 2022-09-23 01:50:59 +02:00
motion.c
opusdsp.c
pixblockdsp.c checkasm/pixblockdsp: Be strict about MMX 2022-10-11 14:18:54 +02:00
sbrdsp.c
sw_gbrp.c
sw_rgb.c
sw_scale.c
synth_filter.c dca_core: convert to lavu/tx 2022-11-06 14:39:36 +01:00
utvideodsp.c
v210dec.c checkasm/v210dec: add extra space to the destination arrays 2022-12-21 00:36:49 +01:00
v210enc.c checkasm/v210enc: test the entire width of 10-bit planar input arrays 2022-12-01 18:19:03 +01:00
vc1dsp.c checkasm/vc1dsp: Use declare_func_emms only when needed 2022-10-11 14:18:54 +02:00
vf_blend.c
vf_bwdif.c avfilter/vf_bwdif: Add a filter_line3 method for optimisation 2023-07-06 00:21:05 +03:00
vf_colorspace.c
vf_convolution.c libavfilter/x86/vf_convolution: add sobel filter optimization and unit test with intel AVX512 VNNI 2022-11-14 10:04:16 +08:00
vf_eq.c
vf_gblur.c
vf_hflip.c
vf_nlmeans.c
vf_threshold.c
videodsp.c
vorbisdsp.c
vp8dsp.c checkasm/vp8dsp: Use declare_func_emms only when needed 2022-10-08 09:33:36 +02:00
vp9dsp.c