ffmpeg/tests/checkasm
John Cox 697533e76d avfilter/vf_bwdif: Add a filter_line3 method for optimisation
Add an optional filter_line3 to the available optimisations.

filter_line3 is equivalent to filter_line, memcpy, filter_line

filter_line shares many loads and some calculations with its next
iteration. Testing shows that the aarch64 NEON filter_line3 performs
30% better than two filter_line calls and a memcpy.
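
As a rough illustration of the equivalence stated above, the sketch below composes a three-row call from two interpolated rows and one copied row. The names toy_filter_line and toy_filter_line3 and the simple averaging kernel are hypothetical stand-ins chosen for brevity; they are not the bwdif filter or FFmpeg's actual function signatures.

```c
#include <stdint.h>
#include <string.h>

/* Hypothetical stand-in for filter_line: interpolate a missing row as the
 * rounded average of the rows directly above and below it. The real bwdif
 * kernel is considerably more involved. */
static void toy_filter_line(uint8_t *dst, const uint8_t *cur, int w, int stride)
{
    for (int x = 0; x < w; x++)
        dst[x] = (uint8_t)((cur[x - stride] + cur[x + stride] + 1) >> 1);
}

/* Three output rows in one call: interpolate, copy, interpolate.
 * cur points at the first row to be interpolated. */
static void toy_filter_line3(uint8_t *dst, int d_stride,
                             const uint8_t *cur, int s_stride, int w)
{
    toy_filter_line(dst, cur, w, s_stride);
    memcpy(dst + d_stride, cur + s_stride, (size_t)w);
    toy_filter_line(dst + 2 * d_stride, cur + 2 * s_stride, w, s_stride);
}
```

An optimised version can keep rows that both interpolation steps read in registers, which is presumably where the reported gain comes from.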

Adds a test for vf_bwdif filter_line3 to checkasm.

Rounds job start lines down to a multiple of 4. This means that if
filter_line3 exists, filter_line is never called for a single leftover
line at the end of a slice, whatever the thread count. The final slice
may do up to 3 extra lines, but filter_edge is faster than filter_line,
so this is unlikely to create any noticeable thread load variation.
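
The slice rounding described above can be sketched as follows. The helper names and the exact expressions are assumptions made for illustration, not a copy of the code in vf_bwdif.c.

```c
/* Hypothetical helper: first line handled by job `jobnr` of `nb_jobs`,
 * rounded down to a multiple of 4. */
static int slice_start_aligned(int height, int jobnr, int nb_jobs)
{
    return ((height * jobnr) / nb_jobs) & ~3;
}

/* Hypothetical helper: end line (exclusive) of the same job. The last job
 * runs to the bottom of the frame, so it absorbs any lines lost to
 * rounding (at most 3). */
static int slice_end_aligned(int height, int jobnr, int nb_jobs)
{
    return jobnr == nb_jobs - 1
        ? height
        : slice_start_aligned(height, jobnr + 1, nb_jobs);
}
```

For example, with a height of 1080 and 7 jobs, job 1 would start at line 152 rather than 154, and the last job would cover lines 924 to 1079.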

Signed-off-by: John Cox <jc@kynesim.co.uk>
Signed-off-by: Martin Storsjö <martin@martin.st>
2023-07-06 00:21:05 +03:00
aarch64
arm
riscv
x86
.gitignore
Makefile
aacpsdsp.c
af_afir.c
alacdsp.c
audiodsp.c
av_tx.c
blockdsp.c
bswapdsp.c
checkasm.c
checkasm.h
exrdsp.c
fixed_dsp.c
flacdsp.c
float_dsp.c
fmtconvert.c
g722dsp.c
h264chroma.c
h264dsp.c
h264pred.c
h264qpel.c
hevc_add_res.c
hevc_deblock.c
hevc_idct.c
hevc_pel.c
hevc_sao.c
huffyuvdsp.c
idctdsp.c
jpeg2000dsp.c
llviddsp.c
llviddspenc.c
lpc.c
motion.c
opusdsp.c
pixblockdsp.c
sbrdsp.c
sw_gbrp.c
sw_rgb.c
sw_scale.c
synth_filter.c
utvideodsp.c
v210dec.c
v210enc.c
vc1dsp.c
vf_blend.c
vf_bwdif.c
vf_colorspace.c
vf_convolution.c
vf_eq.c
vf_gblur.c
vf_hflip.c
vf_nlmeans.c
vf_threshold.c
videodsp.c
vorbisdsp.c
vp8dsp.c
vp9dsp.c