ffmpeg

History

Martin Storsjö cdb1665f70 aarch64: Make transpose_4x4H do a regular transpose Previously, ff_h264_idct_add_neon (originally in the arm version) used a non-regular transpose in order to be able to use more instructions that deal with registers as 128 bit register pairs. The aarch64 translation doesn't do it to the same extent, but brought along the same structure since it was a straight translation. This reshuffles ff_h264_idct_add_neon, bringing it closer to the C implementation, making the transpose_4x4H macro do a regular transpose, usable for other algorithms as well. Previously, the third and fourth output from transpose_4x4H were swapped, and prior to `cc29d96d5a`, the same inputs as well. In addition to just swapping the outputs, also renumber the intermediate registers for better readability (making the register order match transpose_4x8B). This runs with the same number of cycles as before. Signed-off-by: Martin Storsjö <martin@martin.st>		2016-03-26 21:25:56 +02:00
..
Makefile	fft: Split MDCT bits off from FFT	2016-03-01 10:18:28 +01:00
asm-offsets.h	…
cabac.h	…
dcadsp_init.c	dca: remove unused decode_hf function and quant_d tables	2015-12-24 13:58:18 +01:00
dcadsp_neon.S	dca: remove unused decode_hf function and quant_d tables	2015-12-24 13:58:18 +01:00
fft_init_aarch64.c	fft: Split MDCT bits off from FFT	2016-03-01 10:18:28 +01:00
fft_neon.S	…
fmtconvert_init.c	arm64: int32_to_float_fmul neon asm	2015-12-14 16:45:02 +01:00
fmtconvert_neon.S	arm64: int32_to_float_fmul neon asm	2015-12-14 16:45:02 +01:00
h264chroma_init_aarch64.c	…
h264cmc_neon.S	…
h264dsp_init_aarch64.c	…
h264dsp_neon.S	…
h264idct_neon.S	aarch64: Make transpose_4x4H do a regular transpose	2016-03-26 21:25:56 +02:00
h264pred_init.c	…
h264pred_neon.S	…
h264qpel_init_aarch64.c	…
h264qpel_neon.S	…
hpeldsp_init_aarch64.c	…
hpeldsp_neon.S	…
imdct15_init.c	…
imdct15_neon.S	…
mdct_init.c	fft: Split MDCT bits off from FFT	2016-03-01 10:18:28 +01:00
mdct_neon.S	…
mpegaudiodsp_init.c	…
mpegaudiodsp_neon.S	…
neon.S	aarch64: Make transpose_4x4H do a regular transpose	2016-03-26 21:25:56 +02:00
neontest.c	…
rv40dsp_init_aarch64.c	…
synth_filter_neon.S	…
vc1dsp_init_aarch64.c	…
videodsp.S	…
videodsp_init.c	…
vorbisdsp_init.c	…
vorbisdsp_neon.S	…