Commit Graph

434 Commits

Author SHA1 Message Date
Loren Merritt ec199cc94c asm argument that might be in memory needs a size
Originally committed as revision 11890 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-02-10 01:45:42 +00:00
Loren Merritt 2c70770e33 use fewer registers in apply_welch_window_sse2
Originally committed as revision 11882 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-02-09 05:29:47 +00:00
Loren Merritt 1d67b037f7 sse2 h264 motion compensation. not new code, just separate out the cases that didn't need ssse3.
Originally committed as revision 11877 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-02-06 12:32:31 +00:00
Loren Merritt 20d565be6d put loop counter in a register if possible. makes some of the qpel functions 3% faster.
Originally committed as revision 11876 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-02-06 04:44:21 +00:00
Loren Merritt 7080ec2937 fix aliasing warnings. simpler too.
Originally committed as revision 11875 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-02-06 04:14:07 +00:00
Loren Merritt a2b7bc8e71 constant was excessively aligned
Originally committed as revision 11874 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-02-06 03:51:53 +00:00
Loren Merritt ddf969705f ssse3 h264 motion compensation.
25% faster tham mmx on core2, 35% if you discount fullpel, 4% overall decoding.

Originally committed as revision 11871 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-02-05 11:22:55 +00:00
Loren Merritt b64dfbb8d2 add qpel rounder once during hv rather than twice during hv and whatever it's averaged with
Originally committed as revision 11870 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-02-05 03:58:13 +00:00
Loren Merritt fa9b873e08 clean up an ugliness introduced in r11826. this syntax will require fewer changes when adding future sse2 code.
Originally committed as revision 11868 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-02-05 01:16:48 +00:00
Michael Niedermayer 9a7871f77d Deprecate old and inefficient per instruction asm().
Originally committed as revision 11865 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-02-04 20:03:08 +00:00
Loren Merritt b2f775860b reduce code duplication
Originally committed as revision 11863 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-02-04 16:20:46 +00:00
Loren Merritt b313e8159c avg_pixels4_mmx2
Originally committed as revision 11829 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-02-03 17:04:33 +00:00
Loren Merritt 6c01d0069d use mmx2/3dnow avg functions in avg_qpel*_mc00
Originally committed as revision 11828 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-02-03 16:19:37 +00:00
Loren Merritt ed5d7a531c ff_h264_idct8_add_sse2.
compared to mmx, 217->126 cycles on core2, 262->220 on k8.

Originally committed as revision 11826 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-02-03 07:05:11 +00:00
Loren Merritt 51f0ac65df remove some movq in ff_h264_idct8_add_mmx. 225->217 cycles on core2.
Originally committed as revision 11825 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-02-03 03:21:47 +00:00
Baptiste Coudurier 066e0cc50d add parenthesis, fix warning: i386/dsputil_mmx.c:2618: warning: suggest parentheses around arithmetic in operand of |
Originally committed as revision 11673 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-01-30 23:54:59 +00:00
Baptiste Coudurier afa4778989 fix prototypes, remove warning: i386/dsputil_mmx.c:3594: warning: assignment from incompatible pointer type
Originally committed as revision 11672 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-01-30 23:52:42 +00:00
Reimar Döffinger 766324fc49 Add and use DECLARE_ASM_CONST for constants used in assembler code.
Should make it easier to work around compilation problems with e.g. ICC.

Originally committed as revision 11641 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-01-27 20:50:16 +00:00
Reimar Döffinger 038f0f9b34 Use DECLARE_ALIGNED in yet another place
Originally committed as revision 11640 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-01-27 20:45:49 +00:00
Reimar Döffinger 6a1a2fa0fe Use DECLARE_ALIGNED and remove unneeded attribute_used
Originally committed as revision 11639 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-01-27 19:59:11 +00:00
Reimar Döffinger 27215c6bf4 Use DECLARE_ALIGNED
Originally committed as revision 11630 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-01-27 14:46:56 +00:00
Diego Biurrun 426d18b85b Rename illegal identifiers, _ followed by capital is reserved for the system.
Originally committed as revision 11541 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-01-16 21:21:30 +00:00
Christophe Gisquet 28748a9128 Factorize some duplicated code from CAVS and H.264 into a common file.
patch by Christophe Gisquet, christophe.gisquet free fr

Originally committed as revision 11504 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-01-11 08:29:58 +00:00
Christophe Gisquet ae904fd028 Fix issue #301:
summary of changes:
- Use MANGLE when loading some constants into MMX registers.
- Convert those constants to non-static and thus add ff_ prefix.
- Remove last parameter of MSPEL_FILTER13_CORE (was constant).
- Use of "+r" instead of stricter but unnecessary "+g".
- Use of REG_c and direct loading of some of the above.
patch by Christophe GISQUET, christophe.gisquet free fr
Subject: [FFmpeg-devel] [PATCH] Roundup issue #301
Date: Fri, 28 Dec 2007 19:22:18 +0100

Originally committed as revision 11376 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-01-02 19:24:42 +00:00
Christophe Gisquet 9fa3572903 add MMX version for put_no_rnd_h264_chroma_mc8_c, used in VC-1 decoding.
patch by Christophe GISQUET %christophe P gisquet A free P fr%
original thread:
date: Nov 25, 2007 12:35 AM
subject: Re: [FFmpeg-devel] MMX version for put_no_rnd_h264_chroma_mc8_c

Originally committed as revision 11298 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-12-21 23:11:22 +00:00
Diego Biurrun 9fbd14acb8 Fix typo in macro name: WARPER8_16_SQ --> WRAPPER8_16_SQ.
Originally committed as revision 11296 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-12-21 12:38:41 +00:00
Aurelien Jacobs 407c50a024 move FLAC mmx dsp to its own file
Originally committed as revision 11244 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-12-16 22:20:47 +00:00
Reimar Döffinger 15c57ced2f Add 'l' suffix where it is necessary because type can not always be
inferred from arguments. Fixes compilation with Intel compiler

Originally committed as revision 11227 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-12-15 23:08:15 +00:00
Aurelien Jacobs 1b77e87756 add required include to make this file self-contained
Originally committed as revision 11211 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-12-12 22:45:03 +00:00
Diego Biurrun 571bf37f6d typo/clarification
Originally committed as revision 11201 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-12-11 18:47:04 +00:00
Diego Biurrun 56cc85a068 Misc spelling fixes, prefer American over British English.
Originally committed as revision 11126 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-12-02 15:43:08 +00:00
Vitor Sessak 52b541ad79 spelling
Originally committed as revision 11122 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-12-01 22:21:04 +00:00
Aurelien Jacobs bb6cc730e5 remove some unused ff_p* vars from dsputil
Originally committed as revision 11106 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-11-27 22:57:26 +00:00
Aurelien Jacobs dbb5fdbdc8 remove useless #ifdef around extern declaration
Originally committed as revision 11105 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-11-27 22:56:15 +00:00
Aurelien Jacobs 7c35b55111 cosmetics: indentation
Originally committed as revision 11104 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-11-27 22:54:46 +00:00
Aurelien Jacobs 51ac88228a convert some #ifdef CONFIG_ to if(ENABLE_
Originally committed as revision 11103 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-11-27 22:54:18 +00:00
Aurelien Jacobs 5b67ce2afa build vc1dsp_mmx.c in its own compilation unit
Originally committed as revision 11102 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-11-27 22:42:55 +00:00
Aurelien Jacobs 43de50659b use ff_ prefix for extern vars
Originally committed as revision 11101 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-11-27 22:36:15 +00:00
Aurelien Jacobs 182f56cb0c make ff_p* vars extern so that they can be used in various *_mmx.c files
Originally committed as revision 11100 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-11-27 22:23:34 +00:00
Christophe Gisquet ac40ce4244 Typo fix. Previous version had some picture error building up until next keyframe.
Now MMX version decodes 1:1 what the C version does
patch by Christophe GISQUET %christophe P gisquet A free P fr%

Originally committed as revision 11090 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-11-25 09:43:35 +00:00
Christophe Gisquet d3a9c44e1d Strip debug stuff from vc1dsp_mmx.c, patch by Christophe GISQUET %hristophe P gisquet A free P fr%
Original thread:
date: Nov 24, 2007 3:09 PM
subject: [FFmpeg-devel] [PATCH] Strip debug stuff from vc1dsp_mmx.c

Originally committed as revision 11088 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-11-24 14:34:25 +00:00
Christophe Gisquet 82821c913b add VC-1 MMX DSP functions, under MIT license.
patch by Christophe GISQUET %christophe P gisquet A free P fr%
original thread:
date: Jul 7, 2007 12:52 PM
subject: [FFmpeg-devel] [PATCH] VC-1 MMX DSP functions

Originally committed as revision 11074 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-11-21 22:41:31 +00:00
Michael Niedermayer 02d361919a tring to workaround gcc 2.95 bug which causes random failures
Originally committed as revision 11003 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-11-12 02:04:01 +00:00
Diego Biurrun deb43f0bd8 Explain why there are no multiple inclusion guards in these header files.
Originally committed as revision 10771 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-10-17 22:29:13 +00:00
Aurelien Jacobs ab54bff20a Remove wrong multiple inclusion guards.
Those files are really meant to be included several times.

Originally committed as revision 10766 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-10-17 11:19:42 +00:00
Diego Biurrun 5b21bdabe4 Add FFMPEG_ prefix to all multiple inclusion guards.
Originally committed as revision 10765 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-10-17 09:37:46 +00:00
Diego Biurrun 31b2c1446f Add missing multiple inclusion guards.
Originally committed as revision 10763 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-10-17 09:31:06 +00:00
Shane bdb273564c Fix intended order of operations for 4 assert() checks.
Patch by Shane, gnome42 T gmail O com

Originally committed as revision 10711 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-10-11 22:18:38 +00:00
Loren Merritt 6810b93a81 sse2 version of compute_autocorr().
4x faster than c (somehow, even though doubles only allow 2x simd).
overal flac encoding: 15-50% faster on core2, 4-11% on k8, 3-13% on p4.

Originally committed as revision 10621 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-09-29 22:31:18 +00:00
Reimar Döffinger eafa1c90e5 Replace complicated and currently broken manual alignment code by
DECLARE_ALIGNED_16. Fixes crash in ff_snow_horizontal_compose97i_sse2

Originally committed as revision 10261 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-08-30 11:41:58 +00:00