Commit Graph

200 Commits

Author SHA1 Message Date
Loren Merritt d2bb7db135 sort H.264 mmx dsp functions into their own file
Originally committed as revision 4338 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-06-02 20:45:35 +00:00
Michael Niedermayer c26ae41db2 adding a few const
Originally committed as revision 4337 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-06-01 21:19:00 +00:00
Michael Niedermayer 435b0720a8 100l for myself (breaking amd64)
Originally committed as revision 4336 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-06-01 18:04:01 +00:00
Michael Niedermayer 6510f43cf3 merge a few asm blocks so gcc cant unoptimize it (658->631 dezicycles on duron)
Originally committed as revision 4334 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-06-01 11:56:58 +00:00
Michael Niedermayer 987ae784e6 get rid of 2 movq (680 -> 658 dezicycles on duron)
Originally committed as revision 4333 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-06-01 11:36:32 +00:00
Michael Niedermayer e4b36d4434 avoid one transpose (730->680 dezicycles on duron)
Originally committed as revision 4332 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-06-01 08:43:40 +00:00
Loren Merritt 85bbfcd4ee 10l (symbol mangling)
Originally committed as revision 4331 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-06-01 04:51:46 +00:00
Michael Niedermayer 1f3dbc09b1 add rounding bias before the horizontal idct (765->730 dezicyles on duron)
Originally committed as revision 4330 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-06-01 01:18:41 +00:00
Loren Merritt 1d62fc8560 MMX for H.264 iDCT (adapted from x264)
Originally committed as revision 4329 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-05-31 22:48:33 +00:00
Zoltán Hidvégi 3072f0cb2e MMX code for (put|avg)_h264_chroma_mc8
Originally committed as revision 4305 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-05-25 08:05:41 +00:00
Loren Merritt 5cf08f2393 H.264 deblocking optimizations (mmx for chroma_bS4 case, convert existing cases to 8-bit math)
Originally committed as revision 4271 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-05-18 09:17:22 +00:00
Michael Niedermayer 5773a74669 porting the mmx&sse2 (sse2 untested) vp3 idcts to the lavc idct API
Originally committed as revision 4260 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-05-17 18:28:40 +00:00
Michael Niedermayer b178f758fa disabling vp3 mmx&mmx2 idcts, they must be ported over to the lavc idct API, ill port the vp3 c idct
Originally committed as revision 4255 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-05-17 09:11:48 +00:00
Michael Niedermayer c998bdd9a0 fix PIC
Originally committed as revision 4204 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-05-08 18:48:19 +00:00
Loren Merritt 42251a2a4f MMX for H.264 deblocking filter
Originally committed as revision 4158 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-04-25 01:01:41 +00:00
Michael Niedermayer 4e492bf107 read 32bit instead of 64bit to avoid overreading and missalignments
Originally committed as revision 4133 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-04-17 13:30:45 +00:00
Martin Drab 4d9ae03b09 optimization and gcc 4.0 bug workaround patch by (Martin Drab >drab kepler.fjfi.cvut cz<)
Originally committed as revision 3945 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-02-07 17:09:48 +00:00
Michael Niedermayer e42a152bbc x86 cpu capabilities detection rewrite / cleanup
Originally committed as revision 3750 to svn://svn.ffmpeg.org/ffmpeg/trunk
2004-12-13 16:11:38 +00:00
Matthieu Castet e7ddb0cf4c detect sse on athlon-xp patch by (matthieu castet <castet >.< matthieu >at< free >.< fr>)
Originally committed as revision 3729 to svn://svn.ffmpeg.org/ffmpeg/trunk
2004-12-05 23:26:43 +00:00
Michael Niedermayer 1c3990dbba H.261 encoder by (Maarten Daniels <maarten dot daniels at luc dot ac dot be>)
Originally committed as revision 3643 to svn://svn.ffmpeg.org/ffmpeg/trunk
2004-10-28 10:12:57 +00:00
Aurelien Jacobs 8b4d077fc9 gcc 3.4.3 preversions do not appreciate invalid instruction and operand combinations anymore patch by (Aurelien Jacobs <aurel at gnuage dot org>)
Originally committed as revision 3604 to svn://svn.ffmpeg.org/ffmpeg/trunk
2004-10-17 20:27:11 +00:00
Aurelien Jacobs 053dea12f2 adapting existing mmx/mmx2/sse/3dnow optimizations so they work on x86_64 patch by (Aurelien Jacobs <aurel at gnuage dot org>)
Originally committed as revision 3578 to svn://svn.ffmpeg.org/ffmpeg/trunk
2004-10-11 02:19:29 +00:00
Michael Niedermayer 178fcca848 1/2 resolution decoding
Originally committed as revision 3507 to svn://svn.ffmpeg.org/ffmpeg/trunk
2004-09-25 23:18:58 +00:00
Michael Niedermayer e69538fa60 h264_qpel8_hv_lowpass_mmx2/3dnow
Originally committed as revision 3446 to svn://svn.ffmpeg.org/ffmpeg/trunk
2004-09-08 18:49:12 +00:00
Michael Niedermayer e772bb8a82 h264_qpel4_hv_lowpass_mmx2/3dnow
alignment fixes

Originally committed as revision 3445 to svn://svn.ffmpeg.org/ffmpeg/trunk
2004-09-08 18:25:45 +00:00
Michael Niedermayer 56d8bd5659 optimization
Originally committed as revision 3440 to svn://svn.ffmpeg.org/ffmpeg/trunk
2004-09-08 00:52:49 +00:00
Michael Niedermayer a6e39f45a2 optimization
Originally committed as revision 3439 to svn://svn.ffmpeg.org/ffmpeg/trunk
2004-09-07 22:56:50 +00:00
Michael Niedermayer ed8ffdf46c optimization
Originally committed as revision 3438 to svn://svn.ffmpeg.org/ffmpeg/trunk
2004-09-07 17:22:37 +00:00
Michael Niedermayer 437525c473 h264 luma motion compensation in mmx2/3dnow
Originally committed as revision 3437 to svn://svn.ffmpeg.org/ffmpeg/trunk
2004-09-07 01:48:45 +00:00
Michael Niedermayer d6af6b0350 10000l fix and use more mmx2/3dnow code for mpeg4 qpel which has been written and commited long time ago but appearently never used, qpel motion compensation is 5% faster
now

Originally committed as revision 3435 to svn://svn.ffmpeg.org/ffmpeg/trunk
2004-09-06 03:17:31 +00:00
Michael Niedermayer 6206f8c920 10l and better MMX/SSE detection for VIA1000
Originally committed as revision 3323 to svn://svn.ffmpeg.org/ffmpeg/trunk
2004-07-16 01:54:07 +00:00
Michael Niedermayer 61b3b639e7 dc_precission=11 fix
Originally committed as revision 3291 to svn://svn.ffmpeg.org/ffmpeg/trunk
2004-07-06 12:27:36 +00:00
Michael Niedermayer 4c99f2cdbe transmeta crusoe mmx detection by "Nobody/Anonymous - nobody" / sf feature request tracker
Originally committed as revision 3248 to svn://svn.ffmpeg.org/ffmpeg/trunk
2004-06-24 22:13:44 +00:00
Michael Niedermayer 1ec4df0fa8 sse8 and nsse in mmx
Originally committed as revision 3206 to svn://svn.ffmpeg.org/ffmpeg/trunk
2004-06-08 02:13:44 +00:00
Michael Niedermayer e96682e6f4 some of the warning fixes by (Michael Roitzsch <mroi at users dot sourceforge dot net>)
Originally committed as revision 3140 to svn://svn.ffmpeg.org/ffmpeg/trunk
2004-05-18 17:09:46 +00:00
Michael Niedermayer 77d0e165b1 mmx dct optimization
Originally committed as revision 3100 to svn://svn.ffmpeg.org/ffmpeg/trunk
2004-04-30 19:53:54 +00:00
Mike Melanson 7daabccb5d move the 0x80 vector outside of the function, thus saving the compiler
the trouble of having to initialize each byte on the stack individually

Originally committed as revision 3083 to svn://svn.ffmpeg.org/ffmpeg/trunk
2004-04-27 04:06:24 +00:00
Mike Melanson f9ed9d8584 separate out put_signed_pixels_clamped() into its own function and
implement an optimized MMX version of the function

Originally committed as revision 3082 to svn://svn.ffmpeg.org/ffmpeg/trunk
2004-04-27 03:58:06 +00:00
Mike Melanson 75b4f3c6d6 yes, that's nice...
Originally committed as revision 3072 to svn://svn.ffmpeg.org/ffmpeg/trunk
2004-04-26 00:33:59 +00:00
Mike Melanson 116824d0aa reorganize and simplify the VP3 IDCT stuff
Originally committed as revision 3071 to svn://svn.ffmpeg.org/ffmpeg/trunk
2004-04-26 00:20:29 +00:00
Mike Melanson 6186b94c13 hush...
Originally committed as revision 3065 to svn://svn.ffmpeg.org/ffmpeg/trunk
2004-04-25 03:38:30 +00:00
Mike Melanson 38acbc3cb9 hook up support for SSE2-optimized VP3 IDCT
Originally committed as revision 3064 to svn://svn.ffmpeg.org/ffmpeg/trunk
2004-04-25 03:33:30 +00:00
Mike Melanson 78a16bfec6 add selected SSE2 ASM macros
Originally committed as revision 3063 to svn://svn.ffmpeg.org/ffmpeg/trunk
2004-04-25 03:30:19 +00:00
Mike Melanson 5f9379007c SSE2-optimized variant of VP3 IDCT
Originally committed as revision 3062 to svn://svn.ffmpeg.org/ffmpeg/trunk
2004-04-25 03:29:10 +00:00
Mike Melanson daae869903 correct MMX-optimized variant of VP3 IDCT, with comments (thank you
Perl!)

Originally committed as revision 3061 to svn://svn.ffmpeg.org/ffmpeg/trunk
2004-04-25 03:26:24 +00:00
Michael Niedermayer 19f82bc824 fixing alignment problems -> SSE2 support enabled again in libavcodec (from ffdshow / milan_cutka)
Originally committed as revision 2990 to svn://svn.ffmpeg.org/ffmpeg/trunk
2004-04-09 16:32:49 +00:00
Gildas Bazin 68951ecf0c fft_*() renamed into ff_fft_*() patch by (Gildas Bazin <gbazin at altern dot org>)
Originally committed as revision 2882 to svn://svn.ffmpeg.org/ffmpeg/trunk
2004-03-13 21:43:24 +00:00
Mike Melanson 01456e8e86 use optimized VP3 functions where appropriate
Originally committed as revision 2857 to svn://svn.ffmpeg.org/ffmpeg/trunk
2004-03-08 02:18:19 +00:00
Mike Melanson 44cb64ee89 seperated out the C-based VP3 DSP functions into a different file; also
ported the MMX-optimized versions of those functions

Originally committed as revision 2855 to svn://svn.ffmpeg.org/ffmpeg/trunk
2004-03-08 02:13:45 +00:00
Dmitry Baryshkov 5c0513bda7 attribute used patch by (mitya at school dot ioffe dot ru (Dmitry Baryshkov))
Originally committed as revision 2830 to svn://svn.ffmpeg.org/ffmpeg/trunk
2004-02-29 22:10:18 +00:00