218 Commits

Author SHA1 Message Date
Loren Merritt
e8b562087d tweak h264_biweight
Originally committed as revision 4835 to svn://svn.ffmpeg.org/ffmpeg/trunk
2006-01-09 03:38:37 +00:00
Loren Merritt
cec9395977 fix some potential arithmetic overflows in pred_direct_motion() and ff_h264_weight_WxH_mmx2().

Originally committed as revision 4795 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-12-30 23:47:41 +00:00
Diego Biurrun
bb270c0896 COSMETICS: tabs --> spaces, some prettyprinting
Originally committed as revision 4764 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-12-22 01:10:11 +00:00
Diego Biurrun
115329f160 COSMETICS: Remove all trailing whitespace.
Originally committed as revision 4749 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-12-17 18:14:38 +00:00
Guillaume Poirier
f6d1338cb5 Add the rest of missing Reg_* macros to support both AMD-64 style regs and IA32 regs.
Not used yet, but should be once the SIMD code to accelerate Snow decoding is merged.

Originally committed as revision 4731 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-12-10 22:53:44 +00:00
Loren Merritt
ea15df8048 use sse16_sse2() in nsse
Originally committed as revision 4688 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-11-12 05:23:25 +00:00
Loren Merritt
a6624e21cb faster h264_chroma_mc8_mmx, added h264_chroma_mc4_mmx.
2-4% overall speedup.

Originally committed as revision 4666 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-10-27 06:45:29 +00:00
Loren Merritt
b926572aa9 h264 mmx weighted prediction. up to 3% overall speedup.
Originally committed as revision 4630 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-10-09 23:38:52 +00:00
Loren Merritt
5693c08356 sse2 16x16 sum squared diff (306=>268 cycles on a K8)
faster 8x8 mmx ssd (77=>70 cycles)

Originally committed as revision 4623 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-09-30 02:31:47 +00:00
Michael Niedermayer
12e9668119 replace a few mov + psrlq with pshufw, there are more cases which could benefit from this but they would require us to duplicate some functions ...
the trick is from various places (my own code in libpostproc, a patch on the x264 list, ...)

Originally committed as revision 4608 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-09-21 21:17:09 +00:00
Reimar Döffinger
cd7af76d9e Fix compile without CONFIG_GPL, misplaced #endif caused a missing }.
Originally committed as revision 4575 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-09-10 19:30:40 +00:00
Michael Niedermayer
9f211bc6d7 remove unused table entries
change non portable table access

Originally committed as revision 4574 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-09-10 19:03:37 +00:00
Michael Niedermayer
84740d5980 xvids mmx&mmx2 idcts
needed to decode xvid without some minor artefacts
under #ifdef CONFIG_GPL of course

Originally committed as revision 4572 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-09-10 17:01:30 +00:00
Måns Rullgård
79396ac685 Kill some compiler warnings. Compiled code verified identical after changes.
Originally committed as revision 4567 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-09-06 21:25:35 +00:00
Michael Niedermayer
d3a9f79871 simplify (d&a) and (d&~a) calculation, hint by skal
Originally committed as revision 4552 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-09-03 09:17:30 +00:00
Michael Niedermayer
b5b65df7a9 add consts (this was in my local tree, dunno where it came from, probably forgoten from some const patch)
Originally committed as revision 4551 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-09-02 21:13:19 +00:00
Måns Rullgård
bf4e3bd2d0 kill a bunch of compiler warnings
Originally committed as revision 4522 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-08-14 15:42:40 +00:00
Alexander Strasser
c11c2bc20b libavutil: Utility code from libavcodec moved to a separate library.
Originally committed as revision 4489 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-08-01 20:07:05 +00:00
Loren Merritt
d2bb7db135 sort H.264 mmx dsp functions into their own file
Originally committed as revision 4338 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-06-02 20:45:35 +00:00
Michael Niedermayer
c26ae41db2 adding a few const
Originally committed as revision 4337 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-06-01 21:19:00 +00:00
Michael Niedermayer
435b0720a8 100l for myself (breaking amd64)
Originally committed as revision 4336 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-06-01 18:04:01 +00:00
Michael Niedermayer
6510f43cf3 merge a few asm blocks so gcc cant unoptimize it (658->631 dezicycles on duron)
Originally committed as revision 4334 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-06-01 11:56:58 +00:00
Michael Niedermayer
987ae784e6 get rid of 2 movq (680 -> 658 dezicycles on duron)
Originally committed as revision 4333 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-06-01 11:36:32 +00:00
Michael Niedermayer
e4b36d4434 avoid one transpose (730->680 dezicycles on duron)
Originally committed as revision 4332 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-06-01 08:43:40 +00:00
Loren Merritt
85bbfcd4ee 10l (symbol mangling)
Originally committed as revision 4331 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-06-01 04:51:46 +00:00
Michael Niedermayer
1f3dbc09b1 add rounding bias before the horizontal idct (765->730 dezicyles on duron)
Originally committed as revision 4330 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-06-01 01:18:41 +00:00
Loren Merritt
1d62fc8560 MMX for H.264 iDCT (adapted from x264)
Originally committed as revision 4329 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-05-31 22:48:33 +00:00
Zoltán Hidvégi
3072f0cb2e MMX code for (put|avg)_h264_chroma_mc8
Originally committed as revision 4305 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-05-25 08:05:41 +00:00
Loren Merritt
5cf08f2393 H.264 deblocking optimizations (mmx for chroma_bS4 case, convert existing cases to 8-bit math)
Originally committed as revision 4271 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-05-18 09:17:22 +00:00
Michael Niedermayer
5773a74669 porting the mmx&sse2 (sse2 untested) vp3 idcts to the lavc idct API
Originally committed as revision 4260 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-05-17 18:28:40 +00:00
Michael Niedermayer
b178f758fa disabling vp3 mmx&mmx2 idcts, they must be ported over to the lavc idct API, ill port the vp3 c idct
Originally committed as revision 4255 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-05-17 09:11:48 +00:00
Michael Niedermayer
c998bdd9a0 fix PIC
Originally committed as revision 4204 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-05-08 18:48:19 +00:00
Loren Merritt
42251a2a4f MMX for H.264 deblocking filter
Originally committed as revision 4158 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-04-25 01:01:41 +00:00
Michael Niedermayer
4e492bf107 read 32bit instead of 64bit to avoid overreading and missalignments
Originally committed as revision 4133 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-04-17 13:30:45 +00:00
Martin Drab
4d9ae03b09 optimization and gcc 4.0 bug workaround patch by (Martin Drab >drab kepler.fjfi.cvut cz<)
Originally committed as revision 3945 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-02-07 17:09:48 +00:00
Michael Niedermayer
e42a152bbc x86 cpu capabilities detection rewrite / cleanup
Originally committed as revision 3750 to svn://svn.ffmpeg.org/ffmpeg/trunk
2004-12-13 16:11:38 +00:00
Matthieu Castet
e7ddb0cf4c detect sse on athlon-xp patch by (matthieu castet <castet >.< matthieu >at< free >.< fr>)
Originally committed as revision 3729 to svn://svn.ffmpeg.org/ffmpeg/trunk
2004-12-05 23:26:43 +00:00
Michael Niedermayer
1c3990dbba H.261 encoder by (Maarten Daniels <maarten dot daniels at luc dot ac dot be>)
Originally committed as revision 3643 to svn://svn.ffmpeg.org/ffmpeg/trunk
2004-10-28 10:12:57 +00:00
Aurelien Jacobs
8b4d077fc9 gcc 3.4.3 preversions do not appreciate invalid instruction and operand combinations anymore patch by (Aurelien Jacobs <aurel at gnuage dot org>)
Originally committed as revision 3604 to svn://svn.ffmpeg.org/ffmpeg/trunk
2004-10-17 20:27:11 +00:00
Aurelien Jacobs
053dea12f2 adapting existing mmx/mmx2/sse/3dnow optimizations so they work on x86_64 patch by (Aurelien Jacobs <aurel at gnuage dot org>)
Originally committed as revision 3578 to svn://svn.ffmpeg.org/ffmpeg/trunk
2004-10-11 02:19:29 +00:00
Michael Niedermayer
178fcca848 1/2 resolution decoding
Originally committed as revision 3507 to svn://svn.ffmpeg.org/ffmpeg/trunk
2004-09-25 23:18:58 +00:00
Michael Niedermayer
e69538fa60 h264_qpel8_hv_lowpass_mmx2/3dnow
Originally committed as revision 3446 to svn://svn.ffmpeg.org/ffmpeg/trunk
2004-09-08 18:49:12 +00:00
Michael Niedermayer
e772bb8a82 h264_qpel4_hv_lowpass_mmx2/3dnow
alignment fixes

Originally committed as revision 3445 to svn://svn.ffmpeg.org/ffmpeg/trunk
2004-09-08 18:25:45 +00:00
Michael Niedermayer
56d8bd5659 optimization
Originally committed as revision 3440 to svn://svn.ffmpeg.org/ffmpeg/trunk
2004-09-08 00:52:49 +00:00
Michael Niedermayer
a6e39f45a2 optimization
Originally committed as revision 3439 to svn://svn.ffmpeg.org/ffmpeg/trunk
2004-09-07 22:56:50 +00:00
Michael Niedermayer
ed8ffdf46c optimization
Originally committed as revision 3438 to svn://svn.ffmpeg.org/ffmpeg/trunk
2004-09-07 17:22:37 +00:00
Michael Niedermayer
437525c473 h264 luma motion compensation in mmx2/3dnow
Originally committed as revision 3437 to svn://svn.ffmpeg.org/ffmpeg/trunk
2004-09-07 01:48:45 +00:00
Michael Niedermayer
d6af6b0350 10000l fix and use more mmx2/3dnow code for mpeg4 qpel which has been written and commited long time ago but appearently never used, qpel motion compensation is 5% faster now

Originally committed as revision 3435 to svn://svn.ffmpeg.org/ffmpeg/trunk
2004-09-06 03:17:31 +00:00
Michael Niedermayer
6206f8c920 10l and better MMX/SSE detection for VIA1000
Originally committed as revision 3323 to svn://svn.ffmpeg.org/ffmpeg/trunk
2004-07-16 01:54:07 +00:00
Michael Niedermayer
61b3b639e7 dc_precission=11 fix
Originally committed as revision 3291 to svn://svn.ffmpeg.org/ffmpeg/trunk
2004-07-06 12:27:36 +00:00