The hook mechanism allows arbitrary processing stages to get dispatched
whenever certain named textures have been "finalized" by the code.
This is mostly meant to open up the internal processing in
pass_read_video to user scripts, but as a side benefit all of the code
dealing with offsets, plane alignment and other such confusing things
has been rewritten.
This hook mechanism is powerful enough to cover the needs of both
debanding and prescaling (and more), so as a result they can be removed
from pass_read_video entirely and implemented through hooks.
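Roughly, the idea looks like this (a sketch only; the field and type
names here are illustrative rather than the actual definitions): a hook
is a small descriptor that names the texture stage it attaches to and
provides a callback that appends its own shader stage once that texture
has been finalized.

    struct tex_hook {
        const char *hook_tex;   // e.g. "LUMA", "CHROMA", "MAIN"
        // Called when hook_tex has been rendered; appends a shader stage
        // that reads from (and possibly replaces) that texture.
        void (*hook)(struct gl_video *p, struct img_tex tex, void *priv);
        void *priv;             // per-hook state, e.g. filter options
    };

Debanding and prescaling then simply register such descriptors instead
of being special-cased in pass_read_video.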
Some avenues for optimization:
- The prescale hook is currently somewhat distributed code-wise. It might be
cleaner to split it into superxbr and NNEDI3 hooks which can each be
self-contained.
- It might be possible to move a large part of the hook code out to an
external file (including the hook definitions for debanding and
prescaling), which would be very much desired.
- Currently, some stages (chroma merging, integer conversion) will
*always* run even if unnecessary. I'm planning another series of
refactors (deferred img_tex) to allow dropping unnecessary shader
stages like these, but that's probably some ways away. In the meantime
it would be doable to re-add some of the logic to skip these stages if
we know we don't need them.
- More hook locations could be added (?)
Instead of rounding down, we round to the nearest float. This reduces
the maximum possible error introduced by this rounding operation. Also
clarify the comment.
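As a minimal illustration of why this helps (sketch; the variable names
are made up): quantizing by rounding down gives an error of up to one
step, while rounding to the nearest value bounds the error by half a
step.

    // floor() from <math.h>; quantize v to a grid with the given spacing:
    double down    = floor(v / step) * step;        // error in [0, step)
    double nearest = floor(v / step + 0.5) * step;  // error in [-step/2, step/2]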
Fixes broken colors with --vf=format=0bgr (but only if deband is
disabled).
0bgr means the first byte is padding, while the following three bytes
are bgr. From the vo_opengl perspective, it has 4 physical components
with 3 logical components. copy_img_tex() simply copied 3 components
from the physical representation, which means the last component (r) was
sliced off.
Fix this by not using p->color_swizzle for packed formats, and instead
let packed formats set the per-plane swizzle in texplane.swizzle. The
latter applies the swizzle as part of operation in copy_img_tex(), which
essentially moves physical to logical representations.
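A sketch of what this means in practice (illustrative only; the actual
shader assembly differs): copy_img_tex() can emit the per-plane swizzle
as part of the texture fetch, so the physical component order is mapped
to the logical one right where the plane is read.

    // plane->swizzle holds the physical-to-logical component order.
    // Example emitted GLSL: color.rgb = texture(texture0, texcoord0).gba;
    snprintf(line, sizeof(line), "color.%.*s = texture(%s, %s).%s;\n",
             num_components, "rgba", tex_name, coord_name, plane->swizzle);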
Unfortunately, debanding (and thus rendering with the opengl-hq
defaults) is still broken.
Make the find_plane_format function take a bit count.
This also makes the function's comment accurate for the first time since
the function and its comment were introduced. (It was documented as
taking bits, but always took bytes.)
Now that we know in advance whether an implementation should support a
specific format, we have more flexibility when determining which format
to use.
In particular, we can drop the roundabout ES logic.
I'm not sure if actually trying to create the FBO for probing still has
any value. But it might, so leave it for now.
Even if everything else is available, the need for first class arrays
breaks it. In theory we could fix this since we don't strictly need
them, but I guess it's not worth bothering.
Also give the misnamed have_mix variable a slightly better name.
This merges all knowledge about texture format into a central table.
Most of the work done here is actually identifying which formats exactly
are supported by OpenGL(ES) under which circumstances, and keeping this
information in the format table in a somewhat declarative way. (Although
only to the extent needed by mpv.) In particular, ES and float formats
are a horrible mess.
Again this is a big refactor that might cause regression on "obscure"
configurations.
In theory this was needed for the previous commit (but wasn't in
practice, since for hwdec the LUMINANCE_ALPHA mangling is not applied
anymore, and ANGLE uses RG textures in the absence of GL_ARB_texture_rg for
whatever crazy reasons).
In practice this caused funky colors on OSX with the uyvy422 format,
which is also fixed in this commit.
Rename gl_hwdec_driver.map_image to map_frame, and let it fill out a
struct gl_hwdec_frame describing the exact texture layout. This gives
more flexibility to what the hwdec interop can export. In particular, it
can export strange component orders/permutations and textures with
padded size. (The latter originating from cropped video.)
The way gl_hwdec_frame works is in the spirit of the rest of the
vo_opengl video processing code, which tends to put as much information
in immediate state (as part of the dataflow), instead of declaring it
globally. To some degree this duplicates the texplane and img_tex
structs, but until we somehow unify those, it's better to give the hwdec
state its own struct. The fact that changing the hwdec struct would
require changes and testing on at least 4 platform/GPU combinations
makes duplicating it almost a requirement to avoid pain later.
Make gl_hwdec_driver.reinit set the new image format and remove the
gl_hwdec.converted_imgfmt field.
Likewise, gl_hwdec.gl_texture_target is replaced with
gl_hwdec_plane.gl_target.
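Roughly, the new per-frame description looks like this (a simplified
sketch; the real structs may carry more fields):

    struct gl_hwdec_plane {
        GLuint gl_texture;  // texture object exported by the interop
        GLenum gl_target;   // e.g. GL_TEXTURE_2D or GL_TEXTURE_RECTANGLE
        int tex_w, tex_h;   // allocated texture size (may include padding)
    };

    struct gl_hwdec_frame {
        struct gl_hwdec_plane planes[4];
    };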
Split out a init_image_desc function from init_format. The latter is not
called in the hwdec case at all anymore. Setting up most of struct
texplane is also completely separate in the hwdec and normal cases.
video.c does not check whether the hwdec "mapped" image format is
supported. This should not really happen anyway, and if it does, the
hwdec interop backend must fail at creation time, so this is not an
issue.
This gives us 16 bit fixed-point integer texture formats, including the
ability to sample from them with linear filtering and to use them as FBO
attachments.
The integer texture format path is still there for the sake of ANGLE,
which does not support GL_EXT_texture_norm16 yet.
The change to pass_dither() is needed, because the code path using
GL_R16 for the dither texture relies on glTexImage2D being able to
convert from GL_FLOAT to GL_R16. GLES does not allow this. This could be
trivially fixed by doing the conversion ourselves, but I'm too lazy to
do this now.
The active texture and some pixelstore parameters are now always reset
to defaults when entering and leaving the renderer. Could be important
for libmpv.
Apply basic transformations like rotation by 90° and mirroring when
sampling from the source textures. The original idea was making this
part of img_tex.transform, but this didn't work: lots of code plays
tricks on the transform, so manipulating it is not necessarily
transparent, especially when width/height are switched. So add a new
pre_transform field, which is strictly applied before the normal
transform.
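Conceptually (sketch only; the exact helper semantics may differ
slightly), the two transforms are simply composed before sampling, with
pre_transform applied first:

    // full(v) should compute transform(pre_transform(v)): rotation and
    // mirroring happen first, then the usual crop/scale transform.
    struct gl_transform full = tex.pre_transform;
    gl_transform_trans(tex.transform, &full);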
This fixes most glitches involved with rotating the image.
Cropping and rotation are now weirdly separated, even though they could
be done in the same step. I think this is not much of a problem, and
has the advantage that changing panscan does not trigger FBO
reallocations (I think...).
This commit refactors the 3DLUT loading mechanism to build the 3DLUT
against the original source characteristics of the file. This allows us,
among other things, to use a real BT.1886 profile for the source. This
also allows us to actually use perceptual mappings. Finally, this
reduces errors on standard gamut displays (where the previous 3DLUT
target of BT.2020 was unreasonably wide).
This also improves the overall accuracy of the 3DLUT due to eliminating
rounding errors where possible, and allows for more accurate use of
LUT-based ICC profiles.
The current code is somewhat more ugly than necessary, because the idea
was to implement this commit in a working state first, and then maybe
refactor the profile loading mechanism in a later commit.
Fixes #2815.
This also draws it after color management etc. In a nutshell, this
change makes the transparency checkerboard independent of upscaling,
panning, cropping etc. It will always be the same apparent size and
position (relative to the window).
It will also be independent of the video colorspace and such things.
(Note: This might cause white imbalance issues if playing a file with a
white point that does not match the display, in absolute colorimetric
mode. But that's uncommon, especially in conjunction with transparent
image files, so it's not a primary concern here)
Until now, we've let the windowing backend decide. But since they
usually require premultiplied alpha, and premultiplied alpha is easier
to handle, hardcode it.
The recent changes fixed rotation handling, but reversed the rotation
direction. The direction is expected to be counter-clockwise, because
demuxers export video rotation metadata as such.
This has been completely broken since commit 93546f0c. But even before,
rotation handling did not make too much sense. In particular, it rotated
the contents of the cropped image, instead of adjusting the crop
rectangle as well. The result was that things like panscan or zooming
did not behave as expected with rotation applied.
The same is true for vertical flipping. Flipping is triggered by
negative image stride. OpenGL does not support flipping the image on
upload, so it's done as part of the rendering. It can be triggered with
--vf=flip, but other filters and even decoders could setup negative
stride to flip the image.
Fix these issues by applying transforms to texture coordinates properly,
and by making rotation and flipping part of these transforms.
This still doesn't work properly for separated scaling. The issue is
that we'd have to adjust how the passes are done. For now, pick a very
stupid solution by rotating the image to a FBO, and then scaling from
that. This has the advantage that the scale logic doesn't have to be
complicated for such a rare case. It could be improved later.
Prescaling is apparently still broken. I don't know if chroma
positioning works properly either. None of this should affect the case
with no rotation.
If the texture count is lower than 4, entries in va.textcoord[] will
remain uninitialized. While this is unlikely to be a problem (since
these values are unused on the shader side too), it's not nice and might
explain some things which have shown up in valgrind.
Fix by always initializing the whole thing.
Glitches when resizing are still possible, but are reduced. Other VOs
could support this too, but don't need to do so.
(Totally avoiding glitches would be much more effort, and probably not
worth the trouble. How about you just watch the video the player is
playing, instead of spending your time resizing the window.)
This also gets rid of the kind of hard to read texture swizzle setup and
turns it into something dumber.
Assumes that we don't create any FBOs with 2 channel formats. (Only the
video source textures are handled by this commit.)
This is a fresh implementation from scratch that carries with it
significantly less baggage and verbosity from the previous (ported)
version.
The actual values for the masks and such were copied from the
current code. Behavior and performance should be unaffected.
An important difference between the old code and the new code is that
the new code always explicitly samples from the first component, rather
than being able to process multiple planes at once.
Since prescale-luma only affects luma, I deemed this unnecessary. May
change in the future, if prescale-chroma ever gets implemented. But
prescaling multiple planes would be slow to do this way. (Better would
be to generalize it to differently-sized vectors)
Instead of hard-coding the logic and planes to skip, factor this out
into a reusable function, and add the number of relevant coordinates
to the texture state.
Since prescale now literally only affects the luma plane (and the
filters are all designed for luma-only operation either way), the option
has been renamed and the documentation updated to clarify this.
This is a pretty major rewrite of the internal texture binding
mechanic, which makes it more flexible.
In general, the difference between the old and current approaches is
that now, all texture description is held in a struct img_tex and only
explicitly bound with pass_bind. (Once bound, a texture unit is assumed
to be set in stone and no longer tied to the img_tex)
This approach makes the code inside pass_read_video significantly more
flexible and cuts down on the number of weird special cases and
spaghetti logic.
It also has some improvements, e.g. cutting down greatly on the number
of unnecessary conversion passes inside pass_read_video (which was
previously mostly done to cope with the fact that the alternative would
have resulted in a combinatorial explosion of code complexity).
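In rough terms (a simplified sketch, not the exact definitions), an
img_tex describes everything needed to sample from a texture, and
pass_bind() turns it into a bound texture unit:

    struct img_tex {
        GLuint gl_tex;                  // the texture object itself
        GLenum gl_target;               // 2D vs. rectangle, etc.
        int w, h;                       // logical size of the valid region
        struct gl_transform transform;  // coordinate mapping when sampling
        float multiplier;               // tex_mul-style range expansion
        int components;                 // number of logical components
    };

    // Returns the texture unit to sample from; once bound, the unit is
    // set in stone and no longer tied to the img_tex.
    static int pass_bind(struct gl_video *p, struct img_tex tex);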
Some other notable changes (and potential improvements):
- texture expansion is now *always* handled in pass_read_video, and the
colormatrix never does this anymore. (Which means the code could
probably be removed from the colormatrix generation logic, modulo some
other VOs)
- struct fbo_tex now stores both its "physical" and "logical"
(configured) size, which cuts down on the amount of width/height
baggage on some function calls
- vo_opengl can now technically support textures with different bit
depths (e.g. 10 bit luma, 8 bit chroma) - but the APIs it queries
inside img_format.c don't export this (nor does ffmpeg support it,
really) so the status quo of using the same tex_mul for all planes is
kept.
- dumb_mode is now only needed because of the indirect_fbo being in the
main rendering pipeline. If we reintroduce p->use_indirect and thread
a transform through the entire program this could be skipped where
unnecessary, allowing for the removal of dumb_mode. But I'm not sure
how to do this in a clean way. (Which is part of why it got introduced
to begin with)
- It would be trivial to resurrect source-shader now (it would just be
one extra 'if' inside pass_read_video).
Why was this done so stupidly, with so many complicated special cases,
before? Declare it once so the shader bits don't have to figure out where
and when to do so themselves.
Apple crap (namely hardware decoding interop) forces us to use rectangle
textures for input. But after that we continue with normal textures.
This was not considered for debanding, and the sampler type used for it
can be different depending on the exact render chain. Simply use the
target type of the input texture.
Coverity thinks that integer_conv_fbo[index] is implied to be accessed with up
to index=5. Although that is theoretical only, it has a point that this
makes no sense. Use the same constant for the array allocation, to make
it more uniform and robust.
Fixes CID 1350060.
GLES does not support high bit depth fixed point textures for unknown
reasons, so direct 10 bit input is not possible. But we can still use
integer textures, which are supported by GLES 3.0. These store integer
data just like the standard fixed point textures, except they are not
normalized on sampling. They also don't support bilinear filtering, and
require a special sampler ("usampler2D").
While these texture formats enable us to shuffle the data to the GPU,
they're rather impractical with the requirements mentioned above and our
current architecture. One problem is that most code assumes it can
always use bilinear scaling (even if bilinear is never used when using
appropriate scale/cscale options). Another is that we don't have any
concept of running a function on a texture in an uniform way.
So for now, run a simple conversion step through a FBO. The FBO will use
the rgba16f format normally, which gives enough bits for 10 bit, and
will at least gracefully degrade with higher depth input.
This is bound to be much slower than a more "direct" method, but at
least it works and is simple to implement.
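The conversion itself is conceptually trivial; a sketch of the emitted
GLSL for a 10 bit plane stored in an unsigned integer texture (names
and the exact normalization constant are illustrative):

    // Sample the unnormalized integer texture and normalize manually,
    // writing the result into the rgba16f FBO.
    snprintf(line, sizeof(line),
             "color = vec4(texture(texture%d, texcoord%d)) / %f;\n",
             n, n, (double)((1 << depth) - 1));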
The odd change of function call order in init_video() is to properly
disable "dumb mode" (no FBO use) if these texture formats are in use.
This was never reset - absolutely can't be right. If the renderer
somehow switches back to another codepath, it certainly has to be reset.
Maybe this was hard to hit, as the normalization is going to be
idempotent in simpler cases (like rendering RGBA input).
Also get rid of the "merged" variable.
Often requested. The main argument, that prominent scalers like sharpen
change the image even if no scaling happens, disappeared anyway.
("sharpen", unsharp masking, is neither prominent nor a scaler anymore.
This is an artifact from MPlayer, which fuses unsharp masking with
bilinear scaling in order to make it single-pass, or so.)
Some VOs had support for these - remove them.
Typically, these formats will have only some use in cases where using
RGB software conversion with libswscale is faster than letting the
VO/GPU do it (i.e. almost never). For the sake of testing this case,
keep IMGFMT_RGB565. This is the least messy format, because it has no
padding/alpha bits with unknown semantics.
Note that decoding to these formats still works. We'll let libswscale
repack the data to whatever the VO in use can take.
Do this to make the license situation less confusing.
This change should be of no consequence, since LGPL is compatible with
GPL anyway, and making it LGPL-only does not restrict the use with GPL
code.
Additionally, the wording implies that this is allowed, and that we can
just remove the GPL part.
Should take care of the planned FFmpeg AV_PIX_FMT_P010 addition. (This
will eventually be needed when doing HEVC Main 10 decoding with DXVA2
copyback.)
Add a "blend-tiles" choice to the "alpha" sub-option. This is pretty
simplistic and uses the GL raster position to derive the tiles. A weird
consequence is that using --vo=opengl and --vo=opengl-hq gives different
scaling behavior (screenspace pixel size vs. source video pixel size
16x16 tiles), but it seems we don't have easy access to the original
texture coordinates. Using the rasterpos is probably simpler.
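A sketch of the emitted GLSL (the tile size and shades are just
illustrative values): the tile is derived from the fragment's window
position, so it stays fixed in screen space no matter how the video is
scaled or panned.

    static const char *blend_tiles_glsl =
        "float tile = mod(floor(gl_FragCoord.x / 16.0) +\n"
        "                 floor(gl_FragCoord.y / 16.0), 2.0);\n"
        "vec3 bg = mix(vec3(0.75), vec3(0.5), tile);\n"
        "color.rgb = mix(bg, color.rgb, color.a);\n";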
Make this option the default.
Since alpha isn't pulled through the colormatrix (maybe it should?), we
reject alpha formats with odd sizes, such as yuva444p10.
But the awful tex_mul path in vo_opengl does this anyway (at some points
even explicitly), which means there will be a subtle difference in
handling of 16 bit yuv alpha formats. Make it consistent and always
apply the range adjustment to the alpha component. This also means odd
sizes like 10 bit are supported now.
This assumes alpha uses the same "shifted" range as the yuv color
channels for depths larger than 8 bit. I'm not sure whether this is
actually the case.
Store the determined framebuffer depth in struct GL instead of
MPGLContext. This means gl_video_set_output_depth() can be removed, and
also justifies adding new fields describing framebuffer/backend
properties to struct GL instead of having to add more functions just to
shovel the information around.
Keep in mind that mpgl_load_functions() will wipe struct GL, so the
new fields must be set before calling it.
Why not.
Also, instead of disabling hue/saturation for RGB, just don't apply
them. (They don't make sense for conversion matrixes other than YUV, but
I can't be bothered to keep the fine-grained disabling of UI controls
either.)
With the default settings, the matrix for fruit dithering requires 12 bits
of precision (values from 0/4096 to 4095/4096). But a 16-bit float
provides only 10 bits of mantissa. In addition, when `dither-size-fruit=8`
is set, 16 bits are required from the texture format.
Fix this by attempting to use 16 bit integer texture first. This is
still not precise, but should be better than using a half float.
The computation of the tex_mul variable was broken in multiple ways.
This variable is used e.g. by debanding for moving expansion of 10 bit
fixed-point input to normalized range to another stage of processing.
One obvious bug was that the rgb555 pixel format was broken. This format
has component_bits=5, but obviously it's already sampled in normalized
range, and does not need expansion. The tex_mul-free code path avoids
this by not using the colormatrix. (The code was originally designed to
work around dealing with the generally complicated pixel formats by only
using the colormatrix in the YUV case.)
Another possible bug was with 10 bit input. It expanded the input by
bringing the [0,2^10) range to [0,1], and then treating the expanded
input as 16 bit input. I didn't bother to check what this actually
computed, but it's somewhat likely it was wrong anyway. Now it uses
mp_get_csp_mul(), and disables expansion when computing the YUV matrix.
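For reference, a sketch of what tex_mul is supposed to compute in the
simplest (full range, LSB-aligned) case; the real code goes through
mp_get_csp_mul() and the colorspace parameters instead:

    // A 10 bit sample stored in a 16 bit texture gets normalized against
    // 65535 on sampling, so multiply it back up so that 1023 maps to 1.0.
    double tex_mul = ((1 << texture_bits) - 1) / (double)((1 << input_bits) - 1);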
It turns out that with accurate lookup we can decrease the default
size of the lookup texture now. Do it to compensate for the performance
loss introduced by the LUT_POS macro.
Define a macro to correct the coordinate for the lookup texture. Cache
the corrected coordinate for the 1D filter and use mix() to minimize the
performance impact.
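The correction itself is small; a sketch of the idea (the exact macro
in the code may be written differently): for a LUT with N entries the
texel centers sit at (i + 0.5) / N, so a [0,1] coordinate has to be
mapped into [0.5/N, 1 - 0.5/N] before the lookup.

    // Hits the first texel center at x = 0 and the last at x = 1.
    #define LUT_POS(x, lut_size) \
        (0.5 / (lut_size) + (x) * (1.0 - 1.0 / (lut_size)))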
There are claims that nnedi3.c doesn't constitute its own new
implementation, but is derived from existing HLSL or OpenCL shaders
distributed under the LGPLv3 license.
Until these are resolved, do the "correct" thing and require
--enable-gpl3 to build nnedi.
At least I hope so.
Deriving the duration from the pts was not really correct. It doesn't
include speed adjustments, and becomes completely wrong if the user e.g.
changes the playback speed by a huge amount. Pass through the accurate
duration value by adding a new vo_frame field.
The value for vsync_offset was not correct either. We don't need the
error for the next frame, but the error for the current one. This wasn't
noticed because it makes no difference in symmetric cases, like 24 fps
on 60 Hz.
I'm still not entirely confident in the correctness of this, but it sure
is an improvement.
Also, remove the MP_STATS() calls - they're not really useful to debug
anything anymore.
This was just converting back and forth between int64_t/microseconds and
double/seconds. Remove this stupidity. The pts/duration fields are still
in microseconds, but they have no meaning in the display-sync case (also
drop printing the pts field from opengl/video.c - it's always 0).
I decided that I actually can't stand how vo_opengl unnecessarily puts
the video through 3 shader stages (instead of 1). Thus, what was meant
to be a fallback for weak OpenGL implementations, the dumb-mode, now
becomes default if the user settings allow it.
The code required to check for the settings isn't so wild, so I guess
it's manageable. I still hope that one day, our rendering logic can
generate ideal shader stages for this case too.
Note that in theory, dumb-mode could be reenabled at runtime due to a
color management 3D LUT being set, so a separate dumb_mode field is
required. The dumb-mode option can't just be overwritten.
Unfortunately, color management still cannot work, because no GLES
version specified so far supports fixed-point 16 bit textures. Maybe
we could use integer textures, but these don't support filtering.
Using float textures would be another possibility.
Polar scalers use 1D textures, because they're slightly faster on some
GPUs than 2D textures. But 2D textures work too, so add support for
them.
Allows using these scalers with ANGLE.
This makes advanced scaling sort-of work for GLES 3.0 (on ANGLE). It's
still not very advisable, as 8 bits might not be enough to avoid
debanding. (Ironically, the debanding filter can be enabled, and does
not raise any GL errors - but probably doesn't do anything useful.)
Something goes wrong somewhere. Don't bother, it's only needed for
compatibility with our absolute baseline (GL 2.1/GLES 2).
On the other hand, we can process nv12 formats just fine.
In the display-sync, non-interpolation case, and if the display refresh
rate is higher than the video framerate, we duplicate display frames by
rendering exactly the same screen again. The redrawing is cached with a
FBO to speed up the repeat.
Use glBlitFramebuffer() instead of another shader pass. It should be
faster.
For some reason, post-process was run again on each display refresh.
Stop doing this, which should also be slightly faster. The only
disadvantage is that temporal dithering will be run only once per video
frame, but I can live with this.
One aspect is messy: clearing the background is done at the start on the
target framebuffer, so to avoid clearing twice and duplicating the code,
only copy the part of the framebuffer that contains the rendered video.
(Which also gets slightly messy - needs to compensate for coordinate
system flipping.)
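A sketch of the copy (variable names are illustrative; the flip
compensation is the messy part mentioned above):

    glBindFramebuffer(GL_READ_FRAMEBUFFER, vid_fbo);  // cached video frame
    glBindFramebuffer(GL_DRAW_FRAMEBUFFER, 0);        // the window
    // Copy only the rectangle containing the rendered video; swapping the
    // destination Y coordinates compensates for the flipped coordinates.
    glBlitFramebuffer(x0, y0, x1, y1,
                      x0, dst_h - y0, x1, dst_h - y1,
                      GL_COLOR_BUFFER_BIT, GL_NEAREST);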
The nnedi3 prescaler requires a normalized range to work properly,
but the original implementation did the range normalization after
the first step of the first pass. This could lead to severe quality
degradation when debanding is not enabled for NNEDI3.
Fix this issue by passing `tex_mul` into the shader code.
Fixes #2464
Commit 27dc834f added it as such.
Also remove the check for glUniformBlockBinding() - it's part of an
extension, and the check glGetUniformBlockIndex() already checks whether
the extension is fully available.
Implement NNEDI3, a neural network based deinterlacer.
The shader is reimplemented in GLSL and supports both 8x4 and 8x6
sampling window now. This allows the shader to be licensed
under LGPL2.1 so that it can be used in mpv.
The current implementation supports uploading the NN weights (up to
51kb with the placebo setting) in two different ways: via a uniform
buffer object, or by hard-coding them into the shader source. UBO
requires OpenGL 3.1, which only guarantees 16kb per block. But 64kb
seems to be a common default for recent cards/drivers (which nnedi3 is
targeting), so I think we're fine here (with the default nnedi3 setting
the size of the weights is 9kb). Hard-coding them into the shader
requires OpenGL 3.3, for the
"intBitsToFloat()" built-in function. This is necessary to precisely
represent these weights in GLSL. I tried several human-readable
floating point number formats (with precision high enough for single
precision floats), but for some reason they did not work reliably:
bad pixels (with NaN values) could be produced with some weight sets.
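A sketch of the hard-coding path (illustrative; the real generator
formats whole weight arrays): each weight is written out as its exact
bit pattern, so intBitsToFloat() reconstructs it losslessly in the
shader.

    #include <stdint.h>
    #include <stdio.h>
    #include <string.h>

    // Emits e.g. "intBitsToFloat(1065353216)", which is exactly 1.0f.
    static void emit_weight(char *out, size_t size, float w)
    {
        uint32_t bits;
        memcpy(&bits, &w, sizeof(bits));  // exact IEEE-754 bit pattern
        snprintf(out, size, "intBitsToFloat(%d)", (int32_t)bits);
    }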
We could also add support for uploading these weights with a texture,
just for compatibility reasons (e.g. upscaling a still image with a
low-end graphics card). But as I tested, it's rather slow even with a
1D texture (we would probably have to use a 2D texture due to dimension
size limitations). Since there is always a better choice for NNEDI3
upscaling of still images (the vapoursynth plugin), it's not implemented
in this commit. If this turns out to be in popular demand from users,
it should be easy to add later.
For those who want to optimize the performance a bit further, the
bottlenecks seem to be:
1. the overhead of uploading and accessing these weights (in particular,
the shader code will be regenerated for each frame, though that happens
on the CPU).
2. "dot()" performance in the main loop.
3. "exp()" performance in the main loop, there are various fast
implementation with some bit tricks (probably with the help of the
intBitsToFloat function).
The code was tested with an nvidia card and driver (355.11), on Linux.
Closes #2230
Add the Super-xBR filter for image doubling, and the prescaling framework
to support it.
The shader code was ported from the MPDN extensions project, with
modifications to process luma only.
This commit is largely inspired by code from #2266, with
`gl_transform_trans()` authored by @haasn taken directly.
next_vsync/prev_vsync was only used to retrieve the vsync duration. We
can get this in a simpler way.
This also removes the vsync duration estimation from vo_opengl_cb.c,
which is probably worthless anyway. (And once interpolation is made
display-sync only, this won't matter at all.)
vo_frame.num_vsyncs can be != 1 in some cases in normal sync mode too.
This is not a very exact fix, but in exchange it's robust. (These
vo_frame flags are way too tricky in combination with redrawing and
such.)
There were occasional shader compilation and rendering failures if FBOs
were unavailable. This is caused by the FBO caching code getting active,
even though FBOs are unavailable (i.e. dumb-mode).
Broken by commit 97fc4f.
Fixes #2432.
This speeds up redraws considerably (improving e.g. <60 Hz material on a 60 Hz
monitor with display-sync active, or redraws while paused), but slightly
slows down the worst case (e.g. video FPS = display FPS).
Adds support for AV_PIX_FMT_GBRP9, AV_PIX_FMT_GBRP10, AV_PIX_FMT_GBRP12,
AV_PIX_FMT_GBRP14, AV_PIX_FMT_GBRP16, AV_PIX_FMT_GBRAP, and
AV_PIX_FMT_GBRAP16.
(Not that it matters, because nobody uses these anyway.)
If interpolation is enabled, then this causes heavy artifacts if done
while unpaused. It's preferable to allow a latency of a few frames for
the change to take full effect instead. If this is done paused, the
frame is fully redrawn anyway.
Surfaces used by hardware decoding formats can be mapped exactly like a
specific software pixel format, e.g. RGBA or NV12. p->image_params is
supposed to be set to this format, but it wasn't.
(How did this ever work?)
Also, setting params->imgfmt in the hwdec interop drivers is pointless
and redundant. (Change them to asserts, because why not.)
This turns the old scalers (inherited from MPlayer) into a pre-
processing step (after color conversion and before scaling). The code
for the "sharpen5" scaler is reused for this.
The main reason MPlayer implemented this as scalers was perhaps because
FBOs were too expensive, and making it a scaler allowed implementing
this in 1 pass. But unsharp masking is not really a scaler, and I would
guess the result is more like combining bilinear scaling and unsharp
masking.
2 things are being stupid here: Apple for requiring rectangle textures
with their IOSurface interop for no reason, and OpenGL having a
different sampler type for rectangle textures.
The removal of source-shader is a side effect, since this effectively
replaces it - and the video-reading code has been significantly
restructured to make more sense and be more readable.
This means users no longer have to constantly download and maintain a
separate deband.glsl installation alongside mpv, which was the only real
use case for source-shader that we found either way.
This is mostly to cut down somewhat on the amount of code bloat in
video.c by moving out helper functions (including scaler kernels and
color management routines) to a separate file.
It would certainly be possible to move out more functions (e.g. dithering
or CMS code) with some extra effort/refactoring, but this is a start.
Signed-off-by: wm4 <wm4@nowhere>