RepoMirrors/mpv - mpv

Commit Graph

Author	SHA1	Message	Date
Niklas Haas	5b6b77b8dc	vo_gpu: vulkan: normalize use of Flags and FlagBits FlagBits is just the name of the enum. The actual data type representing a combination of these flags follows the Flags convention. (The relevant difference is that the latter is defined to be uint32_t instead of left implicit) For consistency, use Flags everywhere instead of randomly switching between Flags and FlagBits. Also fix a wrong type name on `stageFlags`, pointed out by @atomnuker	2017-09-27 00:25:18 +02:00
Niklas Haas	0ba6c7d73f	vo_gpu: vulkan: optimize redundant pipeline barriers Using renderpass layout transitions is more optimal and doesn't require a redundant pipeline barrier. Since our render passes are static and don't change throughout the lifetime of a ra_renderpass, we unfortunately don't have much flexibility here - so just hard-code SHADER_READ_ONLY_OPTIMAL as the output format as this will be the most common case. We also can't short-circuit the transition when we need to preserve the framebuffer contents, since that depends on the current layout; so we still use an explicit tex_barrier in this case. (Most optimal for this scenario would be an input attachment anyway)	2017-09-26 23:50:01 +02:00
wm4	dc5fcced23	vd_lavc: cuda requires setting hw_device_ctx This restores cuda/cuvid under Windows. Cuvid is relatively useless under Windows, but this was requested.	2017-09-26 23:20:20 +02:00
wm4	9b60398f4e	video: remove old videotoolbox support Like as in previous commits, you need a very recent FFmpeg (probably git master).	2017-09-26 19:13:26 +02:00
wm4	ae7db6503b	video: drop old D3D11/DXVA2 support Now you need FFmpeg git, or something. This also gets rid of the last real use of gpu_memcpy(). libavutil does that itself. (vaapi.c still used it, but it was essentially unused, because the code path isn't really in use anymore. It wasn't even included due to the d3d-hwaccel dependency in wscript.)	2017-09-26 18:58:45 +02:00
wm4	5d57e9b1be	video: drop old cuda/cuvid hwaccel Just use FFmpeg 3.3 (or whatever it was) to get Cuvid support.	2017-09-26 18:00:46 +02:00
Niklas Haas	e569050fe4	vo_gpu: fix memleak in spirv.c	2017-09-26 17:32:36 +02:00
Niklas Haas	a4e951e80c	vo_gpu: explicitly label storage image formats This is apparently required to get storage images working on windows/vulkan, and probably good practice either way. Not entirely sure if it's the best idea to be always storing the value as 32-bit float, but it should hardly matter in practice (since we're only writing one sample per thread). (Leaving them implicit requires the shaderStorageImageWriteWithoutFormat feature to be enabled, which the windows nvidia vulkan driver doesn't support, at least not for a GTX 670)	2017-09-26 17:25:46 +02:00
Niklas Haas	47af509e1f	vo_gpu: attempt to avoid UBOs for dynamic variables This makes the radeon driver shut up about frequently updating STATIC_DRAW UBOs (--opengl-debug), and also reduces the amount of synchronization necessary for vulkan uniform buffers. Also add some extra debugging/tracing code paths. I went with a flags-based approach in case we ever want to extend this.	2017-09-26 17:25:35 +02:00
Niklas Haas	ca85a153b4	vo_gpu: vulkan: add support for push constants Can in theory avoid updating the uniform buffer every frame	2017-09-26 17:25:35 +02:00
Rostislav Pehlivanov	ed345ffc2f	vo_gpu: vulkan: add support for wayland	2017-09-26 17:25:35 +02:00
Niklas Haas	258487370f	vo_gpu: vulkan: generalize SPIR-V compiler In addition to the built-in nvidia compiler, we now also support a backend based on libshaderc. shaderc is sort of like glslang except it has a C API and is available as a dynamic library. The generated SPIR-V is now cached alongside the VkPipeline in the cached_program. We use a special cache header to ensure validity of this cache before passing it blindly to the vulkan implementation, since passing invalid SPIR-V can cause all sorts of nasty things. It's also designed to self-invalidate if the compiler gets better, by offering a catch-all `int compiler_version` that implementations can use as a cache invalidation marker.	2017-09-26 17:25:35 +02:00
Niklas Haas	91f23c7067	vo_gpu: vulkan: initial implementation This time based on ra/vo_gpu. 2017 is the year of the vulkan desktop! Current problems / limitations / improvement opportunities: 1. The swapchain/flipping code violates the vulkan spec, by assuming that the presentation queue will be bounded (in cases where rendering is significantly faster than vsync). But apparently, there's simply no better way to do this right now, to the point where even the stupid cube.c examples from LunarG etc. do it wrong. (cf. https://github.com/KhronosGroup/Vulkan-Docs/issues/370) 2. The memory allocator could be improved. (This is a universal constant) 3. Could explore using push descriptors instead of descriptor sets, especially since we expect to switch descriptors semi-often for some passes (like interpolation). Probably won't make a difference, but the synchronization overhead might be a factor. Who knows. 4. Parallelism across frames / async transfer is not well-defined, we either need to use a better semaphore / command buffer strategy or a resource pooling layer to safely handle cross-frame parallelism. (That said, I gave resource pooling a try and was not happy with the result at all - so I'm still exploring the semaphore strategy) 5. We aggressively use pipeline barriers where events would offer a much more fine-grained synchronization mechanism. As a result of this, we might be suffering from GPU bubbles due to too-short dependencies on objects. (That said, I'm also exploring the use of semaphores as a an ordering tactic which would allow cross-frame time slicing in theory) Some minor changes to the vo_gpu and infrastructure, but nothing consequential. NOTE: For safety, all use of asynchronous commands / multiple command pools is currently disabled completely. There are some left-over relics of this in the code (e.g. the distinction between dev_poll and pool_poll), but that is kept in place mostly because this will be re-extended in the future (vulkan rev 2). The queue count is also currently capped to 1, because of the lack of cross-frame semaphores means we need the implicit synchronization from the same-queue semantics to guarantee a correct result.	2017-09-26 17:25:35 +02:00
Niklas Haas	c82022f349	vo_opengl_cb: fix deprecated option usage opengl-debug was renamed to gpu-debug	2017-09-26 17:24:39 +02:00
Niklas Haas	89cdccfa6c	vo_gpu: fix possible segfault on shader miscompile Iterations after the first time will fail to realize that the pass was never created. This function's logic and control flow is so annoying...	2017-09-23 16:36:58 +02:00
James Ross-Gowan	3d119a0e41	vo_gpu: angle: fix misleading struct name This should have been renamed when it stopped being empty.	2017-09-23 18:33:33 +10:00
Niklas Haas	b0ba193b66	vo_gpu: handle texture initialization errors gracefully Tested by making the ra_tex_resize function always fail (apart from the initial FBO check). This required a few changes: 1. reset shaders on failed dispatch 2. reset cleanup binds on failed dispatch 3. fall back to initializing the struct image to 1x1 on failure 4. handle output_fbo_valid gracefully	2017-09-23 09:58:27 +02:00
Niklas Haas	f3ec494613	vo_gpu: reduce the --alpha=blend-tiles checkerboard intensity This was sort of grating by default and made it really hard to actually read e.g. text on top of a transparent background. I decided to approach the problem from both directions, making the whites darker and the grays lighter. This brings it closer to the dynamic range of e.g. the wikipedia transparent svg preview.	2017-09-22 21:14:27 +02:00
Niklas Haas	e3288c4597	vo_gpu: simplify compute shader coordinate calculation Since the removal of FBOTEX_FUZZY, this can be made slightly simpler.	2017-09-22 17:22:53 +02:00
Niklas Haas	62ddc85d17	vo_gpu: simplify structs / names Due to the plethora of historical baggage from different eras getting confusing, I decided to simplify and unify the struct organization and naming scheme. Structs that got renamed: 1. fbodst -> ra_fbo (and moved to gpu/context.h) 2. fbotex -> removed (redundant after `2af2fa7a`) 3. fbosurface -> surface 4. img_tex -> image In addition to these structs being renamed, all of the names have been made consistent. The new scheme is as follows: struct image img; struct ra_tex *tex; struct ra_fbo fbo; This also affects derived names, e.g. indirect_fbo -> indirect_tex. Notably also, finish_pass_fbo -> finish_pass_tex and finish_pass_direct -> finish_pass_fbo. The new equivalent of fbotex_change() is called ra_tex_resize(). This commit (should) contain no logic changes, just renaming a bunch of crap.	2017-09-22 16:58:55 +02:00
Niklas Haas	2af2fa7a27	vo_gpu: kill off FBOTEX_FUZZY I've observed the garbage pixels in more scenarios. They also were never really needed to begin with, originally being a discovered work-around for bug that we fixed since then anyway. Doesn't really seem to even help resizing, since the OpenGL drivers are all smart enough to pool resources internally anyway. Fixes #1814	2017-09-22 16:33:25 +02:00
James Ross-Gowan	fab0448c5e	Revert "cocoa: re-enable double buffering" Enabling double buffering fixed some graphical glitches when entering fullscreen, but it also caused a fullscreen performance regression. We decided that the glitches were preferable to the performance regression. This reverts commit `cee764849e`.	2017-09-22 23:08:46 +10:00
James Ross-Gowan	1d5620a658	vo_gpu: override ra_swapchain_fns for the d3d11 surface ANGLE can take advantage of some of these when using the external swapchain-backed surface.	2017-09-22 14:22:14 +02:00
Niklas Haas	aabe12b0bc	vo_gpu: opengl: fix possible screenshot window crash gl_read_fbo_contents can fail Fixes #4905	2017-09-22 14:20:22 +02:00
Niklas Haas	d325f30fb5	vo_opengl_cb: fix segfault on uninit The code used ra_ctx_destroy even though ra_ctx_create was never called (since it's just a dummy ctx), which led to a conflict of assumptions. The proper fix is to only use ra_gl_ctx_uninit (mirroring the ra_gl_ctx_init) and free the dummy ctx manually. Fixes https://github.com/cmdrkotori/mpc-qt/issues/129	2017-09-22 14:20:11 +02:00
wm4	fba927de41	options: properly handle deprecated options with CLI actions We want e.g. --opengl-shaders-append=foo to resolve to the new option, all while printing an option name. --opengl-shader is a similar case. These options are special, because they apply "actions" on actual options by specifying a suffix. So the alias/deprecation handling has to be part of resolving the actual option from prefix and suffix.	2017-09-22 11:31:03 +02:00
wm4	baffe6bcbc	vo_gpu: fix autoprobing message	2017-09-22 05:37:54 +02:00
wm4	2b5da4804c	build: make vo_gpu + infrastructure non-optional Also readd the the error message for when no GL backends are found (why was this removed?).	2017-09-22 05:35:26 +02:00
wm4	2b855739d5	vd_lavc: revert accidental vaapi changes revert Commit `bfa9b62858` accidentally reverted these (because my editor did not reload the file correctly). Fixes #4904.	2017-09-22 05:19:37 +02:00
Aman Gupta	6254b6d637	vo_opengl_cb: hwdec_ios: fix build [179/188] Compiling video/out/vo_lavc.c ../../video/out/opengl/hwdec_ios.m:135:9: warning: unused variable 'gl' [-Wunused-variable] GL gl = ra_gl_get(mapper->ra); ^ ../../video/out/opengl/hwdec_ios.m:247:48: warning: incompatible pointer to integer conversion passing 'CVOpenGLESTextureRef' (aka 'struct __CVBuffer ') to parameter of type 'GLuint' (aka 'unsigned int') [-Wint-conversion] p->gl_planes[i]); ^~~~~~~~~~~~~~~ ../../video/out/opengl/ra_gl.h:9:45: note: passing argument to parameter 'gl_texture' here GLuint gl_texture); ^ 2 warnings generated.	2017-09-22 05:03:52 +02:00
Niklas Haas	52789d6ca0	vo_gpu: fix vo=opengl legacy alias Turns out the option code apparently tries to directly talloc_free() the allocated strings, instead of going through a tactx wrapper or something. So we can't directly overwrite it. Do something else instead..	2017-09-21 16:13:52 +02:00
Niklas Haas	aefd7a90c9	vo_gpu: fix memleak in ra_gl_ctx The ctx->ra was never freed propely, nor was p->wrapped_fb. (TIL: MPV_LEAK_REPORT exists)	2017-09-21 15:51:47 +02:00
Niklas Haas	b940691784	vo_gpu: drop the RA_CAP_NESTED_ARRAY req from EWA compute Almost as fast as the old code, but more general. Notably, glslang doesn't support nested arrays. (cf. https://github.com/KhronosGroup/glslang/issues/1057) Also much cleaner code-wise, so I think I'll keep it even if glslang implements array_of_arrays.	2017-09-21 15:15:59 +02:00
Niklas Haas	03fee22c4d	wayland: allow vo_wayland_uninit(NULL)	2017-09-21 15:15:55 +02:00
Niklas Haas	28b2fa4b7e	vo_gpu: fix possible segfault in shader_cache.c If shader compilation fails in an unexpected way, it can end up calling renderpass_run on an invalid pass, since current_shader is never cleared.	2017-09-21 15:15:15 +02:00
Niklas Haas	db0fb3c48b	vo_gpu: fix gamma scale This never really made sense since the BT.1886 changes. It should get brighter for bright rooms, not darker for dark rooms. Picked some new values that seemed reasonable-ish.	2017-09-21 15:01:26 +02:00
Niklas Haas	61f5c423be	vo_gpu: fix comment on ra_buf_type This hasn't been true for several iterations of this API.	2017-09-21 15:01:22 +02:00
Niklas Haas	e92effb14f	vo_gpu: describe the plane merging pass This can get left unknown if something hooks NATIVE	2017-09-21 15:01:22 +02:00
James Ross-Gowan	cee764849e	cocoa: re-enable double buffering This causes a performance regression on 10.11 and newer, but the single buffered method was broken and could cause partially rendered frames to be presented to the screen. This reverts `9f30cd8292` and `e543853a7f`.	2017-09-21 15:01:22 +02:00
James Ross-Gowan	75c0c06640	vo_gpu: convert windows/osx hwdecs/contexts to new API	2017-09-21 15:01:17 +02:00
Niklas Haas	65979986a9	vo_opengl: refactor into vo_gpu This is done in several steps: 1. refactor MPGLContext -> struct ra_ctx 2. move GL-specific stuff in vo_opengl into opengl/context.c 3. generalize context creation to support other APIs, and add --gpu-api 4. rename all of the --opengl- options that are no longer opengl-specific 5. move all of the stuff from opengl/* that isn't GL-specific into gpu/ (note: opengl/gl_utils.h became opengl/utils.h) 6. rename vo_opengl to vo_gpu 7. to handle window screenshots, the short-term approach was to just add it to ra_swchain_fns. Long term (and for vulkan) this has to be moved to ra itself (and vo_gpu altered to compensate), but this was a stop-gap measure to prevent this commit from getting too big 8. move ra->fns->flush to ra_gl_ctx instead 9. some other minor changes that I've probably already forgotten Note: This is one half of a major refactor, the other half of which is provided by rossy's following commit. This commit enables support for all linux platforms, while his version enables support for all non-linux platforms. Note 2: vo_opengl_cb.c also re-uses ra_gl_ctx so it benefits from the --opengl- options like --opengl-early-flush, --opengl-finish etc. Should be a strict superset of the old functionality. Disclaimer: Since I have no way of compiling mpv on all platforms, some of these ports were done blindly. Specifically, the blind ports included context_mali_fbdev.c and context_rpi.c. Since they're both based on egl_helpers, the port should have gone smoothly without any major changes required. But if somebody complains about a compile error on those platforms (assuming anybody actually uses them), you know where to complain.	2017-09-21 15:00:55 +02:00
wm4	bfa9b62858	build: add preliminary LGPL mode See "Copyright" file for caveats. This changes the remaining "almost LGPL" files to LGPL, because we think that the conditions the author set for these was finally fulfilled.	2017-09-21 13:56:27 +02:00
wm4	a4de782272	mp_image: don't guess colorspace params in mp_image_copy_attributes() This is "wrong", because you might want mp_image_copy_attributes() to preserve the information that the colorspace parameters are unknown. This is important for hwdec -copy modes, which call this function before fix_image_params() and mp_colorspace_merge() are called. Instead, just wipe the colorspace attributes if the pixel format changes in an apparently incompatible way. Use mp_image_params_guess_csp() logic for this and factor that into its own function. mp_image_set_attributes() attempts to do something similar, so change that in the same way. Also, mp_image_params_guess_csp() just returned if the imgfmt was invalid or unset - just remove that part, because it annoyingly doesn't fit into the new code, and had little reason to exist to begin with. (Probably.)	2017-09-19 19:20:27 +02:00
wm4	06927fefdd	mp_image: always copy pixel aspect ratio I see no reason not to do this. I think the check comes from the time when mp_image stored the image aspect ratio, instead of the pixel aspect ratio, where the logic might have made more sense.	2017-09-19 19:04:47 +02:00
wm4	810990a746	mp_image: always copy color attributes on hw download It was noticed that -copy hwdec modes typically dropped the chroma_location field. This happened because the attributes on hw download are copied with mp_image_copy_attributes(), which tries to copy these parameters only if src and dst were both YUV (in an attempt to copy parameters only if it makes sense). But hardware formats did not have the YUV flag set (anymore?), and code shouldn't attempt to check the flag in this way anyway. Drop the check, and always copy the whole color metadata struct. There is a call to mp_image_params_guess_csp() below, which tries to unset nonsense metadata if it was copied from a YUV format to RGB. This function would also do the right thing for hw formats (although for the cited bug only the software case matters). Fixes #4804.	2017-09-19 18:59:01 +02:00
wm4	b881881dbd	Revert "vd_lavc: change auto-probe order to prefer cuda over vdpau-copy" This reverts commit `96462040ec`. I guess the autoprobing is still too primitive to handle this well. What it really should be trying is initializing the wrapper decoder, and if that doesn't work, try another method. This is complicated by hwaccels initializing in a delayed way, so there is no easy solution yet. Probably fixes #4865.	2017-09-19 18:35:29 +02:00
Niklas Haas	2f41b834b3	vo_opengl: make the ra_renderpass names consistent The random space kept screwing me over	2017-09-13 20:53:19 +02:00
Niklas Haas	293c696ddb	vo_opengl: use GLX_MESA_swap_control where available This overrides the use of GLX_SGI_swap_control, because apparently GLX_SGI_swap_control doesn't support SwapInterval(0), but the GLX_MESA_swap_interval does. Of course, everybody except mesa just accepts SwapInterval(0) even for GLX_SGI_swap_control, but mesa needs to be the special snowflake here and reject it, forcing us to load their stupid named extension instead. Meanwhile khronos has done nothing except spit out GLX_EXT_swap_control (not to be confused with GL_EXT_swap_control, which is exported by WGL_EXT_swap_control), that doesn't fix the problem because mesa doesn't implement it anyway. What a fucking mess.	2017-09-13 20:53:17 +02:00
Niklas Haas	0bb67b1055	vo_opengl: always initialize uniforms on first use Even if the contents are entirely zero. In the current code, these entries were left uninitialized. (Which always worked for nvidia - but randomly blew up for AMD)	2017-09-12 03:00:47 +02:00
Niklas Haas	0c2cb69597	vo_opengl: generalize UBO packing/handling This is simultaneously generalized into two directions: 1. Support more sc_uniform types (needed for SC_UNIFORM_TYPE_PUSHC) 2. Support more flexible packing (needed for both PUSHC and ra_d3d11)	2017-09-12 02:57:45 +02:00

1 2 3 4 5 ...

3813 Commits