RepoMirrors/mpv - mpv

Commit Graph

Author	SHA1	Message	Date
wm4	4a6b04bdb9	vo_gpu: never pass flipped images to ra or ra backends Seems like the last refactor to this code broke playing flipped images, at least with --opengl-pbo --gpu-api=opengl. Flipping is rather a shitmess. The main problem is that OpenGL does not support flipped uploading. The original vo_gl implementation considered it important to handle the flipped case efficiently, so instead of uploading the image line by line backwards, it uploaded it flipped, and then flipped it in the renderer (basically the upload path ignored the flipping). The ra code and backends probably have an insane and inconsistent mix of semantics, so fix this by never passing it flipped images in the first place. In the future, the backends should probably support flipped images directly. Fixes #5097.	2017-11-10 10:06:33 +01:00
James Ross-Gowan	e7bf5576e5	vo_gpu: hwdec_d3d11va: allow zero-copy video decoding Like the manual says, this is technically undefined behaviour. See: https://msdn.microsoft.com/en-us/library/windows/desktop/ff476085.aspx In particular, MSDN says texture arrays created with the BIND_DECODER flag cannot be used with CreateShaderResourceView, which means they can't be sampled through SRVs like normal Direct3D textures. However, some programs (Google Chrome included) do this anyway for performance and power-usage reasons, and it appears to work with most drivers. Older AMD drivers had a "bug" with zero-copy decoding, but this appears to have been fixed. See #3255, #3464 and http://crbug.com/623029.	2017-11-07 20:27:13 +11:00
James Ross-Gowan	b258d82d6e	vo_gpu: d3d11: enhance cache invalidation The shader cache in ra_d3d11 caches the result of shaderc, crossc and the D3DCompiler DLL, so it should be invalidated when any of those components are updated. This should make the cache more reliable, which makes it safer to enable gpu-shader-cache-dir. Shader compilation is slow with D3D11, so gpu-shader-cache-dir is highly necessary	2017-11-07 20:27:13 +11:00
James Ross-Gowan	b9c1286893	vo_gpu: d3d11: log shader compilation times Some shaders take a _long_ time to compile with the Direct3D compiler. The ANGLE backend had this problem too, to a certain extent. Logging should help identify which shaders cause long stalls and could also help with benchmarking ways of reducing compile times.	2017-11-07 20:27:13 +11:00
James Ross-Gowan	4b014b3a81	vo_gpu: move d3d11_screenshot to shared code This can be used by the ANGLE backend and ra_d3d11.	2017-11-07 20:27:13 +11:00
James Ross-Gowan	9b2dae79b1	vo_gpu: d3d11: add RA caps for ra_d3d11 ra_d3d11 uses the SPIR-V compiler to translate GLSL to SPIR-V, which is then translated to HLSL. This means it always exposes the same GLSL version that the SPIR-V compiler supports (4.50 for shaderc/glslang.) Despite claiming to support GLSL 4.50, some features that are tied to the GLSL version in OpenGL are not supported by ra_d3d11 when targeting legacy Direct3D feature levels. This includes two features that mpv relies on: - Reading from gl_FragCoord in the fragment shader (requires FL 10_0) - textureGather from any texture component (requires FL 11_0) These features have been exposed as new RA caps.	2017-11-07 20:27:13 +11:00
James Ross-Gowan	68eac1a1e7	vo_gpu: d3d11: initial implementation This is a new RA/vo_gpu backend that uses Direct3D 11. The GLSL generated by vo_gpu is cross-compiled to HLSL with SPIRV-Cross. What works: - All of mpv's internal shaders should work, including compute shaders. - Some external shaders have been tested and work, including RAVU and adaptive-sharpen. - Non-dumb mode works, even on very old hardware. Most features work at feature level 9_3 and all features work at feature level 10_0. Some features also work at feature level 9_1 and 9_2, but without high-bit-depth FBOs, it's not very useful. (Hardware this old is probably not fast enough for advanced features anyway.) Note: This is more compatible than ANGLE, which requires 9_3 to work at all (GLES 2.0,) and 10_1 for non-dumb-mode (GLES 3.0.) - Hardware decoding with D3D11VA, including decoding of 10-bit formats without truncation to 8-bit. What doesn't work / can be improved: - PBO upload and direct rendering does not work yet. Direct rendering requires persistent-mapped PBOs because the decoder needs to be able to read data from images that have already been decoded and uploaded. Unfortunately, it seems like persistent-mapped PBOs are fundamentally incompatible with D3D11, which requires all resources to use driver-managed memory and requires memory to be unmapped (and hence pointers to be invalidated) when a resource is used in a draw or copy operation. However it might be possible to use D3D11's limited multithreading capabilities to emulate some features of PBOs, like asynchronous texture uploading. - The blit() and clear() operations don't have equivalents in the D3D11 API that handle all cases, so in most cases, they have to be emulated with a shader. This is currently done inside ra_d3d11, but ideally it would be done in generic code, so it can take advantage of mpv's shader generation utilities. - SPIRV-Cross is used through a NIH C-compatible wrapper library, since it does not expose a C interface itself. The library is available here: https://github.com/rossy/crossc - The D3D11 context could be made to support more modern DXGI features in future. For example, it should be possible to add support for high-bit-depth and HDR output with DXGI 1.5/1.6.	2017-11-07 20:27:13 +11:00
James Ross-Gowan	8020a62953	vo_gpu: export the GLSL format qualifier for ra_format Backported from @haasn's change to libplacebo, except in the current RA, there's nothing to indicate an ra_format can be bound as a storage image, so there's no way to force all of these formats to have a glsl_format. Instead, the layout qualifier will be removed if glsl_format is NULL. This is needed for the upcoming ra_d3d11 backend. In Direct3D 11, while loading float values from unorm images often works as expected, it's technically undefined behaviour, and in Windows 10, it will cause the debug layer to spam the log with error messages. Also, apparently in GLSL, the format name must match the image's format exactly (but in Direct3D, it just has to have the same component type.)	2017-11-07 20:27:13 +11:00
James Ross-Gowan	41dff03f8d	vo_gpu: add namespace query mechanism Backported from @haasn's change to libplacebo. More flexible than the previous "shared || non-shared" distinction. The extra flexibility is needed for Direct3D 11, but it also doesn't hurt code-wise.	2017-11-07 20:27:13 +11:00
wm4	793b43020c	vo_lavc: remove messy delayed subtitle rendering logic For some reason vo_lavc's draw_image can buffer the frame and encode it only later. Also, there is logic for rendering the OSD (i.e. subtitles) only when needed. In theory this can lead to subtitles being pruned before it tries to render them (as the subtitle logic doesn't know that the VO still needs them later), although this probably never happens in reality. The worse issue, that actually happened, is that if the last frame gets buffered, it attempts to render subtitles in the uninit callback. At this point, the subtitle decoder is already torn down and all subtitles removed, thus it will draw nothing. This didn't always happen. I'm not sure why - potentially in the working cases, the frame wasn't buffered. Since this logic doesn't have much worth, except a minor performance advantage if frames with subtitles are dropped, just remove it. Hopefully fixes #4689.	2017-11-07 05:29:26 +01:00
wm4	921073bf86	img_format: remove some guards against old ffmpeg API These are always present in ffmpeg-mpv, and never in Libav.	2017-11-06 17:14:01 +01:00
wm4	5261d1b099	vo_gpu: don't re-render hwdec frames when repeating frames Repeating frames (for display-sync) is not supposed to render the entire frame again. When using hardware decoding, it unfortunately did: the renderer uses the frame ID to check whether the frame data changed, and unmapping the hwdec frame clears it. Essentially reverts commit `761eeacf54`. Back then I probably thought it would be a good idea to release the hwdec image quickly in order to return it to the decoder, but they're referenced anyway. This should increase the performance and reduce GPU work.	2017-11-03 15:11:56 +01:00
wm4	99dd2f57f0	vo_gpu: ra_gl: fix minimum GLSL version to 120 Not sure why there was 110, or why there is even a default.	2017-11-03 11:53:31 +01:00
wm4	2abf20b2b2	vo_gpu: fix mobius tone mapping compatibility to GLSL 120 Normally such code is disabled by have_mglsl==false in check_gl_features(), but apparently not this one. Just fix it. Seems also more readable. Fixes #5069.	2017-11-03 11:53:17 +01:00
wm4	aac74b7c36	vo_gpu: ra_gl: fix crash trying to use glBindBufferBase on GL 2.1 Apparently this is required, but it doesn't check for it. To be fair, this was tested by creating a compatibility context and pretending it's GL 2.1. GL_ARB_shader_storage_buffer_object actually requires GL 4.0 or up, but GL_ARB_uniform_buffer_object requires only GL 2.0.	2017-11-03 11:39:15 +01:00
wm4	ff17760c00	vd_lavc: restore --hwdec-image-format and d3d11 opaque mode When the ifdeffery for the frame_params API was added, the new code accidentally didn't include this.	2017-11-02 15:58:36 +01:00
wm4	501230f2a0	vo_gpu: potentially fix icc-profile-auto updating vo_gpu.c will call gl_video_icc_auto_enabled() to check whether it should retrieve the ICC profile. But the value returned by this function will be outdated, because gl_video_update_options() is not called yet. Change the order of function calls so that this is done after updating the options. (This is fairly chaotic, but I guess this code will be refactored a dozen of times anyway in the future.)	2017-11-01 01:58:16 +01:00
wm4	95c6a482eb	vd_lavc: clean out more hwdec legacy code All this code used to be required by the old variants of the libavcodec hw decoding APIs. Almost all of that is gone, although the mediacodec API unfortunately still pulls in some old stuff (but not all of it). (mediacodec build/functionality is untested, but should work.)	2017-10-31 17:11:52 +01:00
wm4	f27a9aaa17	vd_lavc: remove more dead legacy code All of this was dead code and completely unused. get_buffer2_hwdec() is the biggest chunk. One unfortunate thing about it is that, while it was active, it could perform a software fallback much faster, because it didn't have to wait until a full frame is decoded (it actually decoded a full frame, but the current code has to decode many more frames due to the codec delay, because the current code waits until the API returns a decoded frame.) We should probably restore the latter, although since it's an optional optimization, and the current behavior doesn't change with the removal of this code, don't actually do anything about it.	2017-10-31 15:29:13 +01:00
wm4	7fd1359fcf	videotoolbox: use generic code for dummy hwdevice init And move the remaining code (just 2 struct constant definitions) to vd_lavc.c.	2017-10-31 15:20:09 +01:00
wm4	e0f42bdc5d	vd_lavc: remove dead legacy code	2017-10-31 14:18:35 +01:00
wm4	078a3ed996	d3d: remove some legacy code See #5062.	2017-10-31 14:14:45 +01:00
wm4	ff093e5b78	vd_lavc: make sure required headers are included early enough Should fix #5062.	2017-10-31 14:12:11 +01:00
wm4	a18a7cd4f5	vd_lavc: move display mastering data stuff to mp_image This is where it should be. It only wasn't because of an old libavcodec bug, that returned the side data only on every IDR. This required some sort of caching, which is now dropped. (mp_image wouldn't have been able to do this kind of caching, because this code is stateless.) We don't support these old libavcodec versions anymore, which is why this is not needed anymore. Also move initialization of rotation/stereo stuff to dec_video.c.	2017-10-30 21:07:48 +01:00
wm4	a7f4ecb012	Bump libav* API use (Not tested on Windows and OSX.)	2017-10-30 20:55:42 +01:00
wm4	1c46bd5e50	vo_gpu: remove a redundant ifdef	2017-10-30 18:35:33 +01:00
wm4	58d83a9309	vd_lavc: make --hwdec=nvdec-copy actually work This simply didn't work. Unlike cuda-copy, this is a true hwaccel, and obviously we need to provide it a device. Implement this in a relatively generic way, which can probably be reused directly by videotoolbox (not doing this yet because it would require testing on OSX). Like with cuda-copy, --cuda-decode-device is ignored. We might be able to provide a more general way to select devices at some later point.	2017-10-30 18:34:49 +01:00
wm4	b7ce3ac445	vd_lavc: remove need for duplicated cuda GL interop backend This is just a dumb consequence of HWDEC_ types somehow being part of both decoder and VO. Obviously, the VO should only care about supporting specific hardware surface types or providing specific device types, but until they are separated, stupid unintuitive mismatches will occur.	2017-10-30 18:31:20 +01:00
wm4	d6ebb2df47	Get rid of deprecated AVFrame accessors First we were required to use them for ABI compat. reasons (and other BS), now they're deprecated and we're supposed to access them directly again.	2017-10-30 13:36:44 +01:00
Ryo Munakata	046fe45950	hwdec_drmprime_drm: fix segv with --hwdec	2017-10-30 12:46:49 +01:00
wm4	6b745769b1	vd_lavc: add support for nvdec hwaccel See manpage additions. (In ffmpeg-mpv and Libav, this is still called "cuvid". Libav won't work yet, because it has no frame params support yet, but this could get fixed soon.)	2017-10-28 19:59:08 +02:00
wm4	f36d152eb7	vd_lavc: use avcodec_fill_hw_frames_parameters() API This removes the need for codec- and API-specific knowledge in the libavcodec hardware acceleration API user. For mpv, this removes the need for vd_lavc_hwdec.pixfmt_map and a few other things. (For now, we still keep the "old" parts for the sake of supporting older Libav, and FFgarbage.)	2017-10-27 18:08:20 +02:00
Niklas Haas	4701c5ba4f	vo_gpu: fix ra_tex_upload_pbo for 2D textures params->rc was ignored in the calculation for the buffer size. I fucking hate this stupid ra_tex_upload signature where *rc is randomly relevant or not.	2017-10-27 16:56:23 +02:00
wm4	d6f33e0b0d	vo_gpu: osd: simplify some code Coverity complains about this, but it's probably a false positive. Anyway, rewrite it in a slightly more readable way. Now it's more obvious that it is correct.	2017-10-27 14:19:57 +02:00
wm4	22fa498bf9	vd_lavc: more aggressive frame dropping for intra only codecs Should speed up seeks. (Unfortunately it's useless for backstepping. Backstepping is like precise seeking, except we're unable to drop frames, as we can't know the previous frame if we drop it.)	2017-10-26 19:44:26 +02:00
Niklas Haas	c2d4fd0ef4	vo_gpu: change --tone-mapping-desaturate algorithm Comparing mpv's implementation against the ACES ODR reference samples and algorithms, it seems like they're happy desaturating highlights _way_ more aggressively than mpv currently does. And indeed, looking at some example clips like The Redwoods (which is actually well-mastered), the current desaturation produces unnatural-looking brightness fringes where the sky meets the treeline. Adjust the algorithm to make it apply to a much larger, more gradual brightness region; and change the interpretation of the parameter. As a bonus, the new parameter is actually sanely scaled (higher values = more desaturation). Also, make it scale based on the signal level instead of the luminance, to avoid under-desaturating bright blues.	2017-10-25 17:24:27 +02:00
wm4	a5b51f75dc	demux: get rid of demux_packet.new_segment field The new_segment field was used to track the decoder data flow handler of timeline boundaries, which are used for ordered chapters etc. (anything that sets demuxer_desc.load_timeline). This broke seeking with the demuxer cache enabled. The demuxer is expected to set the new_segment field after every seek or segment boundary switch, so the cached packets basically contained incorrect values for this, and the decoders were not initialized correctly. Fix this by getting rid of the flag completely. Let the decoders instead compare the segment information by content, which is hopefully enough. (In theory, two segments with same information could perhaps appear in broken-ish corner cases, or in an attempt to simulate looping, and such. I preferred the simple solution over others, such as generating unique and stable segment IDs.) We still add a "segmented" field to make it explicit whether segments are used, instead of doing something silly like testing arbitrary other segment fields for validity. Cached seeking with timeline stuff is still slightly broken even with this commit: the seek logic is not aware of the overlap that segments can have, and the timestamp clamping that needs to be performed in theory to account for the fact that a packet might contain a frame that is always clipped off by segment handling. This can be fixed later.	2017-10-24 19:35:55 +02:00
Lionel CHAZALLON	1992bb5151	video : Move drm options to substruct. This allows to group them and most of all query the group config when needed and when we don't have the access to vo.	2017-10-23 21:08:20 +02:00
Lionel CHAZALLON	cfcee4cfe7	Add DRM_PRIME Format Handling and Display for RockChip MPP decoders This commit allows to use the AV_PIX_FMT_DRM_PRIME newly introduced format in ffmpeg that allows decoders to provide an AVDRMFrameDescriptor struct. That struct holds dmabuf fds and information allowing zerocopy rendering using KMS / DRM Atomic. This has been tested on RockChip ROCK64 device.	2017-10-23 21:07:24 +02:00
Lionel CHAZALLON	762b8cc300	video : allow drm primary plane to be transparent for egl context We want primary plane to be on top of overlay (video), so we need it to be 32 bits.	2017-10-23 21:06:53 +02:00
Mark Thompson	26b46950a1	vo_opengl: hwdec_vaegl: Disable vaExportSurfaceHandle() libva 2.0 (VAAPI 1.0.0) was released without it, but it is scheduled to be included in libva 2.1.	2017-10-23 11:58:13 +02:00
Rostislav Pehlivanov	f8aeda0da9	wayland_common: check monitor scale Since we divide by it in a couple of places and compositors can be crazy, it's better to be safe than sorry. Also checks cursor spawn during init (pointless since it does again on cursor entry but it's more correct).	2017-10-22 06:49:35 +01:00
Rostislav Pehlivanov	78ef7fb766	wayland_common: improve cursor code and scale cursor properly It seems the cursor hadn't had its position properly adjusted when scaled. Hence, bring back correct buffer scaling to make the cursor look fine. Also the cursor surface now gets created sooner so that's better.	2017-10-22 05:53:20 +01:00
Rostislav Pehlivanov	13fb166d87	wayland_common: don't scale the cursor wl_buffer Only gnome does something as stupid as always applying scaling to the cursor rather than just using a larger sized one with HIDPI.	2017-10-19 21:35:20 +01:00
wm4	d3c022779a	video: fix alpha handling Regression since `ec6e8a31e0`. Removal of the explicit else case always applies the conversion to premultiplied alpha in the else branch. We want to scale with multiplied alpha, but we don't want to multiply with alpha again on top of it. Fixes #4983, hopefully.	2017-10-19 19:01:33 +02:00
James Ross-Gowan	d9e3bad500	vo_gpu: add rgba16hf to the list of FBO formats This should be functionally identical to rgba16f, since the formats only differ in their representation on the CPU, but it could be useful for RA backends that don't expose rgba16f, like Vulkan. It's definitely useful for the WIP D3D11 backend.	2017-10-18 23:55:13 +11:00
wm4	747892209f	vo_rpi: fix build (probably) Untested. If it works, fixes #4919.	2017-10-17 09:28:00 +02:00
wm4	77945b2c16	vo_gpu: remove weird p->vo indirection That's just unnecessary.	2017-10-17 09:09:00 +02:00
wm4	c90f76d322	vo_gpu: fix video sometimes not being rerendered on equalizer change With video paused, changing the brightness controls (or similar) would sometimes not rerender the video frame. So the OSD would redraw, but the video wouldn't change. This is caused by output caching, and a redraw request is free to return the cached frame. Change it to invalidate the cached frame if any of the options or the equalizer change. In theory, gl_video_reset_surfaces() could be called if the equalizer changes - this would apparently force interpolation to redraw all frames. But this looks kind of crappy when changing the equalizer during playback. It'll "eventually" use the correct settings anyway, and when paused interpolation is off.	2017-10-17 09:07:35 +02:00
wm4	b299a3f546	vdpau: remove some more dead code	2017-10-16 18:19:19 +02:00

3921 Commits