A VRAM leak was present in d3d11 when `idle=yes` was set and playback
stopped for an item. This patch fixes the issue by re-enabling some
code that was previously only run for diagnostics.
This lets us remap various messages, which might now occur on every
frame, to the trace log level, leaving the default debug log level
unaffected.
Additionally, thanks to this ability, the previously globally denied
message queue abandonment messages can now be handled and mapped to
the trace log level, where they may be of use.
Recommended by rossy and based on his libplacebo commit
6d72f6445566eddb0493447d0bda72d98a99d40c.
Instead of always holding a reference to the backbuffer outside of
resize calls, request a backbuffer at the start of a frame and release
it when the frame is submitted for presentation.
Query the swap chain's description, which should in theory contain
the format of the backbuffer. Then use a newly added ra_d3d11
function to map that format to an ra_format. After that, use the
depth of the format's first plane, as before.
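
A minimal sketch of the idea; the mapping helper's name here is
illustrative, not necessarily the exact one added:

    DXGI_SWAP_CHAIN_DESC desc;
    IDXGISwapChain_GetDesc(p->swapchain, &desc); // error handling omitted
    const struct ra_format *fmt =
        ra_d3d11_get_ra_format(ra, desc.BufferDesc.Format); // hypothetical
    if (fmt)
        depth = fmt->component_depth[0]; // depth of the first plane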
We don't need to hold on to buffers longer than necessary. Doesn't
matter for vo_gpu but greatly matters for vo_gpu_next, since it persists
hwdec mapped textures for longer periods.
Unfortunately, this only provides benefits for hwdecs that do explicit
copies in their decode path, which currently means just cuda and
d3d11va.
vo_gpu_next and libplacebo expect swapchain images to be able to be
blitted to, which for libplacebo on FL11_0 and up means they have to
have DXGI_USAGE_UNORDERED_ACCESS, since libplacebo uses a compute shader
to emulate certain kinds of blits. For libplacebo's benefit, set all
applicable usage flags on swapchain images.
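
A sketch of the relevant part of swapchain creation (other fields
omitted):

    DXGI_SWAP_CHAIN_DESC1 desc = {
        // ...
        // Request every usage the renderer might need: sampling,
        // render target output, and UAV access for compute shader
        // based blits.
        .BufferUsage = DXGI_USAGE_RENDER_TARGET_OUTPUT |
                       DXGI_USAGE_SHADER_INPUT |
                       DXGI_USAGE_UNORDERED_ACCESS,
    };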
We've been assuming that the maximum number of compute group threads is
never less than the 1024 defined by the desktop GL spec. Given that we
haven't had working compute shaders for GLES and I guess the Vulkan
spec defines at least as high a value, we've gotten away with it so
far.
But we should really look the value up and respect it.
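
A sketch of the lookup, written against mpv's GL function table (the
surrounding code is assumed):

    // Total allowed invocations per work group; desktop GL guarantees
    // at least 1024, but GLES 3.1 only guarantees 128.
    GLint max_wg_invocations = 0;
    gl->GetIntegerv(GL_MAX_COMPUTE_WORK_GROUP_INVOCATIONS,
                    &max_wg_invocations);
    // Clamp any generated compute shader's local size to this limit.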
Today, validation is only possible for string type options. But there's
no particular reason why it needs to be restricted in this way, and
there are potential uses for allowing other option types to be
validated without forcing each option to reimplement parsing from
scratch.
The first part, making the validation function an explicit field
instead of overloading priv, is simple enough. But if we only do
that, then the validation function still needs to deal with the raw
pre-parsed string. Instead, we want to allow the value to be parsed
before it is validated. That in turn leads to us having validator
functions that should be type aware. Unfortunately, that means we need
to keep the explicit macro like OPT_STRING_VALIDATE() as a way to
enforce the correct typing of the function. Otherwise, we'd have to
have the validator take a void * and hope the implementation can cast
it correctly.
For help, we don't have this problem, as help doesn't look at the
value.
Then, we turn validators that are really help generators into explicit
help functions and where a validator is help + validation, we split
them into two parts.
I have, however, left functions that need to query information for both
help and validation as single functions to avoid code duplication.
In this change, I have not added any other OPT_FOO_VALIDATE() macros,
as they are not needed, but I will add some in a separate change to
illustrate the pattern.
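
As a sketch of the pattern (names here are illustrative; the exact
field and macro names may differ), a typed string validator looks
something like this:

    typedef int (*m_opt_string_validate_fn)(struct mp_log *log,
                                            const struct m_option *opt,
                                            struct bstr name,
                                            const char **value);

    static int validate_profile(struct mp_log *log,
                                const struct m_option *opt,
                                struct bstr name, const char **value)
    {
        // The value has already been parsed as a string; the validator
        // only has to check it, not reimplement parsing.
        if (!*value || !(*value)[0]) {
            mp_err(log, "Option %.*s must not be empty.\n", BSTR_P(name));
            return M_OPT_INVALID;
        }
        return 0;
    }

The OPT_STRING_VALIDATE() macro then exists only to check at compile
time that the function passed in really has this signature before it
is stored in the generic validator field.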
The current implementation of presentation feedback was designed to be
used with flip model presentation. With the bitblt model,
GetFrameStatistics returns totally different values and it's not clear
if we can use them at all. Previously, this wasn't a problem because
with the bitblt model, GetFrameStatistics only worked in exclusive
fullscreen. Now that mpv supports exclusive fullscreen, we should
explicitly check for a flip model swapchain before using presentation
feedback.
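
A sketch of such a check (error handling omitted):

    DXGI_SWAP_CHAIN_DESC desc;
    if (SUCCEEDED(IDXGISwapChain_GetDesc(p->swapchain, &desc)) &&
        (desc.SwapEffect == DXGI_SWAP_EFFECT_FLIP_SEQUENTIAL ||
         desc.SwapEffect == DXGI_SWAP_EFFECT_FLIP_DISCARD))
    {
        // Flip model: GetFrameStatistics values are meaningful, so
        // presentation feedback can be used.
    }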
Change all OPT_* macros such that they don't define the entire m_option
initializer, and instead expand only to a part of it, which sets certain
fields. This requires changing almost every option declaration, because
they all use these macros. A declaration now always starts with
{"name", ...
followed by designated initializers only (possibly wrapped in macros).
The OPT_* macros now initialize the .offset and .type fields only,
sometimes also .priv and others.
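
For example (illustrative):

    // before: the macro produced the whole m_option initializer
    OPT_FLAG("fullscreen", fullscreen, 0),

    // after: the macro only sets .type/.offset inside the initializer
    {"fullscreen", OPT_FLAG(fullscreen)},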
I think this change makes the option macros less tricky. The old code
had to stuff everything into macro arguments (and attempted to allow
setting arbitrary fields by letting the user pass designated
initializers in the vararg parts). Some of this was made messy due to
C99 and C11 not allowing 0-sized varargs with ',' removal. It's also
possible that this change is pointless, other than cosmetic preferences.
Not too happy about some things. For example, the OPT_CHOICE()
indentation I applied looks a bit ugly.
Much of this change was done with regex search&replace, but some places
required manual editing. In particular, code in "obscure" areas (which I
didn't include in compilation) might be broken now.
In wayland_common.c, the author of some option declarations confused
the flags parameter with the default value (though the default value
was also properly set below). This change fixes that as well.
By default, utilizes the color space of the desktop on which the
swap chain is located. If a specific value is defined, it will be
utilized instead.
Enables configuration of the PQ color space (BT.2020 primaries,
PQ transfer function) for HDR.
Additionally, signals the swap chain color space to the renderer,
so that the rendered output looks correct without having to specify
target-trc or target-prim manually.
Since all of the APIs involved are Windows 10 only, this only works
starting with Windows 10.
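
A sketch of the DXGI side of this (feature checks and error handling
omitted):

    // Ask for the PQ color space on the swap chain, if supported.
    DXGI_COLOR_SPACE_TYPE csp = DXGI_COLOR_SPACE_RGB_FULL_G2084_NONE_P2020;
    UINT support = 0;
    IDXGISwapChain3_CheckColorSpaceSupport(swapchain3, csp, &support);
    if (support & DXGI_SWAP_CHAIN_COLOR_SPACE_SUPPORT_FLAG_PRESENT)
        IDXGISwapChain3_SetColorSpace1(swapchain3, csp);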
If expected_sync_pc is greater than submit_count, the unsigned
subtraction will wrap around, which breaks playback. This bug was found
while experimenting with bit-blt model present, but it might be possible
to trigger it with the flip model as well, if there was a dropped frame.
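
A sketch of the guard (variable names as in the description above):

    // Avoid unsigned wraparound when the sync point is ahead of the
    // submit counter.
    uint64_t queued = 0;
    if (submit_count >= expected_sync_pc)
        queued = submit_count - expected_sync_pc;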
Query information on the system output most closely linked to the swap
chain, and utilize either a user-configured format, or 8-bit RGBA or
10-bit RGB with 2-bit alpha, depending on the system output's bit
depth.
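
Sketched with DXGI 1.6 (IDXGIOutput6 is a Windows 10 interface; error
handling omitted):

    DXGI_OUTPUT_DESC1 desc;
    DXGI_FORMAT fmt = DXGI_FORMAT_R8G8B8A8_UNORM; // 8-bit default
    if (SUCCEEDED(IDXGIOutput6_GetDesc1(output6, &desc)) &&
        desc.BitsPerColor > 8)
        fmt = DXGI_FORMAT_R10G10B10A2_UNORM; // 10-bit RGB, 2-bit alpha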
This affects hwdec_dxva2dxgi, which uses ra_d3d11_wrap_tex to wrap RGB
video frames that are shared with a D3D9 device. Without it, mpv uses
nearest instead of bilinear scaling with --scale=bilinear (the default)
and --hwdec=dxva2. It's kind of hard to believe this bug has gone
unnoticed for almost two years, but that seems to have been the case.
Fixes: #7042
I think I was wrong about having to reset the stats when mpv stops
producing frames, e.g. when it's paused. As long as the swapchain doesn't
underflow, last_queue_display_time will still be accurate, because the
next frame produced should still be presented one vsync after the
last one in the swapchain.
If the swapchain underflows (which is the common case when mpv is
paused for more than 150 ms), the next predicted frame time should be in
the past. It should be fine to leave last_queue_display_time unset in
this case, since vo.c will use the current time instead, which is a
decent guess (though it doesn't take vsync phase into account).
last_sync_refresh_count and last_sync_qpc_time should be kept on
swapchain underflow as well. Assuming the display refresh rate doesn't
change while mpv is paused, they'll only provide a more accurate guess
of the vsync duration when mpv starts playing again. Also,
vsync_duration_qpc never needs to get reset. It will get overwritten
immediately in most cases, and when it doesn't, it's still a better
guess of the vsync duration than nothing.
This adds vsync reporting to the D3D11 backend using the presentation
feedback provided by DXGI, which is pretty similar to what's provided by
GLX_OML_sync_control in the GLX backend. In DirectX, PresentCount is the
SBC, PresentRefreshCount and SyncRefreshCount are kind of like the MSC
and SyncQPCTime is the UST.
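
The calls involved look roughly like this (COBJMACROS style, error
handling omitted):

    UINT present_count = 0;
    IDXGISwapChain_GetLastPresentCount(p->swapchain, &present_count);

    DXGI_FRAME_STATISTICS stats;
    IDXGISwapChain_GetFrameStatistics(p->swapchain, &stats);
    // stats.PresentCount pairs with stats.PresentRefreshCount (the
    // vsync a Present() appeared on); stats.SyncRefreshCount and
    // stats.SyncQPCTime describe a recent physical vsync.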
Unlike GLX, the DXGI API makes it possible for PresentCount and
SyncQPCTime to refer to different physical vsyncs, in which case
PresentRefreshCount and SyncRefreshCount will be different. The code
supports this possibility, even though it's not clear whether it can
happen when using flip-model presentation. The docs say for flip-model
apps, PresentRefreshCount is equal to SyncRefreshCount "when the app
presents on every vsync," but on my hardware, they're always equal, even
when mpv misses a vsync. They can definitely be different in exclusive
fullscreen bitblt mode, though, which mpv doesn't support now, but might
support in future.
Another difference to GLX is that, at least on my hardware,
PresentRefreshCount and SyncRefreshCount always refer to the last
physical vsync on which mpv presented a frame, but glXGetSyncValues can
apparently return a MSC and UST from the most recent physical vsync,
even if mpv didn't present a new frame on it. This might result in
different behaviour between the two backends after dropped frames or
brief pauses.
Also note, the docs for the DXGI presentation feedback APIs are pretty
awful, even by Microsoft standards. In particular the docs for
DXGI_FRAME_STATISTICS are misleading (PresentCount really is the number
of times Present() has been called for that frame, not "the running
total count of times that an image was presented to the monitor since
the computer booted.")
For good documentation, try these:
https://docs.microsoft.com/en-us/windows/win32/direct3ddxgi/dxgi-flip-model
https://docs.microsoft.com/en-us/windows/win32/direct3d9/d3dpresentstats
https://docs.microsoft.com/en-us/windows-hardware/drivers/ddi/content/d3dkmthk/ns-d3dkmthk-_d3dkmt_present_stats
(Yeah, the docs for the D3D9Ex and even the kernel-mode version of this
structure are better than the DXGI ones. It seems possible that they're
all rewordings of the same internal Microsoft docs, but whoever wrote
the DXGI one didn't really understand it.)
Somehow I got the idea that compound literals had function-scoped
lifetime. Instead, like all other objects with automatic storage
duration, compound literals are block-scoped, so they become invalid
after exiting the block they were declared in. It seems like a recent
change to GCC actually reuses the memory that the compound literals
used to occupy, which was causing a few bugs.
The pattern of conditionally assigning a pointer to a compound literal
was used in a few places in ra_d3d11 where the Direct3D API expects
either a pointer to an initialised struct or NULL. Change these to
ensure the lifetime of the struct includes the API call.
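
A sketch of the buggy pattern and the fix:

    // bug: the literal's lifetime ends with the if block, so the
    // pointer dangles by the time the API call is made
    const D3D11_RENDER_TARGET_VIEW_DESC *rtv_p = NULL;
    if (msaa) {
        rtv_p = &(D3D11_RENDER_TARGET_VIEW_DESC) {
            .Format = fmt,
            .ViewDimension = D3D11_RTV_DIMENSION_TEXTURE2DMS,
        };
    }
    ID3D11Device_CreateRenderTargetView(dev, res, rtv_p, &rtv);

    // fix: give the struct a lifetime that includes the call
    D3D11_RENDER_TARGET_VIEW_DESC desc;
    const D3D11_RENDER_TARGET_VIEW_DESC *rtv_p = NULL;
    if (msaa) {
        desc = (D3D11_RENDER_TARGET_VIEW_DESC) {
            .Format = fmt,
            .ViewDimension = D3D11_RTV_DIMENSION_TEXTURE2DMS,
        };
        rtv_p = &desc;
    }
    ID3D11Device_CreateRenderTargetView(dev, res, rtv_p, &rtv);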
Should fix #6775.
While `ra` supports the concept of a texture as a storage
destination, it does not support the concept of a texture format
being usable for a storage texture. This can lead to us attempting
to create a texture from an incompatible format, with undefined
results.
So, let's introduce an explicit format flag for storage and use
it. In `ra_pl` we can simply reflect the `storable` flag. For
GL and D3D, we'll need to write some new code to do the compatibility
checks. I'm not going to do it here because it's not a regression;
we were already implicitly assuming all formats were storable.
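
Sketched, the check at texture creation ends up something like:

    // Refuse to create a storage texture with a non-storable format.
    if (params->storage_dst && !params->format->storable) {
        MP_ERR(ra, "Format '%s' is not usable for storage!\n",
               params->format->name);
        return NULL;
    }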
Fixes #6657
When the D3D11 backend was first written, SPIRV-Cross only had a C++ API
and no guarantee of API or ABI stability, so instead of using
SPIRV-Cross directly, mpv used an unofficial C wrapper called crossc.
Now that KhronosGroup/SPIRV-Cross#611 is resolved, SPIRV-Cross has an
official C API that can be used instead, so remove crossc and use
SPIRV-Cross directly.
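
The official C API is used along these lines (options setup and error
handling omitted; spirv_words is the SPIR-V binary as a SpvId array):

    spvc_context ctx;
    spvc_context_create(&ctx);

    spvc_parsed_ir ir;
    spvc_context_parse_spirv(ctx, spirv_words, word_count, &ir);

    spvc_compiler compiler;
    spvc_context_create_compiler(ctx, SPVC_BACKEND_HLSL, ir,
                                 SPVC_CAPTURE_MODE_TAKE_OWNERSHIP,
                                 &compiler);

    const char *hlsl = NULL;
    spvc_compiler_compile(compiler, &hlsl);
    // ... hand the HLSL to the D3D shader compiler ...
    spvc_context_destroy(ctx);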
This allows the new GPU screenshot functionality introduced in
9f595f3a80 to work with the D3D11 backend. It replaces the old window
screenshot functionality, which was shared between D3D11 and ANGLE. The
old code can be removed, since it's not needed by ANGLE anymore either.
Get rid of the old vf.c code. Replace it with a generic filtering
framework, which can potentially handle more than just --vf. At least
reimplementing --af with this code is planned.
This changes some --vf semantics (including runtime behavior and the
"vf" command). The most important ones are listed in interface-changes.
vf_convert.c is renamed to f_swscale.c. It is now an internal filter
that can not be inserted by the user manually.
f_lavfi.c is a refactor of player/lavfi.c. The latter will be removed
once --lavfi-complex is reimplemented on top of f_lavfi.c. (which is
conceptually easy, but a big mess due to the data flow changes).
The existing filters are all changed heavily. The data flow of the new
filter framework is different. Especially EOF handling changes - EOF is
now a "frame" rather than a state, and must be passed through exactly
once.
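
In a filter's process() callback this looks roughly like the following
(using the new framework's pin API):

    static void process(struct mp_filter *f)
    {
        if (!mp_pin_can_transfer_data(f->ppins[1], f->ppins[0]))
            return;
        struct mp_frame frame = mp_pin_out_read(f->ppins[0]);
        if (frame.type == MP_FRAME_EOF) {
            // EOF is just a frame now: forward it exactly once
            mp_pin_in_write(f->ppins[1], frame);
            return;
        }
        // ... process and output normal frames ...
        mp_pin_in_write(f->ppins[1], frame);
    }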
Another major thing is that all filters must support dynamic format
changes. The filter reconfig() function goes away. (This sounds complex,
but since all filters need to handle EOF draining anyway, they can use
the same code, and it removes the mess with reconfig() having to predict
the output format, which completely breaks with libavfilter anyway.)
In addition, there is no automatic format negotiation or conversion.
libavfilter's primitive and insufficient API simply doesn't allow us to
do this in a reasonable way. Instead, filters can use f_autoconvert as
sub-filter, and tell it which formats they support. This filter will in
turn add actual conversion filters, such as f_swscale, to perform
necessary format changes.
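
Roughly, a filter wraps f_autoconvert and declares what it accepts:

    struct mp_autoconvert *conv = mp_autoconvert_create(f);
    if (!conv)
        return false;
    // Accepted input formats; anything else gets converted first.
    mp_autoconvert_add_imgfmt(conv, IMGFMT_NV12, 0);
    mp_autoconvert_add_imgfmt(conv, IMGFMT_420P, 0);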
vf_vapoursynth.c uses the same basic principle of operation as before,
but with worryingly different details in data flow. Still appears to
work.
The hardware deint filters (vf_vavpp.c, vf_d3d11vpp.c, vf_vdpaupp.c) are
heavily changed. Fortunately, they all used refqueue.c, which is for
sharing the data flow logic (especially for managing future/past
surfaces and such). It turns out it can be used to factor out most of
the data flow. Some of these filters accepted software input. Instead of
having ad-hoc upload code in each filter, surface upload is now
delegated to f_autoconvert, which can use f_hwupload to perform this.
Exporting VO capabilities is still a big mess (mp_stream_info stuff).
The D3D11 code drops the redundant image formats, and all code uses the
hw_subfmt (sw_format in FFmpeg) instead. Although that too seems to be a
big mess for now.
f_async_queue is unused.
This enables DXVA2 hardware decoding with ra_d3d11. It should be useful
for Windows 7, where D3D11VA is not available. Images are transferred
from D3D9 to D3D11 using D3D9Ex surface sharing[1].
Following Microsoft's recommendations, it uses a queue of shared
surfaces, similar to Microsoft's ISurfaceQueue. This will hopefully
prevent surface sharing from impacting parallelism and allow multiple
D3D11 frames to be in-flight at once.
[1]: https://msdn.microsoft.com/en-us/library/windows/desktop/ee913554.aspx
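
The sharing mechanism, sketched (error handling omitted):

    // Create a shareable D3D9Ex surface...
    IDirect3DSurface9 *surf9 = NULL;
    HANDLE share_handle = NULL;
    IDirect3DDevice9Ex_CreateRenderTarget(dev9, w, h, D3DFMT_X8R8G8B8,
        D3DMULTISAMPLE_NONE, 0, FALSE, &surf9, &share_handle);

    // ...and open the same memory as a D3D11 texture.
    ID3D11Texture2D *tex11 = NULL;
    ID3D11Device_OpenSharedResource(dev11, share_handle,
        &IID_ID3D11Texture2D, (void **)&tex11);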
In a lost device scenario, resize() will fail and p->backbuffer will be
NULL. We can't recover from lost devices yet, but we should still check
for a NULL backbuffer in start_frame() rather than crashing.
Also remove a NULL check for p->swapchain. This was a red herring, since
p->swapchain never becomes NULL in an error condition, but p->backbuffer
actually does.
This should fix the crash in #5320, but it doesn't fix the underlying
reason for the lost device (which is probably a driver bug).
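
The check itself is simple (sketch; the exact callback signature is
assumed):

    static bool start_frame(struct ra_swapchain *sw, struct ra_fbo *out_fbo)
    {
        struct priv *p = sw->priv;
        if (!p->backbuffer)
            return false; // lost device; skip the frame, don't crash
        // ... set up out_fbo as usual ...
        return true;
    }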
Apparently some Intel drivers have a bug where copying from staging
buffers to constant buffers does not work. We used to keep a copy of the
buffer data in a staging buffer to enable partial constant buffer
updates. To work around this bug, keep the copy in talloc-allocated
system memory instead.
There doesn't seem to be any noticeable performance difference from
keeping the copy in system memory. Our cbuffers are probably too small
for it to matter anyway.
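
Sketch of the workaround (names are illustrative):

    struct d3d_buf {
        ID3D11Buffer *buf;
        void *data;   // talloc-allocated shadow copy in system memory
        bool dirty;   // set by partial updates
    };

    static void buf_resolve(struct ra *ra, struct d3d_buf *buf)
    {
        struct ra_d3d11 *p = ra->priv;
        if (!buf->dirty)
            return;
        // Upload the whole shadow copy instead of copying from a
        // staging buffer, sidestepping the driver bug.
        ID3D11DeviceContext_UpdateSubresource(p->ctx,
            (ID3D11Resource *)buf->buf, 0, NULL, buf->data, 0, 0);
        buf->dirty = false;
    }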
See also: https://crbug.com/593024
Fixes #5293
Finally get rid of all the HWDEC_* things, and instead rely on the
libavutil equivalents. vdpau still uses a shitty hack, but fuck the
vdpau code.
Remove all the now unneeded remains. The vdpau preemption thing is now
unused; if someone cares, this could probably be restored.
It makes more sense to have it in the general video directory (along
with vdpau.c and vaapi.c), since the decoder source files don't even
access it anymore.
The testing_only field is not referenced anymore with vaglx removed and
the previous commit dropping all uses.
The ra_hwdec_driver.api field became unused with the previous commit,
but all hwdec interop drivers still initialized it.
Since this touches highly OS-specific code, build regressions are
possible (plus the previous commit might break hw decoding at runtime).
At least hwdec_cuda.c still used the .api field, other than initializing
it.
Like the manual says, this is technically undefined behaviour. See:
https://msdn.microsoft.com/en-us/library/windows/desktop/ff476085.aspx
In particular, MSDN says texture arrays created with the BIND_DECODER
flag cannot be used with CreateShaderResourceView, which means they
can't be sampled through SRVs like normal Direct3D textures. However,
some programs (Google Chrome included) do this anyway for performance
and power-usage reasons, and it appears to work with most drivers.
Older AMD drivers had a "bug" with zero-copy decoding, but this appears
to have been fixed. See #3255, #3464 and http://crbug.com/623029.
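
For reference, the zero-copy path creates views along these lines
(sketch, error handling omitted):

    // SRV for the luma plane of one slice of a BIND_DECODER NV12
    // texture array; chroma would use DXGI_FORMAT_R8G8_UNORM.
    D3D11_SHADER_RESOURCE_VIEW_DESC srv_desc = {
        .Format = DXGI_FORMAT_R8_UNORM,
        .ViewDimension = D3D11_SRV_DIMENSION_TEXTURE2DARRAY,
        .Texture2DArray = {
            .MipLevels = 1,
            .FirstArraySlice = subindex, // which decoder surface
            .ArraySize = 1,
        },
    };
    ID3D11ShaderResourceView *srv = NULL;
    ID3D11Device_CreateShaderResourceView(dev,
        (ID3D11Resource *)tex_array, &srv_desc, &srv);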