RepoMirrors/mpv - mpv

Commit Graph

Author	SHA1	Message	Date
wm4	8e5cd62dca	oml_sync: fix typo in comment I think... Also reword another part of the text.	2019-09-20 00:32:29 +02:00
wm4	c6773692ad	vo_gpu: remove vdpau/GLX backend Useless garbage. This was once added to test whether vdpau presentation feedback could be used. Results were always unsatisfactory, and now vdpau is dead.	2019-09-19 20:37:05 +02:00
wm4	83d7123dc3	vo_gpu: remove mali-fbdev Useless at this point, I don't even know if it still works, or how to test it.	2019-09-19 20:37:05 +02:00
Anton Kindestam	e08f235578	drm: fix libmpv ABI breakage introduced in `351c083487` Extending the client-allocated mpv_opengl_drm_params struct constituted a break of ABI that could cause UB. Create a clean break by deprecating "drm_params" and related structs and enum values, and replacing it with "drm_params_v2". Also fix some comments and code that wrongly assumed that open could return any other negative number than -1 for failure. This commit updates the libmpv version to 1.104	2019-09-18 23:59:32 +03:00
wm4	0abe34ed21	vo_gpu: x11: remove special vdpau probing, use EGL by default Originally, vo_gpu/vo_opengl considered the case of Nvidia proprietary drivers, which required vdpau/GLX, and Intel open source drivers, which require vaapi/EGL. Since window creation and GPU context creation are inseparable in mpv's internal API, it had to pick the correct API very early, or hardware decoding wouldn't work. "x11probe" was introduced for this reason. It created a GLX context (without showing the window yet), and checked whether vdpau was available. If yes, it used GLX, if not, it continued probing x11/EGL. (Obviously it couldn't always fail on GLX without vdpau, which is why it was a separate "probe" backend.) Years passed, and now the situation is different. Vdpau is dead. Nvidia drivers and libavcodec now provide CUDA interop, which requires EGL, and fixes some of the vdpau problems. AMD drivers now provide vaapi, which generally works better than vdpau. Intel didn't change. In particular, vaapi provides working HEVC Main10 support. In theory, it should work on vdpau too, with quality reduction (no 10 bit surfaces), but I couldn't get it to work. So always prefer EGL. And suddenly hardware decoding works. This is actually rather important, because HEVC is unfortunately on the rise, despite shitty encoders and unoptimized decoders. The latter may mean that hardware decoding works better than libavcodec. This should have been done a long, long time ago.	2019-09-15 20:00:52 +03:00
wm4	10a1b98082	vo_gpu: x11egl: support Mesa OML sync extension Mesa supports the EGL_CHROMIUM_sync_control extension, and it's available out of the box with AMD drivers. In practice, this is exactly the same as GLX_OML_sync_control, but for EGL. The extension specification is separate from the GLX one though, and buried somewhere in the Chromium code. This appears to work, although I don't know if it really works. In theory, this could be useful for other EGL targets. Support code for it could have been added to egl_helpers.c to avoid some minor duplicated glue code if another EGL target were to provide this extension. I didn't bother with that. ANGLE on Windows can't support it, because the extension spec. explicitly requires POSIX timers. ANGLE on Linux/OSX is actively harmful for mpv and hopefully won't ever use it. Wayland uses EGL, but has its own fancy presentation feedback stuff (and besides, I don't think basic video player functionality works on Wayland at all). context_drm_egl maybe? But I think DRM has its own stuff.	2019-09-08 23:23:43 +10:00
wm4	8d7960f6ef	vo_gpu: glx: move OML sync code to an independent file So the next commit can make EGL use it. EGL has a quite similar function, that practically works the same. Although it's relatively trivial, it's still tricky, and probably shouldn't end up as duplicated code. There are no functional changes, except initialization, and how failure of the glXGetSyncValues call is handled. Also, some comments mention the EGL extension. Note that there's no intention for this code to handle anything else than the very specific OML sync extension (and its EGL equivalent). This is just too weirdly specific to the weird idiosyncrasies of the extension, and it makes no sense to extend it to handle anything else. (Such as Wayland or DXGI presentation feedback.)	2019-09-08 23:23:43 +10:00
Philip Langdale	6842755feb	vo_gpu: hwdec_vaegl: Rename and move to hwdec_vaapi In preparation for adding Vulkan interop support, let's rename to remove the egl reference and move to an api neutral location.	2019-07-08 01:57:02 +02:00
Philip Langdale	1638fa7b46	vo/gpu: hwdec_vdpau: Support direct mode for 4:4:4 content New releases of VDPAU support decoding 4:4:4 content, and that comes back as NV24 when using 'direct mode' in OpenGL Interop. That means we need to be a little bit smarter about how we set up the OpenGL textures.	2019-07-08 01:11:27 +02:00
Michael Forney	13e14d95e1	opengl/context_wayland: Fix crash on configure before initial reconfig If the compositor sends a configure event before the surface is initially mapped, resize gets called before the egl_window gets created, resulting in a crash in wl_egl_window_resize. This was fixed back in `618361c697`, but was reintroduced when the wayland code was rewritten in `68f9ee7e0b`.	2019-07-08 01:00:01 +02:00
Philip Langdale	e2976e662d	video/out/gpu: Add a `storable` flag to ra_format While `ra` supports the concept of a texture as a storage destination, it does not support the concept of a texture format being usable for a storage texture. This can lead to us attempting to create a texture from an incompatible format, with undefined results. So, let's introduce an explicit format flag for storage and use it. In `ra_pl` we can simply reflect the `storable` flag. For GL and D3D, we'll need to write some new code to do the compatibility checks. I'm not going to do it here because it's not a regression; we were already implicitly assuming all formats were storable. Fixes #6657	2019-07-08 00:59:28 +02:00
Anton Kindestam	8261924db9	drm_common: Add proper help option to drm-mode This was implemented by using OPT_STRING_VALIDATE for drm-mode, instead of OPT_INT. Using a string here also prepares for future additions to drm-mode that aim to allow specifying a mode by its resolution.	2019-05-04 14:17:11 +02:00
Anton Kindestam	a776628d88	drm_common: Add option to toggle use of atomic modesetting It is useful when debugging to be able to force atomic off, or as a workaround if atomic breaks for some user. Legacy modesetting is less likely to break by virtue of being a less complex API.	2019-05-04 14:17:11 +02:00
Philip Langdale	23a324215b	vo/gpu: hwdec_cuda: Refactor gpu api specific code into separate files The amount of code now present that's specific to Vulkan or OpenGL has reached the point where we really want to split it out to avoid a mess of #ifdefs. At the same time, I'm moving the code to an api neutral location.	2019-05-03 18:02:18 +02:00
Anton Kindestam	738fda3677	context_drm_egl: Add support for presentation feedback This implements presentation feedback for context_drm_egl using the values that get fed to the page flip handler.	2019-05-03 18:01:56 +02:00
Jan Ekström	edbc199914	vo_gpu/hwdec_cuda: fixup compilation with vulkan disabled The actual code utilizing this enum was seemingly properly if'd, but not the enum in the struct itself. Fixes compilation.	2019-04-22 18:17:30 +03:00
Philip Langdale	74831dd651	vo/gpu: hwdec_cuda: Reorganise backend-specific code This tries to tidy up the GL vs Vulkan code to be a bit cleaner and easier to read.	2019-04-21 23:55:22 +03:00
Philip Langdale	4005cda614	vo_gpu: hwdec_cuda: Implement interop for placebo This change updates the vulkan interop code to work with the libplacebo based ra_vk, but also introduces direct VkImage sharing to avoid the use of the intermediate buffer. It is also necessary and desirable to introduce explicit semaphore bsed synchronisation for operations on the shared images. Synchronisation means we can safely reuse the same VkImage for every mapped frame, by ensuring the frame is copied to the VkImage before mapping the next frame. This functionality requires a 417.xx or newer nvidia driver, due to bugs in the VkImage interop in the earlier 411 and 415 drivers. It's definitely worth the effort, as the raw throughput is about twice that of implementation using an intermediate buffer.	2019-04-21 23:55:22 +03:00
Niklas Haas	7006d6752d	vo_gpu: vulkan: use libplacebo instead This commit rips out the entire mpv vulkan implementation in favor of exposing lightweight wrappers on top of libplacebo instead, which provides much of the same except in a more up-to-date and polished form. This (finally) unifies the code base between mpv and libplacebo, which is something I've been hoping to do for a long time. Note: The ra_pl wrappers are abstract enough from the actual libplacebo device type that we can in theory re-use them for other devices like d3d11 or even opengl in the future, so I moved them to a separate directory for the time being. However, the rest of the code is still vulkan-specific, so I've kept the "vulkan" naming and file paths, rather than introducing a new `--gpu-api` type. (Which would have been ended up with significantly more code duplicaiton) Plus, the code and functionality is similar enough that for most users this should just be a straight-up drop-in replacement. Note: This commit excludes some changes; specifically, the updates to context_win and hwdec_cuda are deferred to separate commits for authorship reasons.	2019-04-21 23:55:22 +03:00
Niklas Haas	f0b6860d62	vo_gpu: index desc namespaces by ra No reason to require them be constant. This allows them to depend on runtime characteristics of the `ra`.	2019-04-21 23:55:22 +03:00
Jan Ekström	199aabddcc	Merge branch 'master' into pr6360 Manual changes done: * Merged the interface-changes under the already master'd changes. * Moved the hwdec-related option changes to video/decode/vd_lavc.c.	2019-03-11 01:00:27 +02:00
Anton Kindestam	537006965e	context_drm_egl: implement n-buffering This allows context_drm_egl to use as many buffers as libgbm or the swapchain_depth setting allows (whichever is smaller). On pause and on still images (cover art etc.) to make sure that output does not lag behind user input, the swapchain is drained and reverts to working in a dual buffered (equivalent to swapchain-depth=1) manner. When possible (swapchain-depth>=2), the wait on the page flip event is now not done immediately after queueing, but is deferred to the next invocation of swap_buffers. Which should give us more CPU time between invocations. Although, since gbm_surface_has_free_buffers() can only tell us a boolean value and not how many buffers we have left, we are forced to do this contortionist dance where we first overshoot until gbm_surface_has_free_buffers() reports 0, followed by immediately waiting so we can free a buffer, to be able to get the deferred wait on page flip rolling. With this commit we do not rely on the default vsync fences/latency emulation of video/out/opengl/context.c, but supply our own, since the places we create and wait for the fences needs to be somewhat different for best performance. Minor fixes: * According to GBM documentation all BO:s gotten with gbm_surface_lock_front_buffer must be released before gbm_surface_destroy is called on the surface. * We let the page flip handler function handle the waiting_for_flip flag.	2019-02-25 01:25:25 +01:00
Anton Kindestam	ae115bd8d8	opengl: Support GL_ARB_sync style fences on OpenGL ES 3.0 OpenGL ES 3.0 and up has suppport for for GL_ARB_sync style fences. Make sure that mpv can use them.	2019-02-25 01:25:25 +01:00
wm4	f4ce3b8bb9	vo, vo_gpu, glx: correct GLX_OML_sync_control usage I misunderstood how this extension works. If I understand it correctly now, it's worse than I thought. They key thing is that the (ust, msc, sbc) tripple is not for a single swap event. Instead, (ust, msc) run independently from sbc. Assuming a CFR display/compositor, this means you can at best know the vsync phase and frequency, but not the exact time a sbc changed value. There is GLX_INTEL_swap_event, which might work as expected, but it has no EGL equivalent (while GLX_OML_sync_control does, in theory). Redo the context_glx sync code. Now it's either more correct or less correct. I wanted to add proper skip detection (if a vsync gets skipped due to rendering taking too long and other problems), but it turned out to be too complex, so only some unused fields in vo.h are left of it. The "generic" skip detection has to do. The vsync_duration field is also unused by vo.c. Actually this seems to be an improvement. In cases where the flip call timing is off, but the real driver-level timing apparently still works, this will not report vsync skips or higher vsync jitter anymore. I could observe this with screenshots and fullscreen switching. On the other hand, maybe it just introduces an A/V offset or so. Why the fuck can't there be a proper API for retrieving these statistics? I'm not even asking for much.	2018-12-06 10:32:27 +01:00
wm4	b1ba7de34d	vo: use a struct for vsync feedback stuff So new useless stuff can be easily added.	2018-12-06 10:30:25 +01:00
wm4	83884fdf03	vo_gpu: glx: use GLX_OML_sync_control for better vsync reporting Use the extension to compute the (hopefully correct) video delay and vsync phase. This is very fuzzy, because the latency will suddenly be applied after some frames have already been shown. This means there _will_ be "jumps" in the time accounting, which can lead to strange effects at start of playback (such as making initial "dropped" etc. frames worse). The only reasonable way to fix this would be running a few dummy frame swaps at start of playback until the latency is known. The same happens when unpausing. This only affects display-sync mode. Correct function was not confirmed. It only "looks right". I don't have the equipment to make scientifically correct measurements. A potentially bad thing is that we trust the timestamps we're receiving. Out of bounds timestamps could wreak havoc. On the other hand, this will probably cause the higher level code to panic and just disable DS. As a further caveat, this makes a bunch of assumptions about UST timestamps. If there are delayed frames (i.e. we skipped one or more vsyncs), the latency logic is mostly reset. There is no attempt to make the vo.c skipped vsync logic to use this. Also, the latency computation determines a vsync duration, and there's no effort to reconcile or share the vo.c logic for determining vsync duration.	2018-12-06 10:30:14 +01:00
Anton Kindestam	f0509d3738	drm: rename plane options to better, invariant, names This commit bumps the libmpv version to 1.102 drm-osd-plane -> drm-draw-plane drm-video-plane -> drm-drmprime-video-plane drm-osd-size -> drm-draw-surface-size "draw plane", as in the plane that OpenGL draws to, whether it be video + OSD or just OSD. "drmprime video plane", as in the plane used for hwdec video imported via drmprime. "draw surface size", as in the size of the surface used for the draw plane The new names are invariant whether or not hwdec_drmprime_drm is being used or not. The original naming was very confusing, as when doing regular rendering (swdec or vaapi) the video would be displayed on the "OSD plane", and the "Video plane" would remain unused.	2018-12-01 15:42:20 +02:00
Philip Langdale	721bec7dde	vo_gpu: hwdec_cuda: Guard GL and Vulkan headers properly We are currently unnecessarily including vulkan headers even when not building with vulkan support. I also guarded the GL header inclusion even though this doesn't appear to break anything today. Fixes #6330.	2018-11-18 23:50:38 +02:00
Niklas Haas	2704625e3f	vo_gpu: opengl: disable compute shaders for old GLSL Fixes #6272.	2018-11-17 00:49:10 +01:00
Philip Langdale	84d6638907	vo_gpu: hwdec_cuda: Clean up init() error handling Currently, the error paths in init() are a bit confusing, and we can end up trying to pop the current context when there is no context, which leads to distracting error messages. I also added an explicit path to return early if the GPU backend is not OpenGL or Vulkan. It's pointless to do any other cuda init after that point. (Of course, someone could write more interops.) Fixes #6256	2018-10-31 09:20:06 +01:00
Anton Kindestam	ba2dee38fb	hwdec_drmprime_drm: Missing NULL-check on drm_atomic_context video_plane Since `810acf32d6` video_plane can be NULL under some circumstances. While there is a check in init, init treats this as an error condition and would call uninit, which in turn calls disable_video_plane, which would then segfault. Fix this by including a NULL check inside disable_video_plane, so that it doesn't try to disable what isnt' there.	2018-10-25 13:50:09 +02:00
Philip Langdale	da1073c247	vo_gpu: vulkan: hwdec_cuda: Add support for Vulkan interop Despite their place in the tree, hwdecs can be loaded and used just fine by the vulkan GPU backend. In this change we add Vulkan interop support to the cuda/nvdec hwdec. The overall process is mostly straight forward, so the main observation here is that I had to implement it using an intermediate Vulkan buffer because the direct VkImage usage is blocked by a bug in the nvidia driver. When that gets fixed, I will revist this. Nevertheless, the intermediate buffer copy is very cheap as it's all device memory from start to finish. Overall CPU utilisiation is pretty much the same as with the OpenGL GPU backend. Note that we cannot use a single intermediate buffer - rather there is a pool of them. This is done because the cuda memcpys are not explicitly synchronised with the texture uploads. In the basic case, this doesn't matter because the hwdec is not asked to map and copy the next frame until after the previous one is rendered. In the interpolation case, we need extra future frames available immediately, so we'll be asked to map/copy those frames and vulkan will be asked to render them. So far, harmless right? No. All the vulkan rendering, including the upload steps, are batched together and end up running very asynchronously from the CUDA copies. The end result is that all the copies happen one after another, and only then do the uploads happen, which means all textures are uploaded the same, final, frame data. Whoops. Unsurprisingly this results in the jerky motion because every 3/4 frames are identical. The buffer pool ensures that we do not overwrite a buffer that is still waiting to be uploaded. The ra_buf_pool implementation automatically checks if existing buffers are available for use and only creates a new one if it really has to. It's hard to say for sure what the maximum number of buffers might be but we believe it won't be so large as to make this strategy unusable. The highest I've seen is 12 when using interpolation with tscale=bicubic. A future optimisation here is to synchronise the CUDA copies with respect to the vulkan uploads. This can be done with shared semaphores that would ensure the copy of the second frames only happens after the upload of the first frame, and so on. This isn't trivial to implement as I'd have to first adjust the hwdec code to use asynchronous cuda; without that, there's no way to use the semaphore for synchronisation. This should result in fewer intermediate buffers being required.	2018-10-22 21:35:48 +02:00
Niklas Haas	104b510774	vo_gpu: opengl: fix segfault when gl->DeleteSync is unavailable This deinit code was never checked, so this line would always crash on implementations without support for sync objects. Fixes #6197.	2018-10-16 01:57:49 +03:00
Akemi	8d2d0f0640	cocoa-cb: add Apple Software Renderer support by default the pixel format creation falls back to software renderer when everything fails. this is mostly needed for VMs. additionally one can directly request an sw renderer or exclude it entirely.	2018-09-30 17:13:34 +03:00
Anton Kindestam	810acf32d6	drm_atomic: Allow to create atomic context w/o drmprime video plane This is to improve the experience when running with default settings on a driver that doesn't have any overlay planes (or indeed only one plane), but still supports DRM atomic. Since the drmprime video plane is set to pick an overlay plane by default it would fail on these drivers due to not being able to create any atomic context. Users with such cards had to specify --drm-video-plane-id manually to some bogus value (it's not used after all). The "video" plane is only ever used by the drmprime-drm hwdec interop, which is not used at all in the typical usecase where everything is actually rendered on to the "OSD" plane using EGL, so having an atomic context without the "video" plane should be fine most of the time.	2018-09-30 14:22:49 +03:00
Anton Kindestam	351c083487	hwdec_vaegl: Fix VAAPI EGL interop used with gpu-context=drm Add another parameter to mpv_opengl_drm_params to hold the FD to the render node, so that the fd can be passed to hwdec_vaegl. The render node is opened in context_drm_egl and inferred from the primary device fd using drmGetRenderDeviceNameFromFd.	2018-07-09 02:33:35 +03:00
Anton Kindestam	7beee68f8d	context_drm_egl: Fix CRTC setup and release code when using atomic The previous code did not save enough information about the old state, and could end up changing what plane the fbcon:s FB got attached to, or in worse case causing a blank screen (observed in some multi-screen setups on Sandy Bridge). In addition refactor the handling of drmModeModeInfo property blobs to not leak, as well as enable reuse of already created blobs.	2018-07-09 02:17:47 +03:00
Anton Kindestam	1298b9d201	context_drm_egl: Fix some memory leaks on error exit Fix some memory leaks on error exit in crtc_setup_atomic and crtc_release_atomic.	2018-07-09 02:17:47 +03:00
Anton Kindestam	157b242289	hwdec_drmprime_drm: Do not show error message during probing Change the log-level of an error message that would sometimes show up during hwdec probing, and could be misleading.	2018-06-08 22:13:39 +03:00
Anton Kindestam	4c6f36611d	context_drm_egl: fix some comments and log messages that had not been updated since the plane rename commit	2018-05-01 20:48:02 +03:00
Anton Kindestam	e60728a622	drm/atomic: Fix crtc_setup_atomic and crtc_release_atomic Add some properties which where forgotten in crtc_setup_atomic. In both change to not use DRM_MODE_PAGE_FLIP_EVENT \| DRM_MODE_ATOMIC_NONBLOCK flags. This should make it more similar to the drmSetCrtc which it aims to replace (take effect directly, and blocking call). This also saves us the trouble of having to set up a poll to wait for pageflip, which would've been neccesary with DRM_MODE_PAGE_FLIP_EVENT, in both crtc_setup_atomic and crtc_release_atomic.	2018-05-01 20:48:02 +03:00
LongChair	ba3d90d9ed	drm/atomic: disable video plane when unused. This patch will make sure that the video plane is hidden when unused. When using high resolution modes, typically UHD, and embedding mpv, having the video plane sitting in the back when you don't play any video is eating a lot of memory bandwidth for compositing. That patch makes sure that the video layer is just disabled before and after playback.	2018-05-01 20:48:02 +03:00
LongChair	1ccc56eff0	drm/atomic: add atomic modesetting. This commit allows to add atomic modesetting when using the atomic renderer. This is actually needed when using and osd with a smaller size than screen resolution. It will also make the drm atomic path more consistent	2018-05-01 20:48:02 +03:00
LongChair	ed94f8dc00	drm/atomic: refactor planes names We are currently using primary / overlay planes drm objects, assuming that primary plane is osd and overlay plane is video. This commit is doing two things : - replace the primary / overlay planes members with osd and video planes member without the assumption - Add two more options to determine which one of the primary / overlay is associated to osd / video. - It will default osd to overlay and video to primary if unspecified	2018-05-01 20:48:02 +03:00
LongChair	49bc07faea	drm/atomic: add connector to atomic context This patch adds - DRM connector object to atomic context. - fd property to the drm atomic object as well as a method to read blob type properties. This allows to ensure that the proper connector is picked up, especially when specifying it from the commandline, and also allows to make sure we're using the right one when embedding with interop into an application.	2018-05-01 20:48:02 +03:00
LongChair	9f2970f28a	drm/atomic: refactor hwdec_drmprime_drm with native resources That new API was introduced and allows to have several native resources. Thisuses that mechanisma for drm resources rather than the deprecated opengl-cb structs. This patch therefore add two structs that can be used with the drm atomic interop. - mpv_opengl_drm_params : which will hold all the drm handles - mpv_opengl_drm_osd_size : which will hold osd layer size This commit adds a drm-osd-size=WxH parameter to commandline which allows to define the OSD plane dimension. OSD can be upscaled to screen resolution when having OSD at video resolution is too heavy. This is especially useful for UHD modes on embedded devices where the GPU cannot handle UHD modes at a decent framerate.	2018-05-01 20:48:02 +03:00
Akemi	6bd2bdc745	cocoa: change deprecation warning from opengl-cb to libmpv	2018-04-29 15:03:47 +03:00
wm4	eb33556cbf	egl_helpers: change minimum framebuffer size to 8 bit per component This is for working around bugs in certain Android devices. At least one device fails to sort EGLConfigs by size, so eglChooseConfig() ends up choosing a config with 5/6/5 bits per r/g/b component. The other attributes in the affected EGLConfigs did not look like they should affect the sorting process as specified by the EGL 1.4 standard. The device was reported as: Sony Xperia Z3 Tablet Compact Firmware 6.0.1 build number 23.5.A.1.291 GL_VERSION='OpenGL ES 3.0 V@140.0 AU@ (GIT@I741a3d36ca)' GL_VENDOR='Qualcomm' GL_RENDERER='Adreno (TM) 330' Other Qualcom/Adreno devices have been reported as unaffected by this (including some with same GL_RENDERER string). "Fix" this by always requiring at least 8 bit. This means it would fail on devices which cannot provide this. We're fine with this. mpv-android/mpv-android#112	2018-04-29 02:21:32 +03:00
wm4	67ce9813d6	egl_helpers: log certain EGL attributes Might be helpful with broken EGL implementations.	2018-04-29 02:21:32 +03:00
Aman Gupta	ed7bc3a5f3	hwdec_ios: fix crash after mapper_init failure	2018-04-17 01:06:29 +03:00
Philip Langdale	07915b1227	vo_gpu: hwdec: Use ffnvcodec to load CUDA symbols The CUDA dynamic loader was broken out of ffmpeg into its own repo and package. This gives us an opportunity to re-use it in mpv and remove our custom loader logic.	2018-04-15 19:31:50 +03:00
Aman Gupta	9efb0278e7	opengl: include details in EGL context errors	2018-04-12 02:31:07 +03:00
wm4	52dd38a48a	client API: add a new way to pass X11 Display etc. to render API Hardware decoding things often need access to additional handles from the windowing system, such as the X11 or Wayland display when using vaapi. The opengl-cb had nothing dedicated for this, and used the weird GL_MP_MPGetNativeDisplay GL extension (which was mpv specific and not officially registered with OpenGL). This was awkward, and a pain due to having to emulate GL context behavior (like needing a TLS variable to store context for the pseudo GL extension function). In addition (and not inherently due to this), we could pass only one resource from mpv builtin context backends to hwdecs. It was also all GL specific. Replace this with a newer mechanism. It works for all RA backends, not just GL. the API user can explicitly pass the objects at init time via mpv_render_context_create(). Multiple resources are naturally possible. The API uses MPV_RENDER_PARAM_* defines, but internally we use strings. This is done for 2 reasons: 1. trying to leave libmpv and internal mechanisms decoupled, 2. not having to add public API for some of the internal resource types (especially D3D/GL interop stuff). To remain sane, drop support for obscure half-working opengl-cb things, like the DRM interop (was missing necessary things), the RPI window thing (nobody used it), and obscure D3D interop things (not needed with ANGLE, others were undocumented). In order not to break ABI and the C API, we don't remove the associated structs from opengl_cb.h. The parts which are still needed (in particular DRM interop) needs to be ported to the render API.	2018-03-26 19:47:08 +02:00
LongChair	b4c6fb0f52	drm/atomic: ensure request is available until uninit Right now the atomic request is alive during the renderloop. We want it to be alive until the drm egl context is destroyed because some properties might still be set upon interop close This patch make the request to be kept created even outside the renderloop. The context uninit will commit the last request.	2018-03-23 00:44:47 +02:00
LongChair	dae88644e6	hwdec_drmprime_drm: Fix a DRM buffer memory leakage We use triple buffering for this interop and we were only unreffing the data structures, which doesn't destroy the drm buffers. This patch allows to make sure that we release the drm buffers on playback end.	2018-03-05 23:33:45 -08:00
Anton Kindestam	33cffdcbac	context_drm_egl: Allow fallback EGLConfig formats It turns out that Mali drivers are likely broken, and do not return GBM_FORMAT_ARGB8888 (they return GBM_FORMAT_XRGB8888) when getting EGL_NATIVE_VISUAL_ID for any EGLConfig, even though the resulting EGLConfig appears to be capable of alpha. It could also be potentially useful to allow an ARGB EGLConfig used with an XRGB framebuffer on some platforms, so we do that. (cf. weston) Unrelated indentation fix in gbm_format_to_string.	2018-03-04 16:56:06 -08:00
Niklas Haas	ad3f6d2f97	vo_gpu: don't segfault in libmpv_gl's destroy() This segfaults when the GPU context has not been fully initialized, such as would be the case when initialization errors.	2018-03-04 00:17:00 -08:00
wm4	b037121430	client API: deprecate opengl-cb API and introduce a replacement API The purpose of the new API is to make it useable with other APIs than OpenGL, especially D3D11 and vulkan. In theory it's now possible to support other vo_gpu backends, as well as backends that don't use the vo_gpu code at all. This also aims to get rid of the dumb mpv_get_sub_api() function. The life cycle of the new mpv_render_context is a bit different from mpv_opengl_cb_context, and you explicitly create/destroy the new context, instead of calling init/uninit on an object returned by mpv_get_sub_api(). In other to make the render API generic, it's annoyingly EGL style, and requires you to pass in API-specific objects to generic functions. This is to avoid explicit objects like the internal ra API has, because that sounds more complicated and annoying for an API that's supposed to never change. The opengl_cb API will continue to exist for a bit longer, but internally there are already a few tradeoffs, like reduced thread-safety. Mostly untested. Seems to work fine with mpc-qt.	2018-02-28 00:55:06 -08:00
Anton Kindestam	fe23715876	context_drm_egl: Repair VT switching The VT switcher was being set up, but it was being neither polled nor interrupted. Insert wait_events and wakeup functions based on those from vo_drm, and add return early in drm_egl_swap_buffers if p->active isn't set. This should get basic VT switching working, however there will likely still be some random glitches. Switching between mpv and X11/weston is unlikely to work satisfactorily until we can solve the problems with drmSetMaster and drmDropMaster.	2018-02-26 23:56:13 -08:00
Anton Kindestam	3325c7a912	context_drm_egl: Introduce 30bpp support This introduces the option --drm-format (currently used only by context_drm_egl, vo_drm implementation is pending) which allows you to pick between a xrgb8888 or a xrgb2101010 visual for --gpu-context=drm. Requires a recent mesa (18.0.0_rc4 or later) to work. This also fixes a bug when using --gpu-context=drm on a 30bpp-enabled mesa (allow_rgb10_configs set to true). Previously it would've set up an XRGB8888 format at the DRM/GBM level, while a 30bpp EGLConfig would be picked, resulting in a garbled image.	2018-02-26 23:56:13 -08:00
Anton Kindestam	bb07b22d42	egl_helpers: mpegl_cb can now signal an error condition This can be used by client code that needs to fail when it cannot find a suitable EGLConfig.	2018-02-26 23:56:13 -08:00
wm4	0dbad9503f	vo_gpu: hwdec_drmprime_drm: cosmetic simplification Coverity complained about the redundant init of hratio etc. - just remove that and merge declaration/init of these variables. Also the first double cast in each expression is unnecessary.	2018-02-16 22:04:15 -08:00
wm4	f17246fec1	vo_gpu: remove old window screenshot glue code and GL implementation There is now a better way. Reading the font framebuffer was always a hack. The new code via VOCTRL_SCREENSHOT renders it into a FBO, which does not come with the disadvantages of reading the front buffer (like not being supported by GLES, possibly black regions due to overlapping windows on some systems). For now keep VOCTRL_SCREENSHOT_WIN on the VO level, because there are still some lesser VOs and backends that use it.	2018-02-13 17:45:29 -08:00
James Ross-Gowan	1b80e124db	vo_gpu: d3d11: implement tex_download() This allows the new GPU screenshot functionality introduced in `9f595f3a80` to work with the D3D11 backend. It replaces the old window screenshot functionality, which was shared between D3D11 and ANGLE. The old code can be removed, since it's not needed by ANGLE anymore either.	2018-02-13 21:25:15 +11:00
Akemi	c5e4538bc4	cocoa-cb: initial implementation via opengl-cb API this is meant to replace the old and not properly working vo_gpu/opengl cocoa backend in the future. the problems are various shortcomings of Apple's opengl implementation and buggy behaviour in certain circumstances that couldn't be properly worked around. there are also certain regressions on newer macOS versions from 10.11 onwards. - awful opengl performance with a none layer backed context - huge amount of dropped frames with an early context flush - flickering of system elements like the dock or volume indicator - double buffering not properly working with a none layer backed context - bad performance in fullscreen because of system optimisations all the problems were caused by using a normal opengl context, that seems somewhat abandoned by apple, and are fixed by using a layer backed opengl context instead. problems that couldn't be fixed could be properly worked around. this has all features our old backend has sans the wid embedding, the possibility to disable the automatic GPU switching and taking screenshots of the window content. the first was deemed unnecessary by me for now, since i just use the libmpv API that others can use anyway. second is technically not possible atm because we have to pre-allocate our opengl context at a time the config isn't read yet, so we can't get the needed property. third one is a bit tricky because of deadlocking and it needed to be in sync, hopefully i can work around that in the future. this also has at least one additional feature or eye-candy. a properly working fullscreen animation with the native fs. also since this is a direct port of the old backend of the parts that could be used, though with adaptions and improvements, this looks a lot cleaner and easier to understand. some credit goes to @pigoz for the initial swift build support which i could improve upon. Fixes: #5478, #5393, #5152, #5151, #4615, #4476, #3978, #3746, #3739, #2392, #2217	2018-02-12 04:49:15 -08:00
Akemi	abf2efb107	osx: always deactivate the early opengl flush on macOS early flushing only caused problems on macOS, which includes: - performance problems and huge amount of dropped frames - problems with playing back video files with fps close to the display refresh rate - rendering at twice the rate of the video fps - not properly detected display refresh rate we always deactivate any early flush for macOS to fix these problems.	2018-02-12 04:49:15 -08:00
wm4	9f595f3a80	vo_gpu: make screenshots use the GL renderer Using the GL renderer for color conversion will make sure screenshots will use the same conversion as normal video rendering. It can do this for all types of screenshots. The logic when to write 16 bit PNGs changes. To approximate the old behavior, we decide by looking whether the source video format has more than 8 bits per component. We apply this logic even for window screenshots. Also, 16 bit PNGs now always include an unused alpha channel. The reason is that FFmpeg has RGB48 and RGBA64 formats, but no RGB064. RGB48 is 3 bytes and usually not supported by GPUs for rendering, so we have to use RGBA64, which forces an alpha channel. Will break for users who use --target-trc and similar options. I considered creating a new gl_video context, but it could double GPU memory use, so I didn't. This uses FBOs instead of glGetTexImage(), because that increases the chance it could work on GLES (e.g. ANGLE). Untested. No support for the Vulkan and D3D11 backends yet. Fixes #5498. Also fixes #5240, because the code for reading back is not used with the new code path.	2018-02-11 17:45:51 -08:00
LongChair	b01623e0d2	drmprime interop : Add frames triple buffering Currently using the drmprime interop with external mpv intgration can lead to rendering issues because the current frame is being released too early. Typically using this with Qt results in one frame shift because Qt will do waitforvsync and swap, rather than swap and waitforvsync. This leads to tearing as the frambuffer is released while being displayed on screen. In order to avoid releasing the framebuffer that is displayed, We keep the framebuffer alive for one more frame with triple buffering to make sure that whatever rendering process is used, the framebuffer will not be released when it's still on screen. This was tested on RockChip Rock64	2018-02-07 22:40:30 -08:00
Niklas Haas	0870859e3d	vo_gpu: add RA_CAP for gl_NumWorkGroups SPIRV-Cross doesn't support this for the time being. It's possible this could go away again at a later date.	2018-02-05 23:11:18 -08:00
wm4	76276c9210	video: rewrite filtering glue code Get rid of the old vf.c code. Replace it with a generic filtering framework, which can potentially handle more than just --vf. At least reimplementing --af with this code is planned. This changes some --vf semantics (including runtime behavior and the "vf" command). The most important ones are listed in interface-changes. vf_convert.c is renamed to f_swscale.c. It is now an internal filter that can not be inserted by the user manually. f_lavfi.c is a refactor of player/lavfi.c. The latter will be removed once --lavfi-complex is reimplemented on top of f_lavfi.c. (which is conceptually easy, but a big mess due to the data flow changes). The existing filters are all changed heavily. The data flow of the new filter framework is different. Especially EOF handling changes - EOF is now a "frame" rather than a state, and must be passed through exactly once. Another major thing is that all filters must support dynamic format changes. The filter reconfig() function goes away. (This sounds complex, but since all filters need to handle EOF draining anyway, they can use the same code, and it removes the mess with reconfig() having to predict the output format, which completely breaks with libavfilter anyway.) In addition, there is no automatic format negotiation or conversion. libavfilter's primitive and insufficient API simply doesn't allow us to do this in a reasonable way. Instead, filters can use f_autoconvert as sub-filter, and tell it which formats they support. This filter will in turn add actual conversion filters, such as f_swscale, to perform necessary format changes. vf_vapoursynth.c uses the same basic principle of operation as before, but with worryingly different details in data flow. Still appears to work. The hardware deint filters (vf_vavpp.c, vf_d3d11vpp.c, vf_vdpaupp.c) are heavily changed. Fortunately, they all used refqueue.c, which is for sharing the data flow logic (especially for managing future/past surfaces and such). It turns out it can be used to factor out most of the data flow. Some of these filters accepted software input. Instead of having ad-hoc upload code in each filter, surface upload is now delegated to f_autoconvert, which can use f_hwupload to perform this. Exporting VO capabilities is still a big mess (mp_stream_info stuff). The D3D11 code drops the redundant image formats, and all code uses the hw_subfmt (sw_format in FFmpeg) instead. Although that too seems to be a big mess for now. f_async_queue is unused.	2018-01-30 03:10:27 -08:00
Akemi	db08e28304	osx: code cleanups and cosmetic fixes	2018-01-20 14:43:49 -08:00
Niklas Haas	0192eeb09c	vo_gpu: rpi: defer gl_ctx_resize until after gl_ctx_init This segfaults otherwise. The conditional is needed to break a circular dependency (gl_init depends on mpgl_load_functions which depends on recreate_dispmanx which calls gl_ctx_resize). Fixes #5398	2018-01-15 23:16:17 -08:00
sfan5	48943a73f6	vo_gpu/context_android: replace both options with android-surface-size This allows us to automatically trigger a VOCTRL_RESIZE (also contained).	2018-01-02 15:04:31 -08:00
Aman Gupta	2dd020efc2	vo_gpu/android: fallback to EGL_WIDTH/HEIGHT Uses the EGL width/height by default when the user fails to set the android-surface-width/android-surface-height options. This means the vo-resize command is optional, and does not need to be implemented on android devices which do not support rotation. Signed-off-by: Aman Gupta <aman@tmm1.net>	2018-01-01 22:21:44 -08:00
sfan5	451fc931b0	vo_gpu/context: Let embedding application handle surface resizes The callbacks for this are Java-only and EGL does not reliably return the correct values.	2017-12-27 14:29:15 -07:00
wm4	016c4405fb	vo_gpu: EGL: provide SwapInterval to generic code This means that we now explicitly set an interval of 1. Although that should be the EGL default, some drivers could possibly ignore this (unconfirmed). In any case, this commit also allows disabling vsync, for users who want it.	2017-12-27 04:13:46 -07:00
Niklas Haas	a42b8b1142	vo_gpu: attempt re-using the FBO format for p->output_tex This allows RAs with support for non-opaque FBO formats to use a more appropriate FBO format for the output tex, possibly enabling a more efficient blit operation. This requires distinguishing between real formats (which can be used to create textures) and fake formats (e.g. ra_gl's FBO hack).	2017-12-25 00:47:53 +01:00
Niklas Haas	6186cc79e6	vo_gpu: allow invalidating FBO in renderpass_run This is especially interesting for vulkan since it allows completely skipping the layout transition as part of the renderpass. Unfortunately, that also means it needs to be put into renderpass_params, as opposed to renderpass_run_params (unlike #4777). Closes #4777.	2017-12-25 00:47:53 +01:00
James Ross-Gowan	3d8ca93d23	vo_gpu: win: remove exclusive-fullscreen detection hack This hack was part of a solution to VSync judder in desktop OpenGL on Windows. Rather than using blocking-SwapBuffers(), mpv could use DwmFlush() to wait for the image to be presented by the compositor. Since this would only work while the compositor was running, and the compositor was silently disabled when OpenGL entered exclusive fullscreen mode, mpv needed a way to detect exclusive fullscreen mode. The code that is being removed could detect exclusive fullscreen mode by checking the state of an undocumented mutex using undocumented native API functions, but because of how fragile it was, it was always meant to be removed when a better solution for accurate VSync in OpenGL was found. Since then, mpv got the dxinterop backend, which uses desktop OpenGL but has accurate VSync. It also got a native Direct3D 11 backend, which is a viable alternative to OpenGL on Windows. For people who are still using desktop OpenGL with WGL, there shouldn't be much of a difference, since mpv can use other API functions to detect exclusive fullscreen.	2017-12-20 14:53:41 +11:00
wm4	9ed8ca2529	vo_gpu: hwdec_drmprime_drm: don't crash for non-GL contexts Using vulkan with --hwdec crashed because of this.	2017-12-17 11:00:51 -08:00
Niklas Haas	ba1943ac00	msg: reinterpret a bunch of message levels I've decided that MP_TRACE means “noisy spam per frame”, whereas MP_DBG just means “more verbose debugging messages than MSGL_V”. Basically, MSGL_DBG shouldn't create spam per frame like it currently does, and MSGL_V should make sense to the end-user and provide mostly additional informational output. MP_DBG is basically what I want to make the new default for --log-file, so the cut-off point for MP_DBG is if we probably want to know if for debugging purposes but the user most likely doesn't care about on the terminal. Also, the debug callbacks for libass and ffmpeg got bumped in their verbosity levels slightly, because being external components they're a bit less relevant to mpv debugging, and a bit too over-eager in what they consider to be relevant information. I exclusively used the "try it on my machine and remove messages from MSGL_* until it does what I want it to" approach of refactoring, so YMMV.	2017-12-15 22:28:47 -08:00
wm4	5e38e03980	vo_gpu: hwdec_drmprime_drm: silence error on failed autoprobing When autoprobing the hwdec interops (which now happens to all compiled interops if hardware decoding is used), failure to load an interop should not print an error in the normal case. So hide it. (We could make the log level conditional on whether autoprobing is used, but directly loading it without autoprobing is obscure, and most other interops don't do this either.)	2017-12-11 20:50:50 +02:00
Nicolas F	744b67d9e5	Fix various typos in log messages	2017-12-03 21:24:18 +01:00
Anton Kindestam	cc16cd5aa4	video: probe format of primary plane in drm/egl context We need to support hardware/drivers which do not support ARGB8888 in their primary plane. We also use p->primary_plane_format when creating the gbm surface, to make sure it always matches (in actuality there should be little difference).	2017-12-03 17:30:17 +02:00
Anton Kindestam	04e5fbde43	hwdec: whitespace cleanup in hwdec_drmprime_drm.c	2017-12-03 17:30:17 +02:00
Anton Kindestam	eb46d46e73	video: fix use of possibly-NULL pointer in drm_egl_init	2017-12-03 17:30:17 +02:00
wm4	292724538c	video: remove some more hwdec legacy stuff Finally get rid of all the HWDEC_* things, and instead rely on the libavutil equivalents. vdpau still uses a shitty hack, but fuck the vdpau code. Remove all the now unneeded remains. The vdpau preemption thing was not unused anymore; if someone cares this could probably be restored.	2017-12-02 04:53:55 +01:00
wm4	0780d38329	hwdec: don't require setting legacy hwdec fields With the recent changes, mpv's internal mechanisms got synced to libavcodec's once more. Some things are still needed for filters (until the mechanism gets replaced), but there's no need to require other hwdec methods to use these fields. So remove them where they are unnecessary. Also fix some minor leaks in the dxva2 backends, and set the driver_name field in the Apple ones. Untested on Apple crap.	2017-12-02 04:53:51 +01:00
wm4	9f52a92899	video: move d3d.c out of decode sub directory It makes more sense to have it in the general video directory (along with vdpau.c and vaapi.c), since the decoder source files don't even access it anymore.	2017-12-01 17:58:56 +01:00
wm4	7e87feaf15	vo_gpu: hwdec: remove redundant fields The testing_only field is not referenced anymore with vaglx removed and the previous commit dropping all uses. The ra_hwdec_driver.api field became unused with the previous commit, but all hwdec interop drivers still initialized it. Since this touches highly OS-specific code, build regressions are possible (plus the previous commit might break hw decoding at runtime). At least hwdec_cuda.c still used the .api field, other than initializing it.	2017-12-01 05:57:41 +01:00
Mark Thompson	2cf5836293	vo_opengl: hwdec_vaegl: Reenable vaExportSurfaceHandle() It will be present from libva 2.1 (VAAPI 1.1.0 or higher).	2017-11-30 23:35:28 +00:00
wm4	c437267518	vo_gpu: remove hwdec_vaglx interop This has stopped being useful a long time ago, and it's the only GPL source file in the vo_gpu source directories. Recently it wasn't even loaded at all, unless you forced loading it.	2017-11-30 04:19:12 +01:00
sfan5	3471e27efb	vo_gpu/context_android: Process surface resizes correctly	2017-11-14 20:46:18 +02:00
wm4	9922b6ff43	vo_gpu: ra_gl: remove stride hack Same reasoning as in commit `9b5d062d36`.	2017-11-12 13:49:00 +01:00
James Ross-Gowan	4b014b3a81	vo_gpu: move d3d11_screenshot to shared code This can be used by the ANGLE backend and ra_d3d11.	2017-11-07 20:27:13 +11:00
James Ross-Gowan	9b2dae79b1	vo_gpu: d3d11: add RA caps for ra_d3d11 ra_d3d11 uses the SPIR-V compiler to translate GLSL to SPIR-V, which is then translated to HLSL. This means it always exposes the same GLSL version that the SPIR-V compiler supports (4.50 for shaderc/glslang.) Despite claiming to support GLSL 4.50, some features that are tied to the GLSL version in OpenGL are not supported by ra_d3d11 when targeting legacy Direct3D feature levels. This includes two features that mpv relies on: - Reading from gl_FragCoord in the fragment shader (requires FL 10_0) - textureGather from any texture component (requires FL 11_0) These features have been exposed as new RA caps.	2017-11-07 20:27:13 +11:00
James Ross-Gowan	68eac1a1e7	vo_gpu: d3d11: initial implementation This is a new RA/vo_gpu backend that uses Direct3D 11. The GLSL generated by vo_gpu is cross-compiled to HLSL with SPIRV-Cross. What works: - All of mpv's internal shaders should work, including compute shaders. - Some external shaders have been tested and work, including RAVU and adaptive-sharpen. - Non-dumb mode works, even on very old hardware. Most features work at feature level 9_3 and all features work at feature level 10_0. Some features also work at feature level 9_1 and 9_2, but without high-bit- depth FBOs, it's not very useful. (Hardware this old is probably not fast enough for advanced features anyway.) Note: This is more compatible than ANGLE, which requires 9_3 to work at all (GLES 2.0,) and 10_1 for non-dumb-mode (GLES 3.0.) - Hardware decoding with D3D11VA, including decoding of 10-bit formats without truncation to 8-bit. What doesn't work / can be improved: - PBO upload and direct rendering does not work yet. Direct rendering requires persistent-mapped PBOs because the decoder needs to be able to read data from images that have already been decoded and uploaded. Unfortunately, it seems like persistent-mapped PBOs are fundamentally incompatible with D3D11, which requires all resources to use driver- managed memory and requires memory to be unmapped (and hence pointers to be invalidated) when a resource is used in a draw or copy operation. However it might be possible to use D3D11's limited multithreading capabilities to emulate some features of PBOs, like asynchronous texture uploading. - The blit() and clear() operations don't have equivalents in the D3D11 API that handle all cases, so in most cases, they have to be emulated with a shader. This is currently done inside ra_d3d11, but ideally it would be done in generic code, so it can take advantage of mpv's shader generation utilities. - SPIRV-Cross is used through a NIH C-compatible wrapper library, since it does not expose a C interface itself. The library is available here: https://github.com/rossy/crossc - The D3D11 context could be made to support more modern DXGI features in future. For example, it should be possible to add support for high-bit-depth and HDR output with DXGI 1.5/1.6.	2017-11-07 20:27:13 +11:00
James Ross-Gowan	8020a62953	vo_gpu: export the GLSL format qualifier for ra_format Backported from @haasn's change to libplacebo, except in the current RA, there's nothing to indicate an ra_format can be bound as a storage image, so there's no way to force all of these formats to have a glsl_format. Instead, the layout qualifier will be removed if glsl_format is NULL. This is needed for the upcoming ra_d3d11 backend. In Direct3D 11, while loading float values from unorm images often works as expected, it's technically undefined behaviour, and in Windows 10, it will cause the debug layer to spam the log with error messages. Also, apparently in GLSL, the format name must match the image's format exactly (but in Direct3D, it just has to have the same component type.)	2017-11-07 20:27:13 +11:00
James Ross-Gowan	41dff03f8d	vo_gpu: add namespace query mechanism Backported from @haasn's change to libplacebo. More flexible than the previous "shared \|\| non-shared" distinction. The extra flexibility is needed for Direct3D 11, but it also doesn't hurt code-wise.	2017-11-07 20:27:13 +11:00
wm4	99dd2f57f0	vo_gpu: ra_gl: fix minimum GLSL version to 120 Not sure why there was 110, or why there is even a default.	2017-11-03 11:53:31 +01:00
wm4	aac74b7c36	vo_gpu: ra_gl: fix crash trying to use glBindBufferBase on GL 2.1 Apparently this is required, but it doesn't check for it. To be fair, this was tested by creating a compatibility context and pretending it's GL 2.1. GL_ARB_shader_storage_buffer_object actually requires GL 4.0 or up, but GL_ARB_uniform_buffer_object requires only GL 2.0.	2017-11-03 11:39:15 +01:00
wm4	b7ce3ac445	vd_lavc: remove need for duplicated cuda GL interop backend This is just a dumb consequence of HWDEC_ types somehow being part of both decoder and VO. Obviously, the VO should only care about supporting specific hardware surface types or providing specific device types, but until they are separated, stupid unintuitive mismatches will occur.	2017-10-30 18:31:20 +01:00
Ryo Munakata	046fe45950	hwdec_drmprime_drm: fix segv with --hwdec	2017-10-30 12:46:49 +01:00
wm4	6b745769b1	vd_lavc: add support for nvdec hwaccel See manpage additions. (In ffmpeg-mpv and Libav, this is still called "cuvid". Libav won't work yet, because it has no frame params support yet, but this could get fixed soon.)	2017-10-28 19:59:08 +02:00
Lionel CHAZALLON	1992bb5151	video : Move drm options to substruct. This allows to group them and most of all query the group config when needed and when we don't have the access to vo.	2017-10-23 21:08:20 +02:00
Lionel CHAZALLON	cfcee4cfe7	Add DRM_PRIME Format Handling and Display for RockChip MPP decoders This commit allows to use the AV_PIX_FMT_DRM_PRIME newly introduced format in ffmpeg that allows decoders to provide an AVDRMFrameDescriptor struct. That struct holds dmabuf fds and information allowing zerocopy rendering using KMS / DRM Atomic. This has been tested on RockChip ROCK64 device.	2017-10-23 21:07:24 +02:00
Lionel CHAZALLON	762b8cc300	video : allow drm primary plane to be transparent for egl context We want primary plane to be one top of overlay (video), so we need it to be 32 bits.	2017-10-23 21:06:53 +02:00
Mark Thompson	26b46950a1	vo_opengl: hwdec_vaegl: Disable vaExportSurfaceHandle() libva 2.0 (VAAPI 1.0.0) was released without it, but it is scheduled to be included in libva 2.1.	2017-10-23 11:58:13 +02:00
wm4	14541ae258	Add checks for HAVE_GPL to various GPL-only source files This should actually cover all of them, if you take into account that some unchanged GPL source files include header files with such checks. Also this was done already for the libaf derived code. This is only for "safety" and to avoid misunderstandings.	2017-10-10 15:51:16 +02:00
Mark Thompson	05cb8d28af	vo_opengl: hwdec_vaegl: Use vaExportSurfaceHandle() if present This new interface in libva2 offers a cleaner way to export surfaces which can then be imported to EGL. In particular, this works with the Mesa driver, so we can have proper playback without a pointless download and upload on AMD cards. This change does nothing with libva1, and will fall back to the libva1 interface (vaDeriveImage() + vaAcquireBufferHandle()) if vaExportSurfaceHandle() is not present.	2017-10-09 21:35:49 +02:00
Mark Thompson	c6e7ced7f4	vo_opengl: context_drm_egl: Don't create a new framebuffer for every frame	2017-10-09 18:40:45 +02:00
Aman Gupta	8fc21fd0d5	vo_gpu: add android opengl backend At the moment, rendering on Android requires ``--vo=opengl-cb`` and a lot of java<->c++ bridging code to receive the receive and react to the render callback in java. Performance also suffers with opengl-cb, due to the overhead of context switching in JNI. With this patch, Android can render using ``--vo=gpu --gpu-context=android`` (after setting ``--wid`` to point to an android.view.Surface on-screen).	2017-10-09 18:36:54 +02:00
Rostislav Pehlivanov	4c7c8daf9c	wayland_common: implement output tracking, cleanups and bugfixes This commit: - Implements output tracking (e.g. monitor plug/unplug) - Creates the surface during registry (no other dependencies) - Queues the callback immediately after surface creation - Cleaner and better event handling (functions return directly) - Better reconfigure handling (resizes reduced to 1 during init) - Don't unnecessarily resize (if dimensions match) Apart from that fixes 2 potential memory leaks (mime type and window title), 2 string ownership issues (output name and make need to be dup'd), fixes some style issues (switches were indented) and finally adds messages when disabling/enabling idle inhibition. The callback setter function was removed in preparation for the commit which will use the frame event cb because it was unnecessary.	2017-10-09 02:23:04 +01:00
Rostislav Pehlivanov	68f9ee7e0b	wayland_common: rewrite from scratch The wayland code was written more than 4 years ago when wayland wasn't even at version 1.0. This commit rewrites everything in a more modern way, switches to using the new xdg v6 shell interface which solves a lot of bugs and makes mpv tiling-friedly, adds support for drag and drop, adds support for touchscreens, adds support for KDE's server decorations protocol, and finally adds support for the new idle-inhibitor protocol. It does not yet use the frame callback as a main rendering loop driver, this will happen with a later commit.	2017-10-03 19:36:02 +01:00
wm4	0c04ce5f0d	vo_gpu: gl: implement proper extension string search The existing code in check_ext() avoided false positive due to sub-strings, but allowed false negatives. Fix this with slightly better search code, and make it available as function to other source files. (There are some cases of strstr() still around.)	2017-10-02 17:30:27 +02:00
wm4	9b60398f4e	video: remove old videotoolbox support Like as in previous commits, you need a very recent FFmpeg (probably git master).	2017-09-26 19:13:26 +02:00
wm4	ae7db6503b	video: drop old D3D11/DXVA2 support Now you need FFmpeg git, or something. This also gets rid of the last real use of gpu_memcpy(). libavutil does that itself. (vaapi.c still used it, but it was essentially unused, because the code path isn't really in use anymore. It wasn't even included due to the d3d-hwaccel dependency in wscript.)	2017-09-26 18:58:45 +02:00
James Ross-Gowan	3d119a0e41	vo_gpu: angle: fix misleading struct name This should have been renamed when it stopped being empty.	2017-09-23 18:33:33 +10:00
Niklas Haas	62ddc85d17	vo_gpu: simplify structs / names Due to the plethora of historical baggage from different eras getting confusing, I decided to simplify and unify the struct organization and naming scheme. Structs that got renamed: 1. fbodst -> ra_fbo (and moved to gpu/context.h) 2. fbotex -> removed (redundant after `2af2fa7a`) 3. fbosurface -> surface 4. img_tex -> image In addition to these structs being renamed, all of the names have been made consistent. The new scheme is as follows: struct image img; struct ra_tex *tex; struct ra_fbo fbo; This also affects derived names, e.g. indirect_fbo -> indirect_tex. Notably also, finish_pass_fbo -> finish_pass_tex and finish_pass_direct -> finish_pass_fbo. The new equivalent of fbotex_change() is called ra_tex_resize(). This commit (should) contain no logic changes, just renaming a bunch of crap.	2017-09-22 16:58:55 +02:00
James Ross-Gowan	fab0448c5e	Revert "cocoa: re-enable double buffering" Enabling double buffering fixed some graphical glitches when entering fullscreen, but it also caused a fullscreen performance regression. We decided that the glitches were preferable to the performance regression. This reverts commit `cee764849e`.	2017-09-22 23:08:46 +10:00
James Ross-Gowan	1d5620a658	vo_gpu: override ra_swapchain_fns for the d3d11 surface ANGLE can take advantage of some of these when using the external swapchain-backed surface.	2017-09-22 14:22:14 +02:00
Niklas Haas	aabe12b0bc	vo_gpu: opengl: fix possible screenshot window crash gl_read_fbo_contents can fail Fixes #4905	2017-09-22 14:20:22 +02:00
Aman Gupta	6254b6d637	vo_opengl_cb: hwdec_ios: fix build [179/188] Compiling video/out/vo_lavc.c ../../video/out/opengl/hwdec_ios.m:135:9: warning: unused variable 'gl' [-Wunused-variable] GL gl = ra_gl_get(mapper->ra); ^ ../../video/out/opengl/hwdec_ios.m:247:48: warning: incompatible pointer to integer conversion passing 'CVOpenGLESTextureRef' (aka 'struct __CVBuffer ') to parameter of type 'GLuint' (aka 'unsigned int') [-Wint-conversion] p->gl_planes[i]); ^~~~~~~~~~~~~~~ ../../video/out/opengl/ra_gl.h:9:45: note: passing argument to parameter 'gl_texture' here GLuint gl_texture); ^ 2 warnings generated.	2017-09-22 05:03:52 +02:00
Niklas Haas	aefd7a90c9	vo_gpu: fix memleak in ra_gl_ctx The ctx->ra was never freed propely, nor was p->wrapped_fb. (TIL: MPV_LEAK_REPORT exists)	2017-09-21 15:51:47 +02:00
James Ross-Gowan	cee764849e	cocoa: re-enable double buffering This causes a performance regression on 10.11 and newer, but the single buffered method was broken and could cause partially rendered frames to be presented to the screen. This reverts `9f30cd8292` and `e543853a7f`.	2017-09-21 15:01:22 +02:00
James Ross-Gowan	75c0c06640	vo_gpu: convert windows/osx hwdecs/contexts to new API	2017-09-21 15:01:17 +02:00
Niklas Haas	65979986a9	vo_opengl: refactor into vo_gpu This is done in several steps: 1. refactor MPGLContext -> struct ra_ctx 2. move GL-specific stuff in vo_opengl into opengl/context.c 3. generalize context creation to support other APIs, and add --gpu-api 4. rename all of the --opengl- options that are no longer opengl-specific 5. move all of the stuff from opengl/* that isn't GL-specific into gpu/ (note: opengl/gl_utils.h became opengl/utils.h) 6. rename vo_opengl to vo_gpu 7. to handle window screenshots, the short-term approach was to just add it to ra_swchain_fns. Long term (and for vulkan) this has to be moved to ra itself (and vo_gpu altered to compensate), but this was a stop-gap measure to prevent this commit from getting too big 8. move ra->fns->flush to ra_gl_ctx instead 9. some other minor changes that I've probably already forgotten Note: This is one half of a major refactor, the other half of which is provided by rossy's following commit. This commit enables support for all linux platforms, while his version enables support for all non-linux platforms. Note 2: vo_opengl_cb.c also re-uses ra_gl_ctx so it benefits from the --opengl- options like --opengl-early-flush, --opengl-finish etc. Should be a strict superset of the old functionality. Disclaimer: Since I have no way of compiling mpv on all platforms, some of these ports were done blindly. Specifically, the blind ports included context_mali_fbdev.c and context_rpi.c. Since they're both based on egl_helpers, the port should have gone smoothly without any major changes required. But if somebody complains about a compile error on those platforms (assuming anybody actually uses them), you know where to complain.	2017-09-21 15:00:55 +02:00
Niklas Haas	2f41b834b3	vo_opengl: make the ra_renderpass names consistent The random space kept screwing me over	2017-09-13 20:53:19 +02:00
Niklas Haas	293c696ddb	vo_opengl: use GLX_MESA_swap_control where available This overrides the use of GLX_SGI_swap_control, because apparently GLX_SGI_swap_control doesn't support SwapInterval(0), but the GLX_MESA_swap_interval does. Of course, everybody except mesa just accepts SwapInterval(0) even for GLX_SGI_swap_control, but mesa needs to be the special snowflake here and reject it, forcing us to load their stupid named extension instead. Meanwhile khronos has done nothing except spit out GLX_EXT_swap_control (not to be confused with GL_EXT_swap_control, which is exported by WGL_EXT_swap_control), that doesn't fix the problem because mesa doesn't implement it anyway. What a fucking mess.	2017-09-13 20:53:17 +02:00
Niklas Haas	0bb67b1055	vo_opengl: always initialize uniforms on first use Even if the contents are entirely zero. In the current code, these entries were left uninitialized. (Which always worked for nvidia - but randomly blew up for AMD)	2017-09-12 03:00:47 +02:00
Niklas Haas	0c2cb69597	vo_opengl: generalize UBO packing/handling This is simultaneously generalized into two directions: 1. Support more sc_uniform types (needed for SC_UNIFORM_TYPE_PUSHC) 2. Support more flexible packing (needed for both PUSHC and ra_d3d11)	2017-09-12 02:57:45 +02:00
Niklas Haas	3faf1fb0a4	vo: avoid putting large voctrl_performance_data on stack This is around 512 kB, which is just way too much. Heap-allocate it instead. Also cut down the max pass count to 64, since 128 was unrealistically high even for vo_opengl.	2017-09-11 18:20:18 +02:00
Niklas Haas	71c25df5e6	vo_opengl: refactor timer_pool_measure (again) Instead of relying on power-of-two buffer sizes and unsigned overflow, make this code more robust (and also cleaner). Why can't C get a real modulo operator?	2017-09-11 02:42:50 +02:00
Niklas Haas	8a4f2f0ac0	vo_opengl: fix out-of-bounds access in timer_pool_measure This was there even before the refactor, but the refactor exposed the bug. I hate C's useless fucking modulo operator so much. I've gotten hit by this exact bug way too many times.	2017-09-11 02:07:04 +02:00
Niklas Haas	0fe4a492c4	vo_opengl: fix out-of-bounds read in update_uniform Since the addition of UBOs, the assumption that the uniform index corresponds to the pass->params.inputs index is no longer true. Also, there's no reason it would even need this - since the `input` is also available directly in sc_uniform. I have no idea how I've been using this code for as long as I have without any segfaults until earlier today.	2017-09-11 01:04:08 +02:00
Niklas Haas	1da53248ab	vo_opengl: refactor/fix mp_pass_perf code This was needlessly complicated and prone to breakage, because even the references to the ring buffer could end up getting invalidated and containing garbage data on e.g. shader cache flush. For much the same reason why we can't keep around the *timer_pool, we're also forced to hard-copy the entire sample buffer per pass per frame. Not a huge deal, though. This is, what, a few kB per frame? We have more pressing CPU performance concerns anyway. Also simplified/fixed some other code.	2017-09-11 00:35:23 +02:00
Niklas Haas	d0c87dd579	vo_opengl: add a gamut warning feature This clearly highlights all out-of-gamut/clipped pixels. (Either too bright or too saturated) Has some (documented) caveats. Also make TONE_MAPPING_CLIP stop actually clamping the value range (it's unnecessary and breaks this feature).	2017-09-10 18:19:46 +02:00
Niklas Haas	5771f7abf4	vo_opengl: add support for vulkan GLSL dialect Redefining texture1D / texture3D seems to be illegal, they are already built-in macros or something. So just use tex1D and tex3D instead. Additionally, GL_KHR_vulkan_glsl requires using explicit vertex locations and bindings, so make some changes to facilitate this. (It also requires explicitly setting location=0 for the color attachment output)	2017-09-04 13:53:23 +02:00
Niklas Haas	62f0677614	vo_opengl: use rgba16 for 3DLUTs instead of rgb16 Vulkan compat. rgb16 doesn't exist on hardware anyway, might as well just generate the 3DLUT against rgba16 as well. We've decided this is the simplest way to do vulkan compatibility: just make sure we never actually need 3-component textures.	2017-09-04 13:53:14 +02:00
Niklas Haas	8cf5799ab1	vo_opengl: refactor scaler LUT weight packing/loading This is mostly done so we can support using textures with more components than the scaler LUTs have entries. But while we're at it, also change the way the weights are packed so that they're always sequential with no gaps. This allows us to simplify pass_sample_separated_get_weights as well.	2017-09-04 13:53:14 +02:00
Niklas Haas	f589a3bd78	vo_opengl: scale deband-grain to the signal range This prevents blowing up for high dynamic range sources, where a noise level of 48 can suddenly mean 4800.	2017-09-03 21:51:48 +02:00
wm4	9f0e358827	vo_opengl: fix overlay mode (again) Did I mention yet that I regret this overlay mode thing?	2017-08-30 12:19:32 +02:00
wm4	a9571fcc0f	vo_opengl: don't discard buffered video on redundant resize calls If a VO-area option changes, gl_video_resize() is called unconditionally. This function does something even if the size does not change (at least it discards buffered frames for interpolation), which can lead to stutter when you keep firing option change events during playback. Check for an actual resize, and if nothing changes, exit early.	2017-08-29 15:15:34 +02:00
wm4	a46500a2c8	vo_opengl: don't assume imgfmt=0 is valid Could cause a crash if anything called ra_get_imgfmt_desc(imgfmt=0). Let it fail correctly. This can happen if a hwdec backend does not set hw_subfmt correctly.	2017-08-29 15:05:32 +02:00
Niklas Haas	cc79d48d22	vo_opengl: fix the renderpass target format at creation time Required for vulkan.	2017-08-27 14:37:47 +02:00
Niklas Haas	7baa18d5f8	vo_opengl: fix misleading comment in ra.h tex_upload is not just for buffers	2017-08-27 14:36:28 +02:00
Niklas Haas	1d47473a7b	vo_opengl: use UBOs where supported/required This also introduces RA_CAP_GLOBAL_UNIFORM. If this is not set, UBOs must be used for non-bindings. Currently the cap is ignored though, and the shader_cache always generates UBO-using code where it can. Could be made an option in principle. Only enabled for drivers new enough to support explicit UBO offsets, just in case... No change to performance, which is probably what we expect.	2017-08-27 14:36:04 +02:00
Niklas Haas	136cf2b770	vo_opengl: add support for UBOs Not actually used by anything yet, but straightforward enough to add to the RA API for starters.	2017-08-27 14:36:00 +02:00
Niklas Haas	8404a354e5	vo_opengl: clarify RA_CAP_DIRECT_UPLOAD This no longer concerns the API user except in as much as the API user probably wants to know whether or not PBOs are active, so keep around the CAP field even though it's mostly useless now.	2017-08-27 14:36:00 +02:00
Niklas Haas	7684fda6ac	vo_opengl: refactor shader_cache binding There's no reason to be needlessly wasteful with our binding points here. Just add a CAP for it.	2017-08-27 14:36:00 +02:00

1 2 3 4 5 ...

997 Commits