IRC Logs of #dri-devel on irc.oftc.net for 2024-11-25

08:16 mivanchev: hey, developer of static-wine32 here, I have a question regarding 24.3.0. How are the GLX drivers loaded now if driOpenDriver is missing? is there a new architecture without the mega driver?
08:51 MrCooper: mivanchev: are you asking about the client side or the X server side?
08:52 mivanchev: MrCooper, client side. Specifically I want to know what's required to load the GL.so statically
08:52 mivanchev: I'm confused as to what substitutes driOpenDriver
08:54 MrCooper: the new loadable binary is libgallium-<Mesa version>.so
08:56 mivanchev: MrCooper, is this still a mega driver and how is it initialized?
08:56 MrCooper: yes, and via the usual GLX/EGL/... APIs
08:58 mivanchev: ok, so hypothetically if i have libgallium-<Mesa version>.so's code in a static mesa, no further steps will be needed from the Wine side, just calling GL functions?
08:59 MrCooper: driOpenDriver was always Mesa-internal API AFAICT
08:59 MrCooper: not sure it's that simple, it's presumably dlopen'd
09:00 mivanchev: yes, i saw the dlopen so I thought something happens in the initialization code which i can't find
11:23 eric_engestrom: mareko: I think you might know the answer? ^
11:23 eric_engestrom: oop ignore me, for some reason I was missing all the replies from MrCooper 🤦
11:36 Lynne: do GPUs suffer the same performance issues as CPUs when it comes to denormals?
11:45 kode54: presumably they can have the same level of denormal filtering that SSE has on CPUs
11:45 pendingchaos: Lynne: for recent AMD GPUs, I don't think denormals should have a large performance impact
11:45 pendingchaos: omod (free multiplication by 2/4/0.5) doesn't work and most inexact transformations are disabled
11:45 pendingchaos: before RDNA2/GFX10.3, we had v_mad_f32 (unfused multiply-add), which was great but didn't support denormals
11:45 pendingchaos: we can use v_fma_f32 instead but that's inexact and also slow until Vega/GFX9
11:47 glehmann: I think on gfx6 and 7 enabling fp32 denorms caused a slowdown to fp64 rate
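
For readers unfamiliar with the term used in the exchange above: a denormal (subnormal) is a nonzero float smaller in magnitude than the smallest normal value, and "flushing" replaces it with zero. The following is a minimal CPU-side sketch in C, assuming IEEE 754 floats and default floating-point settings (no fast-math, no hardware flush-to-zero mode). The helper names `flush_denormal` and `half_of_smallest_normal` are hypothetical and chosen for illustration; this is not how any GPU or Mesa implements denormal handling.

```c
#include <float.h>
#include <math.h>

/* Hypothetical helper: emulate flush-to-zero in software by replacing
 * any subnormal (denormal) input with a zero of the same sign. */
static float flush_denormal(float x)
{
    if (fpclassify(x) == FP_SUBNORMAL)
        return (x < 0.0f) ? -0.0f : 0.0f;
    return x;
}

/* Half of FLT_MIN (the smallest normal float). With gradual underflow
 * this is a nonzero subnormal value rather than zero. The volatile
 * keeps the multiplication from being folded at compile time. */
static float half_of_smallest_normal(void)
{
    volatile float smallest_normal = FLT_MIN;
    return smallest_normal * 0.5f;
}
```

With denormals preserved, `half_of_smallest_normal()` is a small positive number; passing it through `flush_denormal()` yields zero, while normal values such as `1.0f` pass through unchanged.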
11:48 Lynne: why would a fused mult add be less exact? isn't it a point of fusing to increase accuracy as well as speed?
11:48 pendingchaos: it's not exactly the same as what the programmer wrote, so it's inexact
11:49 glehmann: > and most inexact transformations are disabled
11:49 glehmann: pendingchaos: isn't it the other way around? explicitly flushing denorms disables inexact patterns, explicitly preserving doesn't matter
11:50 pendingchaos: glehmann: right. I misremembered and thought inexact transforms were disabled if any float control was used
11:51 pendingchaos: if the shader author uses the invariant or precise keyword for an expression, we can't use v_fma_f32
11:54 pendingchaos: unless the fma() builtin is used IIRC
13:09 Lynne: by the way, monthly reminder that no profilers for pure compute-only vulkan exist
13:53 glehmann: Lynne: what do you mean? I think radv RGP capture support for compute should be possible, so if you want that, create a mesa feature request issue
13:58 Lynne: nope, RADV RGP only triggers if there's a frame that the client draws
13:58 Lynne: modifying it isn't so easy either
13:59 Lynne: I gave it a shot and I was able to get it to trigger on a dispatch, but it generated blank files, and if I forced it, it simply crashed the GPU
14:07 glehmann: well the chances of someone with a better understanding of the RGP code looking into it are going to be a lot higher with a feature request issue
17:41 mareko: eric_engestrom: libgallium-$version.so is a linked lib now, not dlopen'd
17:42 mareko: Mesa doesn't dlopen gallium drivers anymore
17:56 stsquad: can the mesa intel drivers be built for non-x86 platforms? (I need to test the QEMU native context patches and wondered if cross-arch would work)
17:59 eric_engestrom: mareko: ack, thanks!
18:40 dj-death: stsquad: seems to build fine on an M1
20:20 stsquad: dj-death: excellent.. just need to tweak my buildroot test image build
20:37 eric_engestrom: dj-death: last I heard, i915 doesn't build on aarch64 (because libdrm_intel doesn't build); has that been fixed?
20:37 eric_engestrom: but crocus and iris build fine