00:11DemiMarie: Is there a “how can I get started in Linux graphics” document anywhere? Asking because Qubes might be interested in Intel SR-IOV, Xen virtio-GPU native contexts, or some combination.
00:11DemiMarie: Is it safe to assume that if (unprivileged) userspace can freeze the GPU and the kernel fails to recover from it that is a kernel bug?
02:45alyssa: karolherbst: I think the next big compute stack should be rusticl on zink on radv with the amd llvm backend
02:45alyssa: LLVM -> SPIRV -> NIR -> SPIRV -> NIR -> LLVM
02:45alyssa: :P
02:59HdkR: How many layers of translation can we go?
03:18luc: hi, all, discussions on maillist by replying to **cover letter** of patch series won't be recorded on https://patchwork.freedesktop.org/series/123257/, will they?
06:33itoral: karolherbst: jasuarez told me you were having some trouble with fences in v3d, maybe I can assist you with that
08:02airlied: alyssa: okay I've slogged the functions MR to a place of great happiness :-) I'd appreciate when you next have time!
08:14pq: gildekel, Weston is unfortunate in that respect: it does not handle "link-status" property at all. It does have "HOTPLUG" uevent handling.
08:17pq: gildekel, I don't know what would happen in Weston on link training failure. OTOH, it does handle connectors appearing and disappearing, but that's not what you're asking about.
08:20pq: gildekel, I can't see Weston code reacting to mode list pruning in any way. It might be simply not implemented, so I'd guess it'd malfunction anyway already, so your kernel changes can't make it worse.
09:21emersion: pq, most likely a monitor would go black and weston would continue to use it as-if nothing happened
09:21emersion: pq, doesn't weston reload the modelist on HOTPLUG=1?
09:22emersion: (wlroots doesn't, it only does when link-status=bad)
09:23emersion: pq, a link-status property change results in HOTPLUG=1 uevent
09:23emersion: with also CONNECTOR and PROPERTY set
09:23emersion: in the uevent
09:24emersion: so if you don't read CONNECTOR/PROPERTY, it just looks like a normal HOTPLUG=1 uevent
09:24emersion: the kernel uses HOTPLUG=1 as a generic way of informing userspace that something in the KMS state changed
09:38karolherbst: alyssa: obviously
09:39karolherbst: it might be what we have to use until aco can deal with function calls :D
09:39karolherbst: doing that in zink should be trivial
10:05pq: emersion, no, Weston does things backwards: it updates the mode list only after the frontend has already picked a mode, and that only happens when the frontend inits or changes the mode. *facepalm*
10:06pq: Weston does update pretty much everything *else* on hotplug event
10:06pq: and communicates the change to the frontend, so the frontend can choose what to do
10:06pq: just... not the modes
10:08pq: so, nothing to worry about from kernel side, Weston is already broken there
10:10DottorLeo: hi!
10:11DottorLeo: someone has sucessfully used mesa on a phone? what is the best chipset supported by Mesa?
10:15javierm: tzimmermann: I see that drivers/video/fbdev/au1100fb.c also has some logic when !defined(CONFIG_FRAMEBUFFER_CONSOLE) && defined(CONFIG_LOGO)
10:15javierm: tzimmermann: in its au1100fb_drv_remove() callback, it does a au1100fb_fb_blank(VESA_POWERDOWN, &fbdev->info)
10:15javierm: not sure why, because that driver doesn't show the logo like drivers/video/fbdev/au1200fb.c does
10:16javierm: tzimmermann: but maybe you drop that too in your series?
10:16pq: gildekel, *cough* https://gitlab.freedesktop.org/wayland/weston/-/issues/124
10:17javierm: tzimmermann: and drivers/video/fbdev/xilinxfb.c has the same logic in xilinxfb_release() but also doesn't show the logo, maybe a copy&paste thing between these old drivers?
10:22tzimmermann: javierm, my cleanups where specifically for fb_prepare_logo() and fb_show_logo(), so that fb_logo.c can be made optional easily. can these other cleanups be send out separately?
10:22javierm: tzimmermann: yes, sure
10:23javierm: I was mentioning just in case you missed those
10:26tzimmermann: why these drivers do this is not so clear to me. that code should be in the code, if any
10:26tzimmermann: javierm, the definition of the blanking constants is kinda confusing https://elixir.bootlin.com/linux/latest/source/include/uapi/linux/fb.h#L303
10:30pq: tzimmermann, I can guess what those mean in the signal, but I don't know why the H and V suspend modes exist.
10:31javierm: tzimmermann: yeah, me neither but is weird that they do on their remove/release but nothing on driver probe
10:31javierm: that's why I think is some kind of left over or wrong copy&paste
10:33tzimmermann: pq, i think this corresponds to DPMS levels. wasn't it such that the old CRTs would resume faster/slower on different levels?
10:33pq: the monitor of the original IBM PC comes to mind, which would let smoke out if HSYNC pin was left in a wrong state for a too long moment.
10:34pq: tzimmermann, yes, kind of.
10:34pq: I presume that if you syncs flying, the CRT circuitry stays alive and warm, and the high voltage remains.
10:35pq: if you stop HSYNC, you lose high voltage, because usually the flyback transformer is driven directly from that, without an oscillator of its own.
10:36pq: getting the high voltage back may take a moment, but what takes a lot longer is if the CRT filament cools down.
10:36tzimmermann: "which would let smoke out if HSYNC pin was left in a wrong state" that's hilarious!
10:36pq: I don't what controls the CRT heater.
10:37pq: *I don't know
10:37tzimmermann: thanks for the HW details
10:38pq: vacuum tubes! \o/
10:40pq: filament... I mean the cathode
10:43ccr: similar kind of issue existed with certain Commodore PET models. you could, by manipulating some video register(s), cause damage to the CRT monitor.
10:43pq: yeah, the IBM PC monitor was something. That was before VGA. With VGA, it was still possible to let the smoke out of some monitors by sending a too high hsync frequency.
10:44pq: precisely because hsync was used to drive the flyback transistor, and too high freq. caused it to melt
10:45javierm: pq: I remember that a DPMS caused a noisy click on my old CRTC monitor
10:46javierm: in fact, every mode set was noisy IIRC
10:47pq: I guess relays, maybe protecting the above mentioned components. :-)
10:47pq: javierm, or was that accompanied with a vibrating image that then settled down?
10:47javierm: pq: I don't remember vibrating image, only the noise
10:47pq: ok
10:48javierm: unless the settle down was too quick for my eye to notice :)
10:48pq: degaussing was a fun feature of CRT monitors too
10:48javierm: tzimmermann: https://marcin.juszkiewicz.com.pl/2012/12/10/how-to-fry-speakers-in-your-chromebook/ is a more recent example
10:48javierm: of SW causing things to melt :)
10:49pq: isn't Asahi also needing to deal with explicit power control to avoid melting those speakers too?
10:50tzimmermann: javierm, "Is this howto useful? I think it is. Cause if you have device broken in some way and you want to get it replaced you can just run it and hope for replacement instead of repair." :D
10:51vsyrjala: iirc n900 had a pulseaudio filter to protect the speakers from unwanted frequencies. good luck if you wanted to not use pulse :/
10:52karolherbst: itoral: yeah.. so trying to bring up rusticl on v3d, but the compute job I'm enqueing is never signaling the fence or something... I honestly don't exactly know what's happening, that's my patch so far: https://gitlab.freedesktop.org/karolherbst/mesa/-/commit/c3d472346704d1a95609f992128dc18a4352919e
10:53karolherbst: I'm probably just missing something.. dunno
10:54karolherbst: I could also make a gallium trace if you think that would help you understand what's going on
10:55itoral: so what is happening exactly? you submit a compute job and it timeouts?
10:58ccr:shakes fist at pulseaudio
11:00karolherbst: itoral: yes
11:00youmukonpaku1337: pulse audio more like pulse midio
11:01itoral: karolherbst: what do you see in dmesg?
11:01karolherbst: itoral: nothing
11:01itoral: nothing? uh, that is very weirs
11:01karolherbst: I know..
11:01itoral: are you sure the job is being submitted?
11:01karolherbst: I suspect somehting with buffer tracking is off.. or context stuff
11:01karolherbst: yes
11:01karolherbst: and the ioctl doesn't return an error
11:02itoral: ugh, that is so weird... could you send an e-mail to me with instructions on how to replicate the issue? (assuming it is not too much work)
11:03itoral: in fact, if you create an issue in gitlab with the details it would be even better
11:03itoral: I am then try to replicate and if I do I figure I should be able to find what is happening
11:03karolherbst: I suspect it's something pipe_context related
11:04karolherbst: ehh wait...
11:04karolherbst: I booted it up again and now it works...
11:04itoral: uh, maybe the gpu was in a silly state :-/
11:04karolherbst: yeah....
11:04karolherbst: I think there were MMU faults in the dmesg, but they were triggered by other jobs? Maybe they messed up the GPU state I guess..
11:05itoral: yes, that could be
11:05karolherbst: v3d fec00000.v3d: MMU error from client L2T (0) at 0x3400, write violation, pte invalid
11:05karolherbst: v3d fec00000.v3d: MMU error from client L2T (0) at 0xa00, write violation, pte invalid
11:05karolherbst: that's what I had seen, but if one job triggering those mess up all the future ones that's kinda a problem for CTS runs :'(
11:06karolherbst: maybe it's also something else
11:06itoral: there is usually one of those on every boot... but you should only see that one
11:06karolherbst: this boot I see none
11:06itoral: if you are seeing more than that, and in parricular, if you are seeing those when you run your workload then there is something wrong
11:06karolherbst: now I have to deal with other errors like "unknown NIR ALU inst: div 64 %76 = u2u64 %75" :'(
11:07karolherbst: yeah..
11:07karolherbst: I try to track down what's causing those errors
11:07itoral: ugh, we don't support 64bit alu :-(
11:07karolherbst: right...
11:07karolherbst: probably needs more int64 lowering
11:07karolherbst: but then it requires packing
11:07itoral: yep
11:08karolherbst: also.. the hw doesn't support fma?
11:08itoral: nope :-(
11:08karolherbst: :'(
11:08itoral: yeah...
11:08karolherbst: CL requires _precise_ fma :')
11:08karolherbst: so fma will be lowered to software
11:09itoral: right
11:09karolherbst: though I wonder if we can improve the lowering in libclc :D
11:09karolherbst: anyway..
11:09karolherbst: u2u64 lowering needs pack_64_2x32_split
11:11karolherbst: int64 support is optional though, and I don't know why emulated fma even needs u2u64...
11:15karolherbst: uhh.. the lowering stores the mantissa as a long
11:15karolherbst_: pain
11:15itoral: there is a lower_pack_64_2x32_split right?
11:16karolherbst: ohh indeed
11:23karolherbst: itoral: it stopped working :') and nothing in dmesg
11:25karolherbst: ehh wait.. that's just nir opts infinitely loop.. nvm
11:25karolherbst: why is int64 lowering so cursed
11:26itoral: ouch!
11:26karolherbst: added lower_int64 into my opt loop, because opt_algebrics pack lowering adds u2u64
11:27karolherbst: and uhm.. it doesn't stop now
11:27karolherbst: :D
11:27karolherbst: ahh yes
11:27karolherbst: nir_pack_64_2x32_split lowering uses u2u64 and u2u64 lowering uses nir_pack_64_2x32_split
11:27karolherbst: :')
11:27itoral: oh boy :)
11:28karolherbst: you think if lower_pack_64_2x32_split can be easily implemented though?
11:29karolherbst: ehhh
11:29karolherbst: pack_64_2x32 I mean
11:30itoral: but that would require that we are actually able to have 64bit defs around, no?
11:31itoral: I mean, we would need an ssa that can reference an actual 64bit value
11:32karolherbst: not necessarily
11:34karolherbst: the lowering is literally doing `nir_pack_64_2x32_split(b, x32, nir_imm_int(b, 0))`
11:34karolherbst: could just.. uhm.. treat is as 32 bit :D
11:34itoral: oh
11:34karolherbst: I guess it would get messy with load/stores
11:34itoral: well... sure, we can do that XD
11:34karolherbst: and how that 64 bit value is used
11:34karolherbst: but I think when translating from nir that nonsense _could_ be handled in a hacky way
11:35karolherbst: other int64 lowering isn't as nice though
11:35karolherbst: just the conversion ones just place a 0 in the upper bits
11:36karolherbst: maybe we should just have alternative nir lowering
11:36karolherbst: `(('pack_64_2x32_split', a, b), ('ior', ('u2u64', a), ('ishl', ('u2u64', b), 32)), 'options->lower_pack_64_2x32_split')` mhh
11:36karolherbst: itoral: the thing is... u2u64 is also trivial to implement on 32 bit hardware
11:37karolherbst: but yeah... I suspect for it to not be very cursed, the backend IR would need some support for 64 bit values
11:37karolherbst: which it probably should have for vectorized load/stores anyway
11:37karolherbst: unless you don't have vectorized loads/stores
11:38karolherbst: or I just ignore fma for now...
11:39itoral: maybe we would like a lowering that works in terms of uvec2 instead of u64
11:39itoral: that's what we do for the vulkan device address extension
11:39karolherbst: mhh.. maybe
11:39itoral: which expects device addresses to be 64-bit
11:40karolherbst: translatinv u2u64 to a vec2 shouldn't be too hard
11:40karolherbst: or rather
11:40karolherbst: ehh.. I guess it depends more on where that 64 bit value is coming from
11:41alyssa: airlied: :+1:
11:41karolherbst: yeah.. anyway, if not doing fma stuff things seem to work
11:41karolherbst: - random compiler crashes :')
11:41itoral: cool
11:41itoral: hahaha
11:42karolherbst: _but_ basic kernels seem to work
11:42alyssa: ...does videocore deserve CL
11:43karolherbst: no, but there is high demand
11:43karolherbst: it has 32 bit pointers :')
11:43karolherbst: itoral: the bigger problem is 8/16 bit handling which is mandatory in CL
11:43karolherbst: at least for ints
11:44karolherbst: let's see if I can trigger this MMU fault or if my proper implementation of set_Global_bindings fixed that one
11:44karolherbst: RIP https://gist.githubusercontent.com/karolherbst/a60d48fa8a748fbf580d40fc0ba286c6/raw/793fcc44b74b126889e6a96ba912bdbf78c40fef/gistfile1.txt
11:45itoral: v3d doesn't support native 8bit/16bit int alu, but I guess it can be done in 32-bit?
11:45karolherbst: yeah
11:45karolherbst: more or less
11:45karolherbst: it gets a bit fishy around load/stores but I think we have lowering in place for most of it
11:45karolherbst: `nir_lower_bit_size`
11:45itoral: those MMU errors are a real issue
11:46itoral: probably some out-of-bounds access
11:46karolherbst: mhhh
11:46itoral: [ 1091.642212] v3d fec00000.v3d: MMU error from client L2T (0) at 0x0, write violation, pte invalid
11:46itoral: that one is pretty clear, seems to be trying to write something at a NULL address
11:47karolherbst: mhhh
11:47karolherbst: could be something in my set_global_binding impl
11:48karolherbst: the model CL has with memory is kinda annoying
11:48karolherbst: but I'm also not sure the way I've implemented load/store global is actually correct
11:48karolherbst: but it's just doing 32 bit addresses anyway
11:51itoral: mmm... I think nir_intrinsic_load_global takes a 64-bit value no?
11:51itoral: that's why we added nir_intrinsic_load_global_2x32
11:51itoral: (which is what we use in vulkan for device addresses)
11:52karolherbst: ahh
11:52karolherbst: it's shared memory stuff
11:52karolherbst: itoral: not necessarily
11:53karolherbst: it can take a 32 bit one, but that depends on the API
11:53karolherbst: CL supports multiple pointer sizes and the device simply reports what it supports
11:53karolherbst: so if the runtime claims 32 bit points, all pointers are 32 bit
11:53itoral: aha
11:53karolherbst: anyway..
11:53karolherbst: seeing those mmu errors when using shared memory
11:53karolherbst: which makes sense
11:53karolherbst: because...
11:53karolherbst: uhm
11:54karolherbst: pipe_grid_info::variable_shared_mem
11:54karolherbst: so CL has a cursed model of shared memory
11:54karolherbst: you have 1. in kernel declared static shared memory blocks
11:54karolherbst: and 2. you can pass arbitrary sized shared memory blocks as kernel parameters into the runtime
11:55karolherbst: so you have a static part (declared in nir_shader) + a variable part (set via pipe_grid_info::variable_shared_mem at launch_grid time)
11:55karolherbst: I guess I just need to support that one then :')
11:55itoral: oh, I see
11:56karolherbst: I've added a shader_info::cs::has_variable_shared_mem so backend compilers can deal with some of it
11:56karolherbst: or drivers in general
11:57karolherbst: luckily we know if a shader has variable shared mem at compile time, so drivers can deal with that in some proper way
11:58karolherbst: ohh
11:58karolherbst: I suspect that null pointer is when it's all variable and no shared memory block is allocated
11:58karolherbst: maybe
12:03karolherbst: mhhh
12:04karolherbst: I guess some of the shared_size handling in the compiler needs to be fixed as well
12:06itoral: gotta go now but if you hit anything on v3d where you need asistance feel free to ping me or open an issue and I'll try to help
12:06karolherbst: okay, cool
12:10karolherbst: nice... I've added _broken_ variable_shared_size handling, but at least some/all of the MMU faults are gone
13:20austriancoder: which existing nir pass would do this simple transformation?
13:20austriancoder: from:
13:20austriancoder: 32x4 %5 = load_const (0x00000000, 0x00000000, 0x00000000, 0x00000000) = (0.000000, 0.000000, 0.000000, 0.000000)
13:20austriancoder: @store_reg (%5 (0x0, 0x0, 0x0, 0x0), %12) (base=0, wrmask=xyzw, legacy_fsat=0)
13:20austriancoder: to:
13:20austriancoder: 32 %5 = load_const (0x00000000) = (0.000000)
13:20austriancoder: @store_reg (%5.xxxx (0x0, 0x0, 0x0, 0x0), %12) (base=0, wrmask=xyzw, legacy_fsat=0)
13:21austriancoder: I think my irc client has messed up the last messages :( https://www.irccloud.com/pastebin/kY2ts6HY/
13:48karolherbst: austriancoder: sounds like a job for nir_opt_reuse_constants, but I don't think nir_instr_set_add_or_rewrite (which it uses, and also used by opt_cse) seem to look into vectors
13:49austriancoder: karolherbst: or maybe nir_opt_shrink_vectors .. which I try to hack to support the new nir register thing
13:49karolherbst: mhhh
13:49karolherbst: I _think_ it could rather be a mix of two passes
13:49karolherbst: reuse_constants to use the .xxxx swizzle
13:49karolherbst: and shrink vectors could ditch the unused ones
13:52karolherbst: dunno though, it just kinda feels like a cse problem
13:52karolherbst: and cse is currently based on entire ssa values afaik
13:53karolherbst: I can also see that in the future we should be able to cse parts of vectors, what if you have xy and zw being equal in a vec4?
13:55austriancoder: karolherbst: in the future .. are you working on something?
13:55karolherbst: I'm not
13:55karolherbst: I was just thinking out loud :D
13:58austriancoder: :)
14:28gildekel: emersion:
14:28gildekel: `7:53 PM <emersion> gildekel: yeah i think 0 modes is a kernel bug`
14:28gildekel: I hace seen 0 modes connectors in the past, which did seem weird to me. But I am offering that connectors in such a bad state should be pruned by userspaces. How does sway handles these cases?
14:28gildekel:
14:29gildekel: pq:
14:29gildekel: `*cough* https://gitlab.freedesktop.org/wayland/weston/-/issues/124`
14:29gildekel: Oh boy - that's a 5 year old bug.. heh
14:30emersion: pretty sure sway misbehaves
14:31emersion: when the kernel exposes a 0-mode connector
14:31gildekel: Well, complete link-training failures on SST sources should be fairly uncommon, as the link-training fallback logic usually salvages the process
14:32gildekel: however, link-training fallback is not currently not implemented for MST in i915, which may cause userspaces to hit this case more often
14:32gildekel: So, if my series is approved, you may be running into more modeless connectors in a bad state
14:33emersion: imho still a kernel bug when a 0-mode connector is exposed
14:33emersion: sima: ^
14:34emersion: when i say "kernel mode", i mean that sway should not fix it
14:34emersion: s/mode/bug/
14:35gildekel: Hmm.. I am not entirely convinced that's a bug. How would you otherwise signal userspace that that connector is in a bad state?
14:35gildekel: you can't just prune it in DRM
14:35gildekel: userspace would be left confused about why the connector is completely missing. A connector with 0 modes have a state that userspace can parse
14:35gildekel: inform end users
14:37emersion: you mark it as disconnected, if it really isn't usable, gildekel
14:38gildekel: That's also an option I am suggesting, as an alternative. But that's still not an accurate state for the connector, as it is actually connected. There was successful communication, DPCD reads, modes, link-training even
14:39gildekel: If it is marked as disconnected, again, userspace may be left wondering how come a connected display is ignored without sufficient signal.
14:42zamundaaa[m]: Yeah it would be nice to be able to show the user that the display is connect and just not working. But if we don't get more information than "it doesn't work" then it's probably not too useful vs just not detecting the display in the first place
14:42zamundaaa[m]: Because "it doesn't work" is something the user will notice by themselves already
14:43gildekel: Agreed, but seeing a connected display misbehaving is better than seeing nothing. Also, DRM defines link-training failures to be signaled to userspace via uevents and a connector prop "link-status" to be BAD
14:43karolherbst: also kernel changing behavior even if userspace is buggy is still a kernel regression
14:45gildekel: No arguments there, karolherbst. But the current state of i915 is that there's a gap when link-training completely fails. The failed attempt does not issue a uevent to userspace, and, at least ChromeOS, is left thinking that the modeset was successful and the display is operational.
14:45karolherbst: right...
14:45karolherbst: I think that goes back to the point if userspace can even do anything with that information
14:45karolherbst: can it recover? probably not
14:45zamundaaa[m]: Perhaps a solution could be to add a new connection state? DRM_MODE_CONNECTED_LINK_FAILURE or something like that?
14:46karolherbst: can the kernel tell the user what to do to fix this situation or is it simply broken no matter what?
14:47gildekel: Well, if you get a connected connector with no modes, you could potentially signal to the user that the connector failed to link-train
14:47karolherbst: what does this tell the user even?
14:47gildekel: and suggest to replace the cable, or try a simpler display set up
14:47karolherbst: what should the user do with this information
14:47karolherbst: okay
14:47karolherbst: but "disconnectred" on a connected display kinda means "your cable might be bonkers" already
14:47karolherbst: _but_
14:48karolherbst: I think there is value to tell the user that the cable might be bonkers
14:48gildekel: hey, displays are hard. We face the same difficulty in ChromeOS. We are thinking to show a bubble with a link to possible troubleshooting around displays
14:48karolherbst: right
14:48gildekel: it could be anything from bad cable to incompatibility, to driver bugs.
14:48karolherbst: okay
14:48karolherbst: but does the way the kernel reports this matter here?
14:49gildekel: It does, I think. If a connector comes back simply disconnected, then how do you signal userspace something went wrong vs. nothing was detected on the link?
14:49karolherbst: not disagreeing with your idea to report 0 modes, it's just icky if it breaks userspace
14:49gildekel: It's ok, this discussion is healthy, and exactly what I was hoping for
14:50gildekel: I am trying to make all our lives better.. not harder
14:50karolherbst: yeah...
14:51gildekel: At the end of the day, I want to be able to know in ChromeOS why a connector that succeeded to modeset is not coming up. That's where it all started. I call those "zombie" displays
14:51gildekel: I realized it is because i915 stops issuing uevents to users once RBR x 1 Lane fails
14:51karolherbst: it's just that because this is uapi territory, not breaking userspace is just more important than a clean UAPI
14:51gildekel: Your userspace is already broken
14:51gildekel: you see zombie displays, at best
14:51gildekel: if you're running i915, that is
14:52karolherbst: fair
14:52gildekel: and btw, this change if currently affecting i915 alone
14:52karolherbst: I guess it depends on how bad the regression is. turning non seeing anything on the display into a compositor crash might be reason enough
14:52karolherbst: if it's just the compostitor doing something silly vs not doing something at all it might not matter
14:53gildekel: if you do not ignore 0 mode connectors, then, depending on how you choose modes, you'll hopefully just send a disable modeset request..?
14:53karolherbst: at least if the display is still black, the user could e.g. just check the cables (which they probably will) and the system might recover
14:54gildekel: Sure, that's true. But wouldn't a better user experience be that userspace signals the user that the display is in trouble?
14:54karolherbst: if the display is black, what can you even signal to the user?
14:54gildekel: via parsing the state of a connected connector in link-status=bad, as DRM dictates?
14:54karolherbst: though you still have your primary one...
14:55karolherbst: gildekel: oh.. I meant in case all connectors are bad or something, but I think this issue is about MST specifically, but then again, what if the MST display is the only display connected anyway
14:55gildekel: `10:54 AM <karolherbst> if the display is black, what can you even signal to the user?`
14:55gildekel: Unfortunately, that's a case that can't be alleviated by anything we do
14:56gildekel: if the only display a user has doesn't come up...
14:56gildekel: no amount of signaling can help
14:56gildekel: unless we emit audio
14:56gildekel: but there's no end to this
14:56karolherbst: but yeah... I think _if_ marking this state in a special way helps userspace to report something more meaningful to the user of the system then that's reason enough to be more explicit unless there will be regressions
14:58gildekel: Agreed. I personally believe that a controlled state is better than undefined behavior, which is "zombie" displays. Also don't forget that you produce frames in these scenarios.
14:58karolherbst: right...
14:58gildekel: At least in ChromeOS we do. Frames, mouse warping, layouts, the whole thing.
14:59daniels: I think a pragmatic compromise would be a tweak on what Karol suggested: for current userspace, expose as disconnected, but for userspace which opts in with a client cap, expose as connected-but-useless
15:00daniels: the fact that auxch works isn’t much comfort to userspace; any userspace that’s weird enough to care about the distinction can opt in to receive the new status
15:06gildekel: Alright. Obviously there's some discomfort around the suggested change. I'll think this through some more. Thanks for the input all. Thanks for the suggestion daniels
15:07daniels: np!
15:08daniels: your usecase makes total sense, I think it just needs finesse to keep other userspace doing the right thing is all
15:09gildekel: I completely understand. That's why I initiated the discussion. I don't want to break anyone's stuff.
15:10gildekel: My view is that the solution solidifies/extends DRM specs around complete link-status failures for both SST and MST cases, but I also agree that it's not worth regressions.
15:36daniels: you can add it to the list of things we’d do differently if we were greenfielding it, rather than something that was mostly bashed together at GUADEC 2007 ;)
15:36daniels: *than building on top of
15:44seanpaul_: how about adding a new value to the link-status property which is "terminal" or somethign
15:44seanpaul_: then we don't need a new connector status and _maybe_ don't need a new client cap
15:45daniels: so the connector status is disconnected but the link status is terminal?
15:45seanpaul_: connector would be connected
15:46seanpaul_: modes could be pruned or not (sway should be resilient in both cases, but meh)
15:46daniels: mm, but then you’d still have userspace blithely trying to light up a display which can never be used
15:46seanpaul_: link-status == bad typically means "try another modeset", whereas link-status == terminal means don't bother
15:46zamundaaa[m]: seanpaul_: Old userspace would just overwrite that link-status value with "Good"
15:46daniels: ‘connected’ = ‘pixels will appear somewhere’
15:47seanpaul_: zamundaaa[m]: that's probably ok, they're broken anyways
15:47seanpaul_: daniels: i don't think that's necessarily true, connectors are connected before modeset
15:49seanpaul_: we have this link-status property, it seems wasteful to introduce an entirely new signal which means almost the same thing
15:50daniels: right, but once you do a modeset, you can reasonably expect that they’ll probably work
15:51daniels: like, semantically to userspace, the expectation is ‘this is a connector you can and probably should light up’, not ‘there’s a cable plugged in but it will never work’
15:53seanpaul_: i guess my point is that we already have an exception mechanism to that reasonable expectation, so why not use that
16:09sima: daniels, seanpaul_ gildekel I think emersion 's proposal of just marking terminally fubar connectors as disconnected makes the most sense
16:09sima: userspace can handle that, and most userspace will handle that without falling over
16:09sima: plus if you magically recover the link, you can change to connected again and throw an uevent out to userspace
16:10sima: so I'm not sure why we need to add another awkward corner case here
16:10sima: since connected + 0 modes very much means there's a working screen there, we just don't know how to drive it
16:11sima: such screens do exist (or at least have, on vga, way back)
16:11gildekel: sima:
16:11gildekel: `userspace can handle that, and most userspace will handle that without falling over`
16:11gildekel: But that would mean we lose all potential signal that a connector is in a bad state vs. not connected..
16:11gildekel: I am not completely opposed to this solution, to be clear
16:12sima: gildekel, why do you care?
16:12gildekel: Because I would like to be able to provide some information to the user that a display is connected, but there is a connectivity issue
16:12gildekel: I would like to provide feedback, in any form
16:12sima: like from a user pov, what's the difference between a badly plugged in cable and a shit cable?
16:12dottedmag: Maybe add a property to a connector saying "why this thing is disconnected"? With absence of value meaning "dunno, probably no cable"
16:12sima: in both cases they think it should work, but it doesnt
16:14sima: and in both cases they'll figure out that something is shit, and any attempts at further debugging feel a bit silly to me
16:14sima: like maybe the connector on the board is dented, and no amount of cable replacement is going to fix anything
16:14gildekel: In one case, they are left on their own. In the other, the OS validates their circumstances and provides some feedback
16:14gildekel: I see that as a better user experience.
16:14sima: gildekel, yeah but what is the users going to do?
16:15sima: you can't tell them whether it's the laptop, cable, or sink that's busted
16:15sima: just "something is wrong"
16:15sima: which ... they know, it doesn't work
16:15zamundaaa[m]: sima: you can give them a hint, a list of things to do
16:15gildekel: ^
16:15zamundaaa[m]: For users that don't know anything about computers that's pretty helpful
16:15gildekel: In ChromeOS, we plan on providing a link to a Display Troubleshoot page
16:16gildekel: Displays are hard. period. We can provide basic education, where things can go wrong, cable management, etc.
16:16gildekel: vs. leave them on their own to open yet another bug in which "my displays don't turn on"
16:16gildekel: and call us all mokeys with keyboards on reddit
16:16gildekel: ^ (real story)
16:18sima: yeah that part is unfortunately not optional :-/
16:19sima: but yeah I still think what we should do is 1. handle this within current kms semantics, i.e. set the connector to disconnected or something if it's terminally busted
16:19sima: 2. do some extension on top of that, with the userspace glue to handle it, but that extensions needs to extend, not change the rules
16:20sima: maybe if we set it to disconnected we can abuse link-status=terminal, but that feels a bit risky
16:24gildekel: Well, in that case, how about zamundaaa[m]'s original suggestion to add a connector state? why isn't that sufficient? It'll be in terminal state + link-status bad after the final link training attempt. It should be sufficient to say that if a connector != CONNECTED then userspaces ignore it.. no?
16:25gildekel: And the, should a userspace care, it can parse connectors in terminal state for extra signal
16:35sima: gildekel, that's 1&2 together
16:36sima: and I'm not super keen on auditing everything whether it copes with a new connector state
16:36seanpaul_: sima: can you elaborate on "risky"? it actually might work even better this way since you can mark the base connector as terminal for MST cases
16:36sima: that's even more risky than extending link-status, because almost everything kms userspace looks at connector state
16:36seanpaul_: it's already disconnected
16:36dottedmag: default: fprintf(stderr, "Unknown connector state"); exit)(1);
16:37dottedmag: this kind of risky?
16:37sima: yup
16:39zmike: mareko: https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24849
16:40seanpaul_: sima: to clarify, i was asking about why link-status extension was risky
16:40seanpaul_: (especially if the connector is disconnected)
16:41sima: seanpaul_, well same, but since the users of that are a lot more limited it's probably going to be fine, plus if it's already disconnected even more chances it's going to be fine
16:41sima: but fundamentally the no regression rule means that userspace is right, no matter how stupid
16:41seanpaul_: ok, understood, was curious if there was anything beyond that
16:41sima: and upgrading a "my screen external doesn't work" bug to a "my compositor just died" bug isnt good
16:42sima: seanpaul_, nah just general uapi paranoia
16:42gildekel: Agreed. I can work with a disconnected connector with link-status=terminal
16:42seanpaul_: perhaps i should tighten the straps on my tinfoil hat
16:42gildekel: you guys get a tinfoil hat? :o
16:43gildekel: sima: I'll add you to future revisions of the series, if you don't mind.
16:45zamundaaa[m]: I agree that link-status terminal + disconnected connector would be a good solution
16:45sima: gildekel, little wrapper with docs and all (plus extending uapi docs for the properties too ofc) would be good
16:46sima: since to avoid races you have to set to disconnected before updating link-status, then uevent
16:46sima: and we don't want to give drivers any other way to get to link-status=terminal I think
16:47gildekel: aye aye, Captain :)
16:48gildekel: We already take a similar approach when we set the link status to bad in i915, especially with the recursive function to set downstream MST ports to BAD as well
16:48gildekel: so I feel like this would make sense.
16:55seanpaul_: i assume for the MST case we're going to mark link-status terminal on the base connector and leave everything else as-is?
16:55gildekel: I think we should mark all MST topology as terminal as well
16:55gildekel: Userspaces tend to abstract these details
16:55gildekel: Or, rather, let me speak for ChromeOS
16:56gildekel: Alternatively, we can mark all downstream ports as BAD, but why...?
16:58seanpaul_: hmm, but usually if all connectors are disconnected from an MST branch, they are destroyed
16:59seanpaul_: so that would have the unintended consequence of leaving a fully disconnected MST topology
17:00gildekel: Does the clean up occur independently of a re-probe? because otherwise, a failed link-training even would change the status of the connectors, and, I would assume, the follow-up uevent will trigger the cleanup
17:01seanpaul_: > a failed link-training even would change the status of the connectors
17:01seanpaul_: i don't think it does/would
17:02gildekel: Well, it doesn't, currently. But it will if my series is accepted.
17:02gildekel: A part of the change is to modify the state of all downstream MST ports after a failed link-train
17:02seanpaul_: no, all status would stay the same, base would stay disconnected and sinks would stay connected
17:03gildekel: Not sure I am following.
17:03seanpaul_: link training would fail on the base connector
17:03gildekel: correct, and the new state would be propagated to the downstream ports as well.
17:04gildekel: that's a part of the work we have to do after the base connector fails.
17:04seanpaul_: but those connectors have connected status
17:04seanpaul_: so you would end up with link-status terminal and status_connected
17:05seanpaul_: so i think what you want to do is leave the downstream ports as CONNECTED, leave the base connector as DISCONNECTED and just update link-status on the base connector to terminal
17:05gildekel: I must be misunderstanding you somehow. Here's what I plan to do:
17:05gildekel: 1) Base connector fails link-training
17:05gildekel: 2) Modify its status to disconnected, mark it as terminal
17:05gildekel: 3) If connector is MST, recursively mark its downstream ports to disconnected + terminal
17:06gildekel: 4) send uevent
17:06seanpaul_: re: 2) it's already disconnected
17:06gildekel: It's not.. not at this point..
17:07gildekel: This is during link-training.. the connector comes in in a "good state" to link-training. Connector is connected with link-status=GOOD by kernel or userspace.
17:08gildekel: We can sync offline if you want
17:08seanpaul_: hmm, when does the base connector transition to disconnected in the successful case?
17:08gildekel: Oh. I see what you mean.
17:09gildekel: That's a good point. But shouldn't really matter.
17:09seanpaul_: re: 3) we currently don't support keeping the MST topology alive when all sinks are disconnected, so this would change current behavior
17:11gildekel: But then we're back to square one. We have connected connectors producing zombie displays..
17:11gildekel: on MST
17:11seanpaul_: you would have to inspect the base connector's link-status and do the recursion in userspace
17:12gildekel: why not just let the connectors die out, and have the base connector signal that?
17:12seanpaul_: which, conceptually kind of makes sense b/c the link-status between source & mst branch device is bad, but the link in between branch devices may not be
17:17gildekel: Ok. So do we at least change the link-status to terminal/bad on the downstream ports?
17:17gildekel: I would assume we d.
17:17gildekel: do*
17:20seanpaul_: i'd say no
17:21gildekel: But from this source's point of view, these connectors are unusable.
17:21gildekel: doesn't matter if another potential source can use them
17:45zamundaaa[m]: <seanpaul_> "you would have to inspect the..." <- That would not be backwards compatible with current userspace
17:51gildekel: I believe he was suggesting a solution to proper pruning of the misbehaving connectors in userspace. If left un-handled, then it should produce the same behavior in which the connectors appear to be connected and operational.
17:51gildekel: ...a solution for* proper pruning...
17:53zamundaaa[m]: Sure, I guess that would kinda-ish maybe be fine. But it would be quite different from how userspace operates right now
17:54zamundaaa[m]: by which I mean that most userspace completely ignores that MST is a thing at all. It's abstracted away by the kernel, and all we get as information about it at all is the mst path property
17:54gildekel: correct, that I agree. That will not change.
17:55gildekel: The current behavior is that after a failed link-training, all (base) connectors are left connected, marked as link-status=BAD, and no more uevents are issued. So userspace thinks the last modeset succeeded.
17:55gildekel: This applies to the MST connectors as well
17:56gildekel: If we modify only the base connector to be in terminal state (as it's already marked disconnected by the MST topology manager), and leave the MST connectors as they are (connected and link-status good), then userspace sees them as always
17:56gildekel: the difference is that userspaces that wish to parse the new state, can look up the base connector of the MST ports, and see that it's in a bad state, and prune the MST connectors
17:56gildekel: Otherwise, you'll have your zombie displays, as always, as the MST connectors are marked connected and ready
17:59zamundaaa[m]: I don't think that's a good solution, it's inconsistent between different connectors
18:00zamundaaa[m]: If you need to keep MST downstream ports as connected for kernel-internal reasons, then keep that mess in the kernel. If you want all userspace to do a recursion on the connectors and mark them as disconnected, then why not do that in the kernel when sending connector information to userspace?
18:03gildekel: To clarify, I am with you on this. I would rather mark the downstream connectors as disconnected/terminal
18:03gildekel: but there seem to be more disagreement here. And justly so, because doing that _will_ change current behavior in userspace
18:11zamundaaa[m]: What behavior would it change? If you're referring to MST connectors being marked as disconnected instead of removed, that's not a change that can break userspace (which isn't already fundamentally broken)
18:17gildekel: They will not be marked as disconnected, but removed entirely.. since MST support removes the topology entirely if all downstream connectors are marked as disconnected
18:17gildekel: so you go from having MST connectors showing up, to having no MST connectors
18:17gildekel: that's the change. Whether or not it's safe or acceptable is a different discussion
18:23zamundaaa[m]: But that's only in the kernel. I don't particularly care about what you do in there - if you have to keep connectors marked as connected internally, then do that
18:24zamundaaa[m]: What I'm saying is that when userspace calls drmModeGetConnector, you should - inside the kernel - check if link-status is terminal, and set the connection status to disconnected for userspace
18:25zamundaaa[m]: Then you can simply mark all the affected MST connectors as connected + link-status=terminal inside the kernel, and userspace will see disconnected + link-status=terminal
19:07gildekel: That's a possibility, but sounds a like debugging hell
20:07Kayden: hitting some test failures with an MR on virpipe. anyone know how to reproduce those? looks like something with virgl_test_server and llvmpipe?
20:08Kayden: hm I guess I just run the server and then LIBGL_ALWAYS_SOFTWARE=1 GALLIUM_DRIVER=virpipe program, huh
20:12tintou: yes
20:13Kayden: tintou: thanks!
20:13tintou: To run the test server as in the CI, you can look at https://gitlab.freedesktop.org/mesa/mesa/-/blob/main/.gitlab-ci/deqp-runner.sh?ref_type=heads#L137 to give the right arguments and environment variables
20:13Kayden: how do I get it building virgl_test_server? -Dgallium-drivers=virgl isn't sufficient it seems
20:14tintou: that's in the virglrenderer project https://gitlab.freedesktop.org/virgl/virglrenderer
20:14Kayden: oh, or that's...not part of mesa, got it
20:19Kayden: virglrenderer is what translates the shaders?
20:20Kayden: looks ilke it, yep
20:21Kayden: ah, from TGSI, so there's ntt too
20:29Kayden: now to figure out if this is a virglrenderer bug or a mesa glsl language frontend bug
20:41Kayden: arg, this is confusing.
20:42Kayden: so on my barrier optimization MR, I'm getting rid of unnecessary barrier modes
20:42Kayden: which causes virglrenderer to emit memoryBarrierBuffer() and memoryBarrierAtomicCounter() instead of the full memoryBarrier(). sensible
20:42Kayden: but it's doing #version 140 #extension GL_ARB_shader_storage_buffer_object : require
20:43Kayden: mesa only provides those functions in GLSL 4.30 or ESSL 3.10 or if #extension GL_ARB_compute_shader : enable
20:44Kayden: GL_ARB_compute_shader, however, does not mention a #extension directive
20:44airlied: I assume that means all compute shaders should have them
20:44Kayden: right. however, this is a vertex shader
20:45airlied: okay that seems like a problem then :-)
20:46Kayden: The functions memoryBarrierShared() and groupMemoryBarrier() are available only in compute shaders; the other functions are available in all shader types.
20:46Kayden: so they should be there. but when
20:47Kayden: there's some thought that maybe it's just adding them as an interaction with the other specs...except...memoryBarrier() is provided by ARB_shader_image_load_store, not ARB_shader_storage_buffer_object
20:48airlied: I'd probably just fix virgl on the mesa side by sending more barrier modes that necessary and file a bug to get virglrenderer fix
20:48Kayden: I guess virglrenderer ought to be doing an #extension GL_ARB_compute_shader : require when using memory barriers?
20:48Kayden: yeah, was going to say, not sure how to synchronize changes in the two projects
20:49Kayden: so right now I'm just enabling the pass for all drivers in st/glsl
20:49airlied: I don't think the GL_ARB_compute_shader makes sense at all in mes
20:49airlied: mesa
20:49Kayden: oh?
20:50airlied: since it doesn't look specified
20:50Kayden: yeah.
20:51airlied: ssbo spec says "Additionally, the shading language provides the memoryBarrier() function to control the relative order of memory accesses within individual shader invocations and provides various memory qualifiers controlling how the memory corresponding to individual variables is accessed.
20:51airlied: "
20:51airlied: then never mentions it again
20:52airlied: it's only then mentioned in the image one, which seems like virglrender should be emitting then
20:53Kayden: yeah, that's where the original memoryBarrier is defined
20:54Kayden: makes me think that mesa ought to be adding the memoryBarrierShared/AtomicCounter/Image/Buffer() variants if ARB_compute_shader is supported at all, without an #extension directive, whenever memoryBarrier() is available
20:55Kayden: ah
20:55Kayden: right, okay, so here's the thing, I guess
20:56Kayden: it wasn't possible to create these situations outside of compute shaders before my pass.
20:56Kayden: because you didn't have memoryBarrier*() subvariants outside of compute shaders, or without turning them on yourself somehow
20:56Kayden: hm. I guess you could turn them on and do it though
20:57Kayden: vrend_shader.c does do ctx->shader_req_bits |= SHADER_REQ_IMAGE_LOAD_STORE; for the full barrier but nothing for the others
20:58Kayden: airlied: you might also be interested in a lavapipe fix: https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24842/diffs?commit_id=93ea6fdd706c430e38cc9f738c15a484ad311867
21:04zmike: is that actually affecting anything? the scopes below should be all the filtering needed
21:05Kayden: yeah, dEQP-GLES31.functional.image_load_store.2d.qualifiers.volatile_r32f was failing on zink+lavapipe without the fix, with https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24842/
21:07Kayden: was getting these barriers: http://whitecape.org/paste/lvp-diff.txt
21:09zmike: 🤔
21:09zmike: where are the NONE scope barriers coming from?
21:10Kayden: those are just memoryBarrier() I believe
21:12Kayden: yeah, test has memoryBarrier(); barrier();
21:12Kayden: in a compute shader
21:12Kayden: there's no SSBO or shared or global access, only images
21:13zmike: ah okay so it's a OpMemoryBarrier with scope=NONE?
21:13Kayden: but the barrier() still ought to synchronize the invocations I think
21:13Kayden: well, it's GLSL
21:15zmike: seems legit
21:15zmike: take my rb
21:16zamundaaa[m]: gildekel: you could always add a CAP that exposes the "real" state to userspace. Please just don't make connectors behave differently depending on their role in the MST topology
21:18Kayden: zmike: thank you!
21:24Kayden: robclark: I'm hitting a freedreno failure in https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24842 (barrier mode optimizations) in piglit's spec/arb_compute_shader/execution/simple-barrier-atomics. not sure how best to debug this since I don't have an a6xx handy
21:24Kayden: guessing I deleted some barriers that you were relying on
21:25Kayden: or the atomic counters aren't being accessed via derefs when nir_opt_barrier_modes() is called so it's failing to see them. the place I call the pass seems to be working for other drivers though
21:30airlied: Kayden: I'm a bit worried about emitting nir barriers on shader stages that haven't been traditionally used to seeing them
21:31airlied: though it's likely the problem is going to be mostly sw/virgl where we don't really have specified paths
21:31Kayden: airlied: seems like other than the extension directive thing, it ought to work
21:32Kayden: I guess I could make the pass optional via a nir_shader_compiler_options flag
21:32robclark: Kayden: I'll look in a few.. but you could probably use drm-shim... also I'm a big fan of asserts when it comes to what form a shader is in (since otherwise it is pretty much a mess knowing the right time and order for passes)
21:33idr: I just read issue 13 in the ARB_compute_shader spec, now I'm re-reading the IRC log...
21:34Kayden: robclark: oh can I? that'd be awesome
21:35robclark: well, assuming diffing nir_print and/or IR3_SHADER_DEBUG=disasm output is enough to spot the issue..
21:36Kayden: probably would be
21:37idr: Kayden: Based on my reading of issue 13, I think the "bug" is in virgl, but it occurs because it's getting something it doesn't expect.
21:38idr: Outside of a compute shader, a memory barrier of any sort should get turned into memoryBarrier().
21:40Kayden: idr: why?
21:41Kayden: oh, you mean, when virgl translates back to GLSL.
21:41Kayden: because the sub-functions don't exist there
21:41Kayden: right
21:41idr: Right.
21:41idr: I was going to double-check what GLSL 4.30 says. That may affect things too.
21:42idr: If it's converting to GLSL 4.30, then... maybe.
21:44Kayden: it's using 1.40 + #extensions
21:44idr: I looked at the 4.60 spec because it's what was handy... memoryBarrierAtomicCounter, memoryBarrierBuffer, and memoryBarrierImage exist in all stages there, so I assume that's 4.30+ behavior.
21:44Kayden: robclark: thanks for the drm-shim pointer, I can run the shaders now and I see what's going on :)
21:44robclark: \o/
21:46Kayden: idr: Yeah. I think that's the intention
21:47Kayden: idr: so it's really just the undefined mess of #extension enabling in pre-4.30
21:49idr: Which I interpret as "those new 4.30 functions only exist in compute shaders."
21:50Kayden: *nods*
21:51airlied: yeah I think just force virgl to emit full barriers on non-compute might be the best plan
21:51Kayden: yeah
21:52airlied: we can't fix virglrenderer to fix this bug, it would need feature flags etc and new mesa needs to run on old virglrenderer
21:53airlied: anyone know about the v3d cpu tasks and why they aren't just compute shaders?
21:55alyssa: idr: IIRC when I reworked barriers, I noticed there was virglrenderer brokenness worked around in nir-to-tgsi
21:55alyssa: I have paged out all details but it's worth looking in ntt
21:57Kayden: alyssa: yeah, I could either hack around it in nir_to_tgsi, or in a virgl renderer pass to put them back, or just add a nir_shader_compiler_options flag to avoid calling my pass on non-compute for virgl
21:57alyssa: Kayden: what i meant is, I think there's already brokenness here
21:57alyssa: may or may not be my fault
21:57alyssa: may or may not affect your pass
21:57Kayden: okay :)
21:59Kayden: apparently my atomic counter handling is busted, too. glsl_to_nir uses nir_var_mem_ssbo as the mode for those. but the nir_deref_var is nir_var_uniform with glsl_type atomic_uint
21:59Kayden: and my pass is currently running before nir_lower_atomics_to_ssbo
22:04alyssa: i kinda want to delete nir atomic counters
22:05Kayden: would be a fan
22:06Kayden: I guess it would impact r600
22:06idr: I think that's the only hardware that ever did anything special for atomic counters. :(
22:07Kayden: yeah
22:07idr: I have some vague recollection of warts on the spec because of that.
22:07idr: It wasn't us for once!
22:07alyssa: idr: lol
22:08alyssa: idr: I can blame your employer for spending months of my life on geometry shaders though, right?
22:08alyssa: (-:
22:11idr: Geometry shaders were for sure a group effort.
22:11idr: Honestly... I'd blame Microsoft for requiring GS and fp64.
22:12idr: Khronos was just keeping up with the Jones's.
22:12airlied: the fp64 requirement was one of dumbest self owns in graphics :-P
22:12airlied: pretty sure dx never actually required it, GL just had to one up them
22:13karolherbst: yeah.. who actually thought that was a good idea
22:13karolherbst: though without it, nobody would have implemented fp64 probably...
22:13karolherbst: or it would have been some enterprise only feature only exposed on 5 gpus
22:14airlied: which is exactly how it should have been :-)
22:14karolherbst: :D
22:14idr: MS must have required it, or it never would have ended up in Intel GPUs.
22:14Kayden: hmm, FEATURE_LEVEL_11 with D3D11_FEATURE_DOUBLES
22:16Kayden:not adept at cross-referencing MS docs yet
22:16Kayden: but yeah unfortunate for sure
22:20alyssa: there was a brief, terrible time when Mali GPUs had native hardware for fp64
22:20alyssa: not just fp64
22:20alyssa: fp64vec2!
22:21alyssa: So you can add two doubles in one instruction! :-D
22:22airlied: Kayden: yes so it was optional in d3d11, gl4.0 should have just left it as an extension
22:23airlied: or adopted some sort of capabilities for exts
22:35karolherbst: alyssa: but why...
22:35alyssa: karolherbst: the same genius 128-bit ALU that brought you native vec16 instructions
22:36karolherbst: I mean.. I kinda see the point of a 128 bit alu, but native fp64?
22:37karolherbst: I wonder if I finally have a CTS run on v3d without crashing the GPU
22:49Kayden: huh, this is new
22:50Kayden: went to go test my updated MR !24842 to make sure I fixed the virpipe and freedreno regressions, and... https://gitlab.freedesktop.org/mesa/mesa/-/pipelines/979688 says "You are not authorized to run this manual job" on any of the container steps. I used to click the play button there to cause it to cascade through the other jobs and actually test stuff
22:50Kayden: did something change? am I supposed to be doing this differently now?
22:51Kayden: oh.
22:51Kayden: wasn't signed in, heh
22:52Kayden: (signed in twice today but various browser tabs are having...problems. *shrug*)
23:05DemiMarie: Could one implement Venus on top of WebGL?
23:06DemiMarie: airlied: hard disagree on fp64, you kind of need it for many scientific computing workloads
23:09airlied: exactly so we don't need it :-)
23:09airlied: like intel even dropped it from their hw going forward
23:10HdkR: I like the NVIDIA approach. One fp64 pipeline per SM for compatibility, if you need real perf then buy the compute focused cards :D
23:44DemiMarie: airlied HdkR: who is the “we” in “so we don’t need it”?
23:45airlied: 99% of graphics/compute developers, the GL4 API mandating it etc
23:45airlied: like it was fine as an extension for specialist hardare like dx11 did it
23:46karolherbst: fp64 is unusable on any desktop GPU anyway