IRC Logs of #dri-devel on irc.freenode.net for 2024-01-27

00:13 alanc: robclark: I think that just means the CNA who assigned the CVE hasn't marked it as ready for publication yet
00:16 robclark: alanc: that sounds like incentive to not fix the associated gitlab issues :-P
04:27 robclark: alanc: disregard my last statement... even if they remain embargoed nonsense forever distros still see them.. but maybe we need some clear doc (either in drm kernel docs or mesa) about security boundaries and how UMD isn't one (see https://docs.vulkan.org/guide/latest/validation_overview.html#_undefined_behavior ... but also that is just making explicit for vk what is implicit for legacy APIs, ie gl/es, etc)... that would at
04:27 robclark: least give us something to copy/paste link to to argue against nonsense CVE's rather than have to spend time...
18:36 DavidHeidelberg: karolherbst: around? I'm trying to use cl.clCreateImage2D in tinygrad which supposedly should work, but on both freedreno and iris it fails with -40 (cannot allocate). I looked at gallium properties and rust code and it looks like it should work, but somehow it doesn't
18:43 karolherbst: DavidHeidelberg: -40 as the CL error code?
18:43 karolherbst: -40 is CL_INVALID_IMAGE_SIZE
18:44 karolherbst: usually means that the image is bigger than supported by the hardware
18:44 karolherbst: is checked inside `validate_image_desc`
18:50 DavidHeidelberg: yeah, but both iris and fd has max caps (at least it seems to me (16384)
18:52 DavidHeidelberg: 1st alloc: 12565x6
18:52 DavidHeidelberg: 2nd: 50260x192
18:52 DavidHeidelberg: 3rd: 50260x192
18:55 karolherbst: DavidHeidelberg: all fail?
18:55 karolherbst: ehh
18:55 karolherbst: 2nd and 3rd should fail
18:55 DavidHeidelberg: I assume just the 3rd fails
18:56 karolherbst: yeah, that's not supported by the drivers
18:56 DavidHeidelberg: the status is read afterwards, so...
18:57 karolherbst: yeah.. looks like rusticl behaves correctly here then, need to split the image on the application side
18:57 karolherbst: or just use a raw buffer
19:00 tnt: or use 12565 x 768 . I guess the GPU uses tiling for better caching when using images vs buffer and you might want to keep that behavior.
19:01 DavidHeidelberg: got suggestion on Adreno there is more efficient to use 2dimage because of cache
19:01 DavidHeidelberg: *faster cache for it
19:01 karolherbst: yeah... makes sense
19:01 karolherbst: then you kinda have to split the image if you need images as huge as those
19:02 karolherbst: could also use a 2darray thing and slice it :D
19:36 DavidHeidelberg: karolherbst: btw. 16x16k limit seems to apply only on 2d mipmapped textures
19:36 karolherbst: mhhh
19:36 DavidHeidelberg: for nvidia is possible 65x65k, haven't found data for intel yet
19:36 karolherbst: yeah.. that seems plausible
19:37 karolherbst: if you can ignore MSAA and mimpas you can often bumb the limits
19:37 karolherbst: we just don't have the interfaces in place in gallium to report that
19:39 DavidHeidelberg: u say "new pipe cap to the moon"? :D I heard people love caps
19:42 karolherbst: yeah....
19:42 karolherbst: at least we still have that one MR to clean up compute caps, but I need to pick that up
19:43 karolherbst: https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23870
19:43 karolherbst: or maybe we want to have a special query for texture stuff...
19:43 karolherbst: the current interfaces aren't great
19:43 DavidHeidelberg: AMD should have also 65x65 https://forums.bit-tech.net/index.php?threads/gpus-with-high-max-texture-sizes-to-enable-more-256x-512x-textures-in-minecraft.280478/
19:44 karolherbst: the thing is...
19:44 karolherbst: gallium uses 16 bit in a couple of places, so I'm not quite sure we won't run into weirdo issues as some of the values are signed
19:44 karolherbst: so we might get to 32x32 easily, but 64x64 might be difficult
19:50 DavidHeidelberg: hmm, well, if we get closer to the usefulness of proprietary drivers, why not. I'll just do quick test on freedreno if it'll pass the texture..
19:59 karolherbst: yeah.. the question is if we need new caps or not
19:59 karolherbst: but I susepct we will
20:01 DavidHeidelberg: the regular textures keep the same limit.. so, me too
20:04 jenatali: IIRC you run into issues sampling at that resolution too, not enough bits for subpixel precision or something like that
20:06 karolherbst: some drivers just scale internally to support sampled images
20:07 karolherbst: (well.. the ISA might also have restrictions there)
20:11 jenatali: I know the D3D spec says texture coords go to 16.8 fixed point, so if the size is greater than 16 bit that has to change and you lose subtexel precision
20:58 tnt: i915 0000:00:02.0: drm_WARN_ON(new_crtc_state->do_async_flip && !plane->async_flip)
20:58 tnt: Should I worry ?
21:03 emersion: sounds like a bug
21:05 tnt: yeah, although I'm realizing my kernel isn't all that recent so I should update first before dragging anyone into it :)
21:29 DavidHeidelberg: karolherbst: btw. 65535x65535 passed on freedreno, sadly the code fails later (unrelated to the problem, but not fixed yet). At least the creation worked.
21:30 karolherbst: DavidHeidelberg: the issue is that pipe_box is all signed :)
21:31 karolherbst: and I'm sure people will complain changing that to unsigned or bumping to 32 bit because overhead on the GL side
21:31 karolherbst: so I'd rather make 32x32 work for now unless there are _strong_ reasons to bump it and go through all the work of making everybody happy here
21:32 karolherbst: there are also 32 bit limits on buffer sizes which is a more urgent issue and will probably also require some of that
21:33 DavidHeidelberg: right, the pipe_box making it clean it won't work :D
21:44 tnt: Does glsl has some function to check if a condition is true in any thread of a workgroup ? Something like anyInvocationARB but that works for the whole workgroup ?
21:45 DavidHeidelberg: karolherbst: but.. but... "x and width are used by buffers, so they need the full 32-bit range."
21:46 karolherbst: mhhhhh..
21:46 karolherbst: I wonder if we could expose 64x32....
21:46 DavidHeidelberg: yup :D
21:47 DavidHeidelberg: imho this would cover a lot, because still full 64x64k takes lot of VRAM even these days
21:47 karolherbst: yeah..
21:48 karolherbst: I guess freedreno has to report 16x16 for gl stilll, because MSAA and mipmaps?
21:48 DavidHeidelberg: sure, I think that apply for everyone in general
21:48 DavidHeidelberg: btw. my hack maybe will work, because it's "50260x192" texture
21:52 DavidHeidelberg: we could add pipe cap which will be used for CL and then for height just rusticl would MIN(32768,val)
21:52 DavidHeidelberg: would report
21:53 karolherbst: why does it need to be 50260x192 anyway?
21:53 DavidHeidelberg: I have totally no idea, I tried to read tinygrad code, but I would have to spend much more time to understand it
21:54 karolherbst: yeah.. it's not the best project...
21:54 karolherbst: and not even fast
21:54 DavidHeidelberg: imho it's good that it works relatively good with Mesa, on other hand, the hunt after getting low number of lines.. leads to very condensed code
21:54 karolherbst: yeah.. it's silly
21:55 karolherbst: I should fix those openvino bugs, but they are bugs in openvino.. :D
21:55 karolherbst: DavidHeidelberg: anything clblast based should also work btw
21:55 karolherbst: and that's actually fast
21:56 karolherbst: llama.cpp and whisper.cpp should just work (tm). I've tried the latter already. So if you can get the same thing done with something clblast based, I'd just use that
21:58 DavidHeidelberg: hmm, I'll have to find time to do more exploration :) I stayed with tinygrad since airlied published the blog
21:59 karolherbst: if you want to see something cool, use this: https://github.com/ggerganov/whisper.cpp
21:59 karolherbst: "Real-time audio input example" specifically
21:59 karolherbst: though
21:59 karolherbst: Intel's iGPU was too slow for that
21:59 karolherbst: :D
21:59 karolherbst: kinda
22:00 karolherbst: the base model works, but uhm.. results are not that great
22:00 karolherbst: ohh.. I haven't seen that "--print-colors" flag yet.. should try
22:09 DavidHeidelberg: wow, I'll check on XT6800