00:21 imirkin: enunes: fyi, clip planes were part of ES1.1 (but removed in ES2). if you have an ES1.1 driver, you could see how glClipPlane works there.
00:23 imirkin: e.g. a2xx/a3xx had enough hw to implement 6 fixed clip planes, but not enough to support e.g. gl_ClipVertex from GL 2.0, or the clip distance stuff from GL 3.0
01:12 airlied: dEQP-VK.pipeline.multisample.raster_samples.samples_4.primitive_point,Fail
01:12 airlied: dEQP-VK.rasterization.primitives_multisample_4_bit.no_stipple.points,Fail
01:12 airlied: is all that stands between lavapipe and vk 1.0 conformance
01:21 imirkin: airlied: just stop advertising 4x msaa
01:46 airlied: imirkin: spec requires it unfortuately
02:02 ishitatsuyuki: I'm trying to investigate the color management support with overlays on AMD GPUs, but it seems that the whole display engine source is undocumented... anyone happens to know more about that?
02:02 imirkin: there used to be docs online, but not sure if there are for the latest gens
02:03 imirkin: https://developer.amd.com/resources/developer-guides-manuals/
02:03 imirkin: i don't see anything too relevant though =/
02:04 ishitatsuyuki: yeah, they talk nothing about DCN, which is added in Raven Ridge
02:21 karolherbst: airlied: you know what? we can't pass the OCL CTS in full profile :/
02:21 karolherbst: or maybe we can, but it is still not conformant once we enable images
02:23 airlied: karolherbst: yeah I've been fixing things towards completing CL 3.0 CTS, but I have turned off images :-P
02:24 karolherbst: the biggest issue is supporting 64 textures :/
02:24 karolherbst: that will require some major rewrite...
02:24 karolherbst: luckily I think I already did it...
02:25 airlied: 64 images?
02:25 karolherbst: no, textures
02:25 airlied: is 64 the minimum?
02:25 karolherbst: yeah
02:25 karolherbst: we need 16 samplers inside a kernel
02:26 karolherbst: uhm.. wait
02:26 karolherbst: 64 images.. right
02:26 karolherbst: 128 textures :D
02:26 karolherbst: soo read_images_args == textures is 128
02:26 karolherbst: and write_image_args == images is 64
02:26 karolherbst: embedded devices have to have 8 each
02:28 karolherbst: there is some core support for some of it
02:28 karolherbst: PIPE_MAX_SHADER_SAMPLER_VIEWS == 128
02:29 karolherbst: so 128 textures are supported by core, but not all drivers
02:29 karolherbst: PIPE_MAX_SHADER_IMAGES == 32
02:29 karolherbst: so this is what we need to bump to 64
02:30 karolherbst: okay, so we don't have enough images... not textures
02:30 karolherbst: it's all very confusing :/
02:31 airlied: ah yeah bumping that to 64 shouldn't be too much trouble
02:31 airlied: just lead to some more memory conspumption in llvmpipe
02:39 karolherbst: just some 64 instead of 32 bitfields
02:39 karolherbst: wait..
02:39 karolherbst: ohh no, I don't have code for it I think :D
02:39 karolherbst: I was trying bumping samplers to 128 once
02:42 karolherbst: airlied: :p https://gist.github.com/karolherbst/d99b2fb10028a9d01e1f9a78ec0e75cb
02:42 airlied: karolherbst: nice!
02:44 karolherbst: airlied: spend some time digging through the spec to extract all the version, embedded vs full and custom type reqs: https://gitlab.freedesktop.org/karolherbst/mesa/-/blob/rusticl/src/gallium/frontends/rusticl/core/device.rs
02:44 karolherbst: should be more or less complete
03:38 karolherbst: ahhh.. even rust regex is an external crate :/
03:55 HdkR: While C++ regex is a nightmare? :P
03:55 tehcloud: lol, Rust... they tried to recruit me but I knew something was wrong when I heard the chanting and saw all the white robes
03:58 karolherbst: HdkR: you mispelled C++ is a nightmare
03:58 jekstrand: ^^
04:05 dcbaker[m]: I know you all hate it, but c++11 is actually pretty nice. Much nicer than C
04:05 dcbaker[m]:runs
04:05 imirkin:thinks c++ is great
04:06 imirkin: which these days is pretty much a guarantee that everyone here will hate it
04:07 jekstrand: imirkin: lol
04:07 jekstrand: I like C++ in theory
04:07 dcbaker[m]: C++98 is not that great, but c++11 (or even better 14 or 17) really are nice. C++ has a lot of baggage, but so will rust when it's 50 years old
04:07 jekstrand: I just don't like 95% of the C++ code I've ever read.
04:08 imirkin: yeah, i mean i'm not advising for the Turbo C++ pre-standard variant here...
04:08 airlied: yeah if someone wrote a mesa c++ style guide maybe it would be more consumable
04:08 airlied: but the range of acceptable c++ usage seems to be a sliding window per developer with not much overlap :-P
04:09 airlied: and people slip back into older c++ idioms the longer they've used it for
04:09 airlied: or which stack overflow article they learned it from
04:10 imirkin: yeah, definitely the gcc-2.95-safe c++ is strong within me. haven't had much excuse to learn the new stuff too much
04:10 imirkin: the c++11 stuff seems really nice
04:10 jekstrand: Yeah, unique_ptr itself is a pretty good reason.
04:11 jekstrand: Builtin threading and atomics is nice too
04:12 dcbaker[m]: Having generics and type safe containers in the stdlib is also fantastic
04:12 jekstrand: Yeah
04:12 jekstrand: Some of them aren't as nice as you'd want them to be
04:12 jekstrand:loves intrusive linked lists
04:14 HdkR: dcbaker[m]: https://github.com/FEX-Emu/FEX/blob/master/CMakeLists.txt#L35 I have C++20 enabled by default. Can't complain :P
04:15 jekstrand: But std::vector would be soooo nice....
04:17 dcbaker[m]: I have a piglit implementation in c++14. Other than interfacing with expat and zlib it's been really nice
04:18 jekstrand: dcbaker[m]: What's the status of that? Are we using it in CI? Is it in piglit yet?
04:27 dcbaker[m]: Neither. It works well enough for a lot of OpenGL cases. Can do vkrunner as well. I've got some wip code for junit output and for multi shader tests. Still haven't started on OpenCL
04:28 dcbaker[m]: It's much faster than piglit on my 8 thread icelake though, and uses a lot less memory
09:19 bbrezillon: danvet: did I get the dmafence annotation right in https://patchwork.freedesktop.org/patch/399259/ ?
09:23 danvet: bbrezillon, lgtm
09:27 bbrezillon: danvet: ok, thx
09:56 danvet: mripard, is your latest series culmulative?
09:57 danvet: or do I now have to like review 2 or something like that?
10:30 mripard: danvet: which one, the one about drm_atomic_helper_commit?
11:47 tzimmermann: danvet, sravn, do you remember why shmem helpers are incompatible with fbdev's deferred i/o ?
13:04 jophish: Where might I find some high-level documentation on RADV?
13:33 neobrain[m]: jophish: what kind of information are you looking for?
13:34 neobrain[m]: We don't really have much beyond the amd docs themselves, other than documenting the env vars radv uses I guess
13:40 sravn: tzimmermann: I have no idea about deferred I/O - sorry
13:40 tzimmermann: ok, np
14:11 jophish: neobrain[m]: I'm afraid I don't really know what I want to know yet!
14:11 jophish: I'm just generally interested, and wanted to understand a little better how things work
14:12 neobrain[m]: fair enough :)
14:14 neobrain[m]: what's nice about Vulkan drivers is that the API itself is slim enough that there isn't much of a driver architecture to comprehend anymore, so a good understanding of Vulkan will get you a long way ahead already
14:14 jophish: yeah, that's actually the observation I would make after perusing the source a little but
14:14 jophish: bit*
14:14 neobrain[m]: (though that said, our shader compiler ACO is a different beast :p)
14:15 jophish: I spent a bit of time today exploring the nvidia driver with radare2, I have to say that radv is much more approachable!
14:17 neobrain[m]: haha well, it does help being able to write the driver based on actual docs instead of spending all day reversing the hardware first
14:18 dschuermann: jophish: in src/amd/vulkan/ you find the RADV bits, while the shader compiler can be found in (frontend) src/compiler/nir/ and (backend) src/amd/compiler/
14:25 danvet: tzimmermann, sravn both use the same fields in struct page
14:25 danvet: to make them compatible we need some way to intercept the page fault stuff
14:25 danvet: and we'd need to keep track of dirty pages in some separate bitfield array
14:25 danvet: I think, I haven't actually tried to implement that
14:26 danvet: intercept page fault stuff = wrap vma->vm_ops somehow
14:26 danvet: without upsetting shmem vma->vm_ops
14:27 tzimmermann: danvet, thanks for answering. i tried to run shmem with fbdev without shadow fb and got a console on the display. so i wonderde why it's considered incompatible
14:35 jophish: thanks, dschuermann. Looking forward to reading the shader compiler :)
14:36 danvet: tzimmermann, only starts going boom when you do userspace mmap
14:36 danvet: so maybe fire up that igt and watch
14:37 dschuermann: jophish: also have a look at src/amd/compiler/README.md . if questions remain, feel free to ask :)
14:59 jophish: dschuermann: thanks! will do
15:19 danvet: airlied, CI would be sooooooooooo nice :-/
15:30 karolherbst: dcbaker[m]: I guess that proc-macro crates are not wired up inside meson as well?
15:30 mimi89999: Hello
15:30 mimi89999: Are there any OpenCL tests in Mesa?
15:31 karolherbst: why should the tests be in mesa?
15:31 karolherbst: there are some in piglit, but then there is also the OpenCL CTS
15:32 mimi89999: To test if there are no regressions for example
15:33 karolherbst: sure, but we don't put API tests in mesa, there are all in their own projects
15:33 mimi89999: Ah. OK.
17:17 ajax: anyone want to trade reviews? !7369
17:21 tango_: lol
17:21 tango_: «I'll review yours if you review mine»
17:22 jekstrand: ajax: Hah! Nice try. :P
17:22 jekstrand: ajax: I don't have any MRs that I want reviewed that badly. :)
17:23 tango_: mimi89999: pocl gets its opencl tests from a plethora of sources, you could follow those
17:25 tango_: mimi89999: -- Available testsuites: AMD;AMDSDK2.9;AMDSDK3.0;ASL;arrayfire;clBLAS;CLBlast;clFFT;conformance;CloverLeaf;Halide;IntelSVM;opencl-book-samples;OpenCV;Parboil;piglit;PyOpenCL;Rodinia;shoc;VexCL;ViennaCL;Glow
17:25 tango_: this is what pocl uses
17:54 jenatali: curro, karolherbst: Clover's LLVM deps includes all-targets, do you know if that's actually needed?
17:54 karolherbst: jenatali: probably not
17:55 jenatali: I apparently have to rebuild LLVM to add RTTI :(
17:55 karolherbst: jenatali: but where is that specified?
17:55 jenatali: karolherbst: https://gitlab.freedesktop.org/mesa/mesa/-/blob/master/meson.build#L1469
17:57 karolherbst: seems like it was always like this
17:57 karolherbst: but yeah.. we really just need the CPU and AMD target
17:57 karolherbst: CPU being.. native arch thingy
17:57 karolherbst: maybe check what llvmpipe depends on
17:58 karolherbst: but yeah.. it kind of depends on what drivers to pull in
17:58 jenatali: I don't think Clover needs any specific target
17:58 jenatali: Yeah the driver can pull in what they want, but Clover itself just needs the IR libs
17:58 karolherbst: ohh, wait
17:58 karolherbst: you are right
17:58 karolherbst: we just need AMD
17:58 jenatali: No, AMD needs AMD ;)
17:58 karolherbst: mhhhhh
17:58 karolherbst: we do some LLVM stuff inside clover
17:58 karolherbst: and some of it is AMD specific
17:58 karolherbst: so I wouldn't be surprised if we actually need it
17:58 jenatali: Only if the AMD driver is being built though, right?
17:59 karolherbst: yes
17:59 karolherbst: wait...
17:59 karolherbst: no, I think it's always built atm
17:59 karolherbst: libclllvm
17:59 jenatali: I'm trying to actually build Clover on Windows so I can address curro's comments about sharing more code, we'll see how far I get
17:59 karolherbst: cool
18:00 karolherbst: but maybe it indeed works without a target...
18:00 karolherbst: but we have this "t->createTargetMachine(target.triple, target.cpu,..." code
18:00 karolherbst: in clover
18:01 jenatali: Yeah the target should be spir though
18:01 karolherbst: not for AMD
18:01 jenatali: Sure, but that'd be a runtime error if you tried running with AMD, not a linker error
18:01 Venemo: why does this matter? AFAIK radeonsi just gets a NIR shader from you, doesn't it?
18:01 Venemo: or am I misunderstanding something here?
18:01 karolherbst: we still support the LLVM path
18:02 karolherbst: jenatali: ahh, yeah
18:02 jenatali: Right, no NIR involved there
18:02 jenatali: karolherbst: For the record, here's what we were able to get by with for our CLC frontend: https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7565/diffs?commit_id=581a3c5c4b134dbcfdae411c5b84d9a826f4acd3#0cc1139e3347f573ae1feee5b73dbc8a8a21fcfa_1469_1490
18:02 gitbot: Mesa issue (Merge request) 7565 in mesa "Add Microsoft CLC CLOn12 compiler stack" [Nir, Opencl, Spir-V, D3D12, Panfrost, Opened]
18:02 karolherbst: I think airlied made the nir path work as well.. but no idea what the state is
18:02 karolherbst: jenatali: it doesn't matter anyway.. just use libLLVM :p
18:03 jenatali: Hm?
18:03 karolherbst: I just think people statically linking are the only one bothering
18:03 karolherbst: and on linux you get your big fat libLLVM.so
18:03 karolherbst: which contains everything anyway
18:03 jenatali: Windows doesn't build a shared LLVM
18:03 jenatali: We have to link static
18:04 karolherbst: yeah.. I know. I just don't think our list is cleaned up and things just got added once we ran into a problem
18:04 jenatali: Yep, I'll see if I can help you prune it :P
18:11 mimi89999: jenatali: Still on that Proronix article?
18:11 mimi89999: Phoronix
18:11 Venemo: karolherbst: what is the LLVM path?
18:11 jenatali: mimi89999: Which one?
18:11 mimi89999: About sharing code
18:11 dcbaker[m]: karolherbst: In my branch? Haven't gotten there yet
18:12 jenatali: I don't remember a Phoronix article about sharing code...
18:13 dcbaker[m]: mimi89999: The Intel one? If so you're confusing jenatali with someone else
18:18 karolherbst: Venemo: clover handing over the LLVM directly to r600 and radeonsi
18:18 karolherbst: and it's an llvm with the AMD target
18:18 Venemo: oh, I see
18:19 Venemo: could that work with NIR too?
18:20 airlied: karolherbst: not even sure clover is involved
18:21 airlied: ph I suppose we do pull IR out at some point
18:22 karolherbst: yeah
18:22 karolherbst: clover/llvm/codegen/ and stuff
18:22 karolherbst: there is a bunch of things
18:22 karolherbst: and all very AMD specific
18:23 airlied: well the idea is to just generate machine code from clang frontend, it's technically not amd specific, just no one else has a backend in llvm that wants that :-P
18:24 karolherbst: ohh, it is amd specific as the AMD target has some ABI which clover fullfills
18:25 jenatali: airlied: Like printf :P
18:25 karolherbst: it's just not visible if you look at the code
18:25 karolherbst: yeah, and kernel args for images and other things
18:25 Venemo: what happens with drivers that don't use LLVM?
18:25 karolherbst: they don't go that path
18:25 jenatali: Venemo: They get NIR
18:25 karolherbst: we have some code generating SPIR targeting LLVM
18:25 karolherbst: and this gets translated to spir-v
18:25 jenatali: Right, Clang -> SPIR -> SPIR-V -> NIR
18:26 Venemo: by Clang, do you mean LLVM IR?
18:26 jenatali: SPIR is an LLVM IR
18:26 karolherbst: clang essentially parses the CLC code and gives us llvm
18:27 Venemo: is it the same LLVM IR that for example RADV's LLVM backend creates?
18:27 karolherbst: no
18:27 karolherbst: LLVM IR always has a target
18:27 jenatali: Venemo: For Clover we'd use the spir target triple for Clang
18:27 karolherbst: and it depends what instructions you emit
18:27 karolherbst: they are just incompatible
18:27 Venemo: okay
18:28 Venemo: so, theoretically, if we were interested in the future in using ACO there, it could get NIR, right?
18:29 karolherbst: correct
18:29 Venemo: thanks, that's what I was curious about
18:29 karolherbst: and the amount of additional nir instructions is quite low
18:29 Venemo: I never really paid attention to clover to be honest, so am a bit out of speed here.
18:30 Venemo: for example I don't even fully see why you need llvm there
18:30 karolherbst: because we are too lazy to write a C/C++ parser
18:30 jenatali: I guess someone could write an OpenCL C -> NIR compiler...
18:30 Venemo: oh, I see
18:31 jenatali: Clang just already happens to have one
18:31 karolherbst: yeah.. but preprocessor support?
18:31 karolherbst: all that include magic?
18:31 karolherbst: -D support
18:31 karolherbst: it's quite a lot
18:31 karolherbst: glsl is childs play in comparison
18:31 Venemo: sorry, I wasn't aware that OpenCL was basically C.
18:31 jenatali: Yep. A few restrictions, and a bunch of extensions
18:31 Venemo: yeah, I wouldn't expect you to write a C compiler :)
18:32 dschuermann: karolherbst: wait, you go OpenCL C -> LLVM -> SPIR-V -> NIR?
18:32 jenatali: Yep
18:32 karolherbst: yep
18:32 karolherbst: well, OpenCL also allows you to hand in SPIR-V directly
18:32 jenatali: Or SPIR
18:33 karolherbst: yeah, but SPIR is highly optional
18:33 karolherbst: where SPIR-V is kind of required
18:33 jenatali: True :)
18:33 karolherbst: anyway, going through SPIR-V is the natural choice
18:33 dschuermann: well, ok that is also what the khronos tooling does
18:33 jenatali: karolherbst: Do you know if SPIR-V is required for 3.0?
18:33 karolherbst: no idea
18:33 karolherbst: but I assume it is
18:33 dschuermann: it's not required by that garbage release
18:33 jenatali: airlied: ^^?
18:34 karolherbst: dschuermann: "garbage release"?
18:34 karolherbst: and here we thought CL 3.0 was the first _sane_ release since CL 1.2
18:34 karolherbst: jenatali: ahh "Creating programs from an intermediate language (such as SPIR-V) is optional for devices supporting OpenCL 3.0."
18:34 jenatali: Huh, interesting
18:34 karolherbst: oh well
18:35 dschuermann: the vendor can give you a list of supported IRs, be it SPIR, SPIR-V or whatever (PTX anyone?)
18:35 karolherbst: yeah.. it surprises me as well
18:35 dschuermann: how is that a good spec by any means?
18:35 karolherbst: dschuermann: that was true since CL 1.0
18:35 jenatali: I thought for sure they would've kept that as required
18:35 jenatali: karolherbst: IRs was 2.0 I believe, with SPIR-V becoming a required IR in 2.1 I think
18:35 karolherbst: clCreateProgramWithBinary
18:36 karolherbst: could abuse it for PTX if you wanted to
18:36 jenatali: Yeah but SPIR was the only IR that could go down that path as an extension, IRs wasn't really spec'd until 2.0
18:36 karolherbst: right
18:36 dschuermann: don't have to: In addition, platform vendors may support their own IL if this is appropriate.
19:04 karolherbst: dschuermann: point was, this was possible since CL 1.0 anyway
19:04 karolherbst: jenatali: I guess SPIR-V is optional for all those FPGA vendors who don't bother with vulkan anyway
19:05 dschuermann: yes, but now spir-v is optional again... so, for portability you can only rely on OpenCL C
19:05 jenatali: I guess
19:05 karolherbst: ahh, yeah
19:05 karolherbst: maybe we should push for 3.1 making it required again :D
19:05 jenatali: dschuermann: For portability, it's always only been OpenCL C, since there's plenty of implementations that stopped at 1.2
19:06 karolherbst: nvidia *cough*
19:06 airlied: just need to write a spir-v ptx and do CLonCUDA
19:06 imirkin: llvm has a ptx output i think
19:07 karolherbst: airlied: you mean spir-v -> llvm -> ptx?
19:07 airlied: nope
19:07 karolherbst: well, llvm do has PTX output
19:07 airlied: I suppose you could involve llvm in there but I'm going to guess it's non-trivial
19:07 jenatali: airlied: I'm pretty sure NVIDIA's proprietary CL driver (at least on Windows) *is* CL on CUDA
19:07 dschuermann: there is already sycl on cuda iirc
19:07 karolherbst: jenatali: yep
19:07 airlied: having ptx output and getting the LLVM IR that backend expects might be different things
19:08 imirkin: jenatali: there's a cuda-ish core, you can see its ugly head in GL too
19:08 imirkin: like when you trigger some error, it'll give you some nonsensical error which makes it clear there are 10 layers of conversion going on
19:08 karolherbst: especially if you do compute shaders
19:30 mimi89999: OpenCL on CUDA isn't already implemented by POCL?
19:31 karolherbst: I think it actually is
19:31 imirkin: but NIH...
19:33 airlied: mimi89999: indeed they've done some stuff, must see if it ever landed in their master branch
19:34 airlied: yup it's in there
19:34 airlied: https://github.com/pocl/pocl/blob/master/lib/CL/devices/cuda/pocl-ptx-gen.cc
19:35 airlied: I think we talked back in time about using pocl as the frontend instead of clover
19:36 airlied: I wonder if they do barriers yet or just ignore them
19:37 karolherbst: airlied: biggest question is on how to integrate it into mesa though
19:37 airlied: yeah it has an accel plugins backend
19:38 airlied: not sure how to integrate that
19:38 karolherbst: yeah.. that's my biggest worry as well
19:38 karolherbst: do we really want to expose galliums API to the outside?
19:38 karolherbst: I guess the answer is no
19:38 airlied: no we would need a wrapper around it
19:38 karolherbst: yeah
19:38 airlied: which means inventing another API
19:38 karolherbst: yep
19:38 airlied: pocl might be good for CL on lvl0 type thing
19:39 imirkin: we could call it the Gallium Library
19:39 imirkin: aka GL
19:39 karolherbst: airlied: might be
19:40 dcbaker[m]: imirkin: let's pick a different metal. Mercurial has a nice ring to it :D
19:40 dcbaker[m]: that way we wont confuse anyone
19:40 imirkin: good idea
19:40 imirkin: (sadly the metal itself, in english, is 'mercury')
19:41 dcbaker[m]: lame
19:42 tanty: dcbaker[m] : could I get too quick reviews at
19:42 tanty: https://gitlab.freedesktop.org/mesa/piglit/-/merge_requests/413
19:42 gitbot: Mesa issue (Merge request) 413 in piglit "templates: fix summary generation when not all image keys exist" [Infrastructure, Regression, Opened]
19:42 tanty: and
19:42 tanty: https://gitlab.freedesktop.org/mesa/piglit/-/merge_requests/414
19:42 gitbot: Mesa issue (Merge request) 414 in piglit "cmake: move the INCLUDE commands after the project command" [Infrastructure, Opened]
19:42 tanty: ?
19:43 imirkin: dcbaker[m]: but dma-buf could have been called bitbucket ... missed opportunity
19:43 dcbaker[m]: imirkin: I came up with something even better "The Gallium Serialized Interchange Format" TGSI for short
19:43 imirkin: hehe
19:44 imirkin: the T = Tungsten btw
19:44 imirkin: (which is actually W iirc)
19:45 dcbaker[m]: Yeah, from Tungsten Graphics
19:45 imirkin: yep. just in case you weren't aware. you clearly were though :)
19:46 vsyrjala: clearly gallium should have been called wolfram graphics library (wgl)
19:47 karolherbst: or cupper graphics library (cgl)
19:48 dcbaker[m]: "Graphics Interface Translation HUB"
19:49 mimi89999: I tried to run the CL test in Piglit with `./piglit run cl results/cl`, but got `FileNotFoundError: [Errno 2] No such file or directory: '/home/michel/git/piglit/generated_tests/cl/builtin/int'`.
19:49 dcbaker[m]: you need to build the opencl tests
19:49 dcbaker[m]: they're not on by default
19:49 dcbaker[m]: -DPIGLIT_BUILD_OPENCL=true IIRC
19:50 danvet: robclark, should I apply the msm patches in Lee Jones series to drm-misc-next or will you apply?
19:50 danvet: [PATCH v2 00/42] Rid W=1 warnings from GPU (non-Radeon)
19:50 danvet: patches 15-20
19:51 danvet: agd5f_, I'll leave patch 1 in that series to you since amd
19:51 danvet: I think the others I'll just all apply and done
19:51 danvet: sravn, ^^ or did you want to do that?
19:51 robclark: danvet: iirc they were all standalone.. so I can pick them up
19:52 dcbaker[m]: tanty: done
19:52 mimi89999: `Manually-specified variables were not used by the project:
19:53 jenatali: karolherbst: Any idea why Clover has a dependency on libelf?
19:53 karolherbst: AMD
19:53 tanty: dcbaker[m]: thanks a lot!
19:54 danvet: robclark, yeah they're all standalone
19:54 karolherbst: jenatali: src/gallium/frontends/clover/llvm/codegen/native.cpp
19:54 danvet: it's more that there's a ton
19:54 danvet: and "maybe merging right before merge window pull" doesn't scale for this I think :-)
19:54 mimi89999: Maybe `cmake . -D PIGLIT_BUILD_CL_TESTS=on`
19:54 jenatali: :(
19:54 danvet: agd5f_ does a great job applying, but I think that's about it
19:55 dcbaker[m]: mimi89999: I also just end up using ccmake because figuring out what arguments a cmake project takes is otherwise impossible :)
19:55 dcbaker[m]: s/also/always/
20:01 sravn: danvet: my only concern is that we get them merged so we avoid a lot of lingering patches. So please go ahead
20:03 sravn: I did a quick W=1 run if drivers/gpu/ and there was a tons of warnings, but I think Lee's series will fix the most. The ones remaining are drivers that requires specific archs or configs
20:10 jenatali: curro: FYI, Clover won't build with MSVC very easily... Fun fact, Mesa's c99_compat.h breaks LLVM headers by #defining restrict to __restrict, meaning that __declspec(restrict) becomes invalid
20:17 agd5f_: danvet, yeah, I'll take care of all the AMD ones
21:30 airlied: jenatali: I hadn't considered changing the spirv parsing for printf, I suppose I can extract the strings easier there
21:30 jenatali: airlied: That'd work, or you could during during the lowering from printf intrinsic -> nir sequence of variable stores
21:33 jekstrand: Are there Fedora packages for clang 10? I see libclang-10 but not the executable
21:34 airlied: jekstrand: just clang?
21:34 airlied: we don't tend to package older compiler frontend though
21:35 jekstrand:links clang-11 to clang-10 and hopes no one notices...
21:36 airlied: jenatali: in vtn_opencl.c I assume you mena
21:36 jenatali: jekstrand: I meant in nir_lower_printf
21:36 airlied: jenatali: doing it in the nir lowering pass seems non-trivial
21:37 jenatali: Ok, that works
21:37 airlied: jenatali: that hits the problem of needing two passes
21:37 jenatali: airlied: I'm not sure I follow why that's the case
21:37 airlied: you don't know what the printf vars are until you hit the printf
21:37 airlied: so once you do then you have to find every store to that var
21:37 airlied: it's not SSA
21:38 jenatali: I'm still not following... at the beginning of the lowering pass there's no var
21:38 jenatali: Ohhhh wait yeah, the args are a local var, right
21:39 jenatali: Sorry I was thinking you meant the buffer where the actual printf data gets stored
21:39 jenatali: airlied: Yeah seems reasonable to pull it all the way to vtn_opencl.c then
21:40 airlied: of course maybe we can just work out how to map things using the address but I'm not confident
21:42 jenatali: Yeah, I didn't expect that to work for everyone
21:44 airlied: yeah I can't see with clover how I can make it work for gpus
21:45 mimi89999: `[710/710] skip: 3, pass: 685, fail: 19, crash: 3` seems quite bad.
21:46 airlied: mimi89999: what is that against?
21:46 mimi89999: https://gitlab.freedesktop.org/mesa/mesa/-/issues/3768
21:46 gitbot: Mesa issue 3768 in mesa "OpenCL might produce incorrect output" [Opened]
21:46 mimi89999: My issue with te RX 580
21:46 karolherbst: mimi89999: seems good enough
21:47 karolherbst: anyway, runing against piglit will probably not fix your problems
21:50 mimi89999: Is it a lost case?
21:51 karolherbst: no, just needs proper debugging
22:00 mimi89999: Can I help with that? I can test anything on that device.
22:06 airlied: writing a piglit test that reproduces the failure might help, if one of the current failing ones isn't obviously it
22:45 mimi89999: airlied: `cl-custom-r600-create-release-buffer-bug` seems like a good candidate.
23:03 karolherbst: curro: I can't remember but was there something I wanted to change to the CL 1.2 image MR?
23:11 curro: karolherbst: IIRC there were some hacks to convert between the CL and gallium array coordinate ordering you were trying to implement a better fix for. or did you fix that already?
23:11 karolherbst: ohhh, right
23:12 karolherbst: totally forgot about it :D
23:12 karolherbst: all I could came up was, making the subclassing suck less or inline samplers super hackish implemented
23:12 karolherbst: btw, did you had a chance to look at the inline sampler stuff?
23:14 curro: karolherbst: why does the subclassing affect how we process coordinates in transfer.cpp?
23:14 curro: karolherbst: i don't think so, do you have a link to it?
23:15 karolherbst: curro: https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7241/diffs?commit_id=d4b50e02db544100512736777203ce62f9873a9c
23:15 gitbot: Mesa issue (Merge request) 7241 in mesa "Draft: OpenCL 1.2 image support" [Opencl, Clover, Opened]
23:15 karolherbst: curro: ohh, no, this was just a general thing to improve maybe
23:16 karolherbst: like having an templated image_base instead of the variants and just use that
23:16 karolherbst: and it would implement the mem type and dimensions stuff directly
23:16 karolherbst: which is more or less the only hard reason we have subclasses
23:17 karolherbst: anyway.. just an idea
23:17 karolherbst: the coordination fixup is critical though ;)
23:18 curro: karolherbst: lemme look through your patches
23:25 mimi89999: I can't find the commit fixing the r600 create release buffer bug that the test is for.