03:38airlied: dakr: actually splitting the lock worked, so I'll send out two patches to fix the object corruption
03:46fdobridge: <airlied> @gfxstrand https://patchwork.freedesktop.org/patch/580696/ that is the proper patch (not much different to the one you are running)
03:46fdobridge: <airlied> at least should kill that one bug, now I've got to work out the other painful one
03:50fdobridge: <gfxstrand> Cool
04:03fdobridge: <Sid> hm, did I do something wrong..
04:03fdobridge: <redsheep> I certainly did. Is it expected that installed mesa drivers are faster than ones that aren't installed?
04:03fdobridge: <redsheep> Even when the build script involved is absolutely identical, save installing it?
04:06fdobridge: <Sid> I don't see the patch I sent on dri-devel archives, hmm
04:06fdobridge: <Sid> neither on patchwork
04:06fdobridge: <airlied> yeah it probably got stuck in moderation
04:07fdobridge: <airlied> I can just pull it from discord 😛
04:07fdobridge: <redsheep> This performance testing is starting to make me seriously paranoid, I might have been wrong about the regression on 27840
04:07fdobridge: <Sid> ah, moderation e-e
04:08fdobridge: <Sid> hang on I'll reupload with better commit message
04:10fdobridge: <redsheep> I don't know how I possibly could have foreseen that installing a driver would have a serious performance impact
04:10fdobridge: <Sid> https://cdn.discordapp.com/attachments/1034184951790305330/1212975332986978304/0001-drm-nouveau-keep-DMA-buffers-required-for-suspend-re.patch?ex=65f3ca4f&is=65e1554f&hm=95d4b8a222f2b4431b8a3248dfafa114d92145c5a8a25bbd83430fdf1d8857d0&
04:11fdobridge: <Sid> just some minor grammar changes that come from moving the Fixes bit down in the commit message
04:27fdobridge: <redsheep> OK I am officially confused. Not only does the exact same commit yield higher performance by getting installed in The Talos Principle, the opposite is true in The Witness.
04:27fdobridge: <redsheep>
04:27fdobridge: <redsheep> The witness is markedly slower on an installed driver, again with exactly the same build parameters. Further, when doing more tightly comparable testing where you use two different non-installed builds with only 27840 as the variable under test the performance difference is so small it's not measurable in both games.
04:45fdobridge: <redsheep> @gfxstrand Apparently my testing was tainted again ^
04:45fdobridge: <redsheep>
04:45fdobridge: <redsheep> Those numbers for 27840 were bogus, and I have absolutely no idea how. Does mesa load libraries at runtime that could be different depending on location or something? Maybe the phase of the moon?
04:47fdobridge: <redsheep> The only thing I can possibly figure is that when I query my actual environment without specifying an icd path my vulkaninfo has one entry that says llvm, and another that does not... Maybe that matters?
04:48fdobridge: <redsheep> Is llvm involved when using nvk at all?
04:49fdobridge: <redsheep> https://cdn.discordapp.com/attachments/1034184951790305330/1212985004963926016/confusion.txt?ex=65f3d351&is=65e15e51&hm=5e9368b5a2e09484dd610a5674aeb75fb0d10b43c2bc273dc75d8b6f4feb4585&
04:50fdobridge: <redsheep> If it isn't that llvm thing I fail to see how it's possible that specifying no icd vs specifying one built on the same commit could make a difference
04:57fdobridge: <redsheep> I really wanted to work on calibrated timestamps tonight, instead I wasted all of my time trying to confirm I wasn't spouting nonsense numbers, and I was 😦
05:03fdobridge: <gfxstrand> The best way to make sure you're getting the driver you think you're getting is to set `VK_ICD_FILENAMES` directly. With that, the Vulkan loader will load the driver you specify and only the driver you specify.
05:04fdobridge: <redsheep> Yeah I was under the mistaken impression that just matching commit was enough, I guess.
05:08fdobridge: <redsheep> This might still be worth investigating further, if there's something in my environment that gets loaded when the ICD is not specified that can swing performance by nearly double in some cases then that's worth knowing about in detail.
05:26fdobridge: <gfxstrand> So, the LLVM thing is because in the 3rd one, you're getting both NVK and lavapipe. Apps should still be preferring NVK, though.
05:27fdobridge: <redsheep> Ah, I see. Well there's probably no way lavapipe is fast enough to be competitive here, but I don't know if I know anything for sure anymore, maybe that is the issue.
05:32fdobridge: <gfxstrand> :blobcatnotlikethis:
05:37fdobridge: <airlied> lavapipe dreams of one day being mistaken for a real life driver 😛
05:39fdobridge: <gfxstrand> https://tenor.com/view/im-in-your-dreams-gif-19262251
05:40fdobridge: <redsheep> I wonder if there's any way I could be 100% certain that when I test running my session on zink that it's not just llvmpipe or lavapipe+zink
05:40fdobridge: <redsheep> It would certainly explain it being so slow, even after damage got wired up
05:44fdobridge: <gfxstrand> Yeah, so the earlier version of that patch just completed an 18-thread CTS run with no flakes.
05:44fdobridge: <gfxstrand> You have no idea how happy this makes me.
05:45fdobridge: <gfxstrand> The IRQ thing does still seem to strike sometimes, though.
05:45fdobridge: <gfxstrand> Or maybe it's something else. IDK. All I know is that sometimes the GPU just goes out for a long lunch and never bothers to come back to the office.
05:52fdobridge: <airlied> yeah I think that is the same as BAR eviction problem I can reproduce, all contexts get timed out and stuff never works again
05:53fdobridge: <gfxstrand> Still, With that locking patch I'm seeing more stability than I think I've ever seen. 💜 Good work!
05:54fdobridge: <gfxstrand> Still, With that locking patch I'm seeing more stability than I think I've ever seen so good work! (edited)
05:54fdobridge: <redsheep> It seems VK_EXT_calibrated_timestamps was promoted to VK_KHR_calibrated_timestamps, so VK_KHR_calibrated_timestamps is what I actually need to implement to resolve issue 9625, right?
05:54fdobridge: <gfxstrand> Implement the KHR version of the entrypoing and then turn on both extensions
05:55fdobridge: <gfxstrand> The dispatch code will handle the entrypoint aliasing for you
06:57fdobridge: <tom3026> something on ada is wonky compared to ampere in kernel also btw, "nouveau 0000:01:00.0: gsp: cli:0xc1d00002 obj:0x00730000 ctrl cmd:0x00731341 failed: 0x0000ffff" 5-6 lines of those everytime im launching something on the dgpu. and once at boot and a few seconds/minutes after. same when closing said application, almost like its when its about to suspend/enter D3
06:57fdobridge: <tom3026> never saw those on the 3060 card
06:58fdobridge: <redsheep> Any notable kernel parameters set? What kernel are you testing? I have seen similar errors but only pretty rarely with my ada card
06:59fdobridge: <tom3026> "initrd=\intel-ucode.img initrd=\initramfs-linux.img root=UUID="21d5e523-e1e7-46c3-924a-aa26bca1a2b2" rw mitigations=off audit=0 nowatchdog initcall_blacklist=simpledrm_platform_driver_init msr.allow_writes=on ibt=off srso=off split_lock_detect=off add_efi_memmap acpi_osi=linux tsx=on tsx_async_abort=off tsc=reliable nmi_watchdog=0 zswap.max_pool_percent=20 zswap.enabled=1 zswap.compressor=lz4 zswap.zpool=zsmalloc i915.mitigations=off i915.fa
07:00fdobridge: <tom3026> yes yes i know mitigations=off covers half of those, but it was a testing one by one incident and it all sticked. xD
07:00fdobridge: <redsheep> pcie_aspm is the only one I see that seems like it could be relevant
07:00fdobridge: <redsheep> I don't think I have that
07:01fdobridge: <redsheep> Particularly since that has to do with power management that seems like a good place to start
07:29fdobridge: <airlied> Those are just display port probes failing
07:29fdobridge: <airlied> I should drop that debug
07:32fdobridge: <tom3026> makes sense since this has displayport and the other one didnt heh
07:33fdobridge: <tom3026> or well usb-c > adapter -> dp :p
08:56fdobridge: <Sid> what does add_efi_memmap do?
08:59fdobridge: <!DodoNVK (she) 🇱🇹> I hope this isn't some ProtonDB-level option
08:59fdobridge: <tom3026> dont remember, old habits. something something about rebuilding some memory map and was required in old days when boot was borked on early efi systems
09:00fdobridge: <tom3026> "If the EFI memory map has additional entries not in the E820 map,
09:00fdobridge: <tom3026> you can include those entries in the kernels memory map of available
09:00fdobridge: <tom3026> physical RAM by using the following kernel command line parameter."
09:00fdobridge: <tom3026> now what that means no idea, im old. 😄
09:00fdobridge: <Sid> I see
09:00fdobridge: <Sid> asking because rebar broke on my setup recently lmfao
09:01fdobridge: <Sid> gives me a `no space for [memory region`
09:01fdobridge: <tom3026> "add_efi_memmap include EFI memory map of available physical RAM"
09:01fdobridge: <Sid> but works fine in a windows env
09:02fdobridge: <tom3026> does the efi perhaps contain some region of how much physical ram there is, linux does black magics and figure out that on its own. add efi memmap just makes the efi region matter too?
09:02fdobridge: <tom3026> no idea
09:02fdobridge: <tom3026> and old efi had protected sections that overlapped and all hell brooke loose? oh well its like 10 years ago heh
09:03fdobridge: <Sid> fair, was just curious :>
09:05fdobridge: <tom3026> https://linux.kernel.narkive.com/wC0oPNmT/patch-2-2-x86-boot-only-pick-up-additional-efi-memmap-if-add-efi-memmap-flag 16y ago
09:05fdobridge: <tom3026> xD
09:06fdobridge: <Sid> was just curious if it touches pci space at all, because afaik linux remaps pci regions on boot
09:10fdobridge: <tom3026> https://blog.fpmurphy.com/2012/08/uefi-memory-v-e820-memory.html seems to explain more but i havent fully grasped the idea yet. but yeah this was around the time when i fiddled with the thinkpad x230, corebooting and using tianocore for efi
09:10fdobridge: <tom3026> probably isnt needed unless dmesg shows some errors
09:11fdobridge: <Sid> I feel like something in the pci driver changed
09:12fdobridge: <Sid> which broke rebar for me
09:12fdobridge: <Sid> but I'm in no mood to go bisect that...
09:17fdobridge: <tom3026> 6.7.6 with your patch applied :p
09:18fdobridge: <Sid> nice!
09:18fdobridge: <tom3026> no i meant you should just go there see if rebar is actually an driver change bork or not
09:18fdobridge: <Sid> my patch is in the mailing list (thanks dave!) and should land in 6.8-rc7
09:18fdobridge: <Sid> oh
09:18fdobridge: <tom3026> but yeah i got it applied too heh
09:18fdobridge: <Sid> I'm on 6.7.6 with my patch applied too, yes
09:18fdobridge: <Sid> but
09:18fdobridge: <Sid> that's a good idea, hm
09:19fdobridge: <Sid> except I don't know when it broke, and afaik nouveau does not handle allocating bar region
09:19fdobridge: <Sid> I think
09:19fdobridge: <tom3026> oh
09:20fdobridge: <Sid> or the log line would be `nouveau` instead of `pci`
09:21fdobridge: <Sid> `[ 0.604440] pci 0000:01:00.0: BAR 1 [mem size 0x200000000 64bit pref]: can't assign; no space`
09:21fdobridge: <tom3026> add_efi_memmap
09:21fdobridge: <tom3026> xD
09:22fdobridge: <Sid> which is why I asked :D
09:23fdobridge: <tom3026> its the magic command from before the financial crisis of 2008 that solves all memory mappings, unless it doesnt.
09:30fdobridge: <Sid> heh
09:32fdobridge: <tom3026> otherwise since your rebar is a hack, there is also pci=realloc , "reallocating PCI bridge resources if allocations done by BIOS are too small to accommodate resources required by all child devices"
09:32fdobridge: <Sid> already using realloc, yeah
09:32fdobridge: <tom3026> ah okay
09:33fdobridge: <Sid> not using realloc makes the GPU fail to initialize
09:33fdobridge: <Sid> :>
09:33fdobridge: <Sid> I'm just confused, because rebar reports correctly in Hiren's BootCD still
09:35fdobridge: <tom3026> 0x200000000 , is 8192mb so the question is why it thinks there is no space
09:35fdobridge: <Sid> exactly my confusion
09:42fdobridge: <tom3026> https://github.com/torvalds/linux/blob/master/drivers/pci/setup-res.c#L355
09:43fdobridge: <Sid> sadly
09:43fdobridge: <Sid> `[ 0.604446] pci 0000:01:00.0: BAR 3 [mem size 0x02000000 64bit pref]: can't assign; no space
09:43fdobridge: <Sid> [ 0.604448] pci 0000:01:00.0: BAR 3 [mem 0xfffffffffe000000-0xffffffffffffffff 64bit pref]: failed to assign`
09:44fdobridge: <Sid> oh wait, wrong bar
09:44fdobridge: <Sid> `[ 0.604440] pci 0000:01:00.0: BAR 1 [mem size 0x200000000 64bit pref]: can't assign; no space
09:44fdobridge: <Sid> [ 0.604443] pci 0000:01:00.0: BAR 1 [mem 0xfffffffe00000000-0xffffffffffffffff 64bit pref]: failed to assign`
09:44fdobridge: <Sid> for some reason it's also trying to allocate in a region outside my bus resource
09:45fdobridge: <Sid> ```
09:45fdobridge: <Sid> [ 0.214711] pci_bus 0000:00: root bus resource [io 0x0000-0x0cf7 window]
09:45fdobridge: <Sid> [ 0.214712] pci_bus 0000:00: root bus resource [io 0x0d00-0xffff window]
09:45fdobridge: <Sid> [ 0.214713] pci_bus 0000:00: root bus resource [mem 0x000a0000-0x000bffff window]
09:45fdobridge: <Sid> [ 0.214714] pci_bus 0000:00: root bus resource [mem 0x8f800000-0xdfffffff window]
09:45fdobridge: <Sid> [ 0.214715] pci_bus 0000:00: root bus resource [mem 0x66e800000-0x7fffffffff window]
09:45fdobridge: <Sid> [ 0.214716] pci_bus 0000:00: root bus resource [mem 0xfc800000-0xfe7fffff window]
09:45fdobridge: <Sid> [ 0.214717] pci_bus 0000:00: root bus resource [bus 00-fe]
09:45fdobridge: <Sid> ```
09:45fdobridge: <Sid> which didn't happen before!
09:46fdobridge: <Sid> ...hm
09:46fdobridge: <karolherbst🐧🦀> doesn't it retry a few times until it works?
09:46fdobridge: <Sid> the mem regions only sum up to ~7.75 gigs
09:47fdobridge: <Sid> no, it retries a few times then falls back to using the default bar size of 256mb
09:47fdobridge: <karolherbst🐧🦀> ahh.. that's not great, but smells like an upstream PCI issue then
09:47fdobridge: <karolherbst🐧🦀> or firmware being firmware
09:47fdobridge: <karolherbst🐧🦀> though it works on windows, right?
09:47fdobridge: <karolherbst🐧🦀> or the prop driver?
09:47fdobridge: <Sid> works on windows
09:47fdobridge: <karolherbst🐧🦀> I wonder if that's the difference actually
09:47fdobridge: <Sid> not at all on linux
09:47fdobridge: <karolherbst🐧🦀> like...
09:47fdobridge: <Sid> not on prop, not nouveau
09:47fdobridge: <karolherbst🐧🦀> the nvidia driver trying to reassign those bars
09:48fdobridge: <karolherbst🐧🦀> huh...
09:48fdobridge: <karolherbst🐧🦀> uhh
09:48fdobridge: <karolherbst🐧🦀> I meant the open nvidia one though 😄
09:48fdobridge: <Sid> on jan 23 it looked like this
09:48fdobridge: <Sid> ```
09:48fdobridge: <Sid> [ 0.499242] pci_bus 0000:00: root bus resource [io 0x0000-0x0cf7 window]
09:48fdobridge: <Sid> [ 0.499246] pci_bus 0000:00: root bus resource [io 0x0d00-0xffff window]
09:48fdobridge: <Sid> [ 0.499249] pci_bus 0000:00: root bus resource [mem 0x000a0000-0x000bffff window]
09:48fdobridge: <Sid> [ 0.499251] pci_bus 0000:00: root bus resource [mem 0x8d800000-0xdfffffff window]
09:48fdobridge: <Sid> [ 0.499253] pci_bus 0000:00: root bus resource [mem 0xfc800000-0xfe7fffff window]
09:48fdobridge: <Sid> [ 0.499255] pci_bus 0000:00: root bus resource [bus 00-fe]
09:48fdobridge: <Sid> ```
09:48fdobridge: <Sid> which
09:48fdobridge: <Sid> is still 1.38 gigs
09:48fdobridge: <Sid> but
09:48fdobridge: <karolherbst🐧🦀> I wonder if there is some randomness in play here
09:48fdobridge: <Sid> the fact that it keeps changing pisses me off
09:49fdobridge: <karolherbst🐧🦀> yeah.. sounds like something you ~~want to~~ should report upstream
09:50fdobridge: <Sid> not karol trying to nerdsnipe me
09:50fdobridge: <tom3026> dmesg doesnt contain any "resource collision: [mem 0xSOMEADRESS-0xSOMEADRESS] conflicts" ?
09:51fdobridge: <Sid> https://cdn.discordapp.com/attachments/1034184951790305330/1213060955898904626/dmesg.log?ex=65f41a0d&is=65e1a50d&hm=19fc8fa555efba39fb6a87536a6c6d146c092adb487ef03bbfb5e06c2addd481&
09:51fdobridge: <Sid> not that I can find
09:55fdobridge: <Sid> ~~also karol I don't think simply reporting will satisfy me anymore~~
09:59fdobridge: <tom3026> you are fast approaching kernel maintainer status 😄
09:59fdobridge: <Sid> https://tenor.com/view/cat-funny-scared-screaming-cat-cute-gif-25233437
10:05fdobridge: <huntercz122> since when Sid is a kernel dev
10:06fdobridge: <Sid> ~~since the recent nouveau suspend/resume regression~~
10:07fdobridge: <Sid> tbh I was only gonna bisect and report with the problematic commit
10:07fdobridge: <Sid> but the commit was small and easy to understand...
10:08fdobridge: <Sid> @tom3026 for good measure I'm gonna reflash my modded bios
10:08fdobridge: <Sid> in hopes it reverts everything to absolute default settings
10:08fdobridge: <Sid> and slowly work from there
10:09fdobridge: <Sid> first will try to get gpu-z to show correct bar size in HBCD
10:09fdobridge: <Sid> then work on the linux side of things
10:33fdobridge: <karolherbst🐧🦀> potential developers are also baited by fixing bugs they it themselves 😛
10:33fdobridge: <karolherbst🐧🦀> *always
10:35fdobridge: <karolherbst🐧🦀> ~~now that nouveau isn't experimental anymore, I should also start looking into using crates, like.. serde~~
10:42fdobridge: <Sid> for sanity
10:42fdobridge: <Sid> https://cdn.discordapp.com/attachments/1034184951790305330/1213073941791510539/dd479dd1-f0cc-4635-9039-4b9fe504f764.jpg?ex=65f42625&is=65e1b125&hm=25b5c65bab90fbb7e1aea25e7e9baddea6b07df2476d22a1de36d80b5863acac&
10:43fdobridge: <Sid> that's in HBCD
10:43fdobridge: <Sid> and now on linux
10:43fdobridge: <Sid> ```
10:43fdobridge: <Sid> 01:00.0 VGA compatible controller: NVIDIA Corporation TU116M [GeForce GTX 1660 Ti Mobile] (rev a1) (prog-if 00 [VGA controller])
10:43fdobridge: <Sid> Subsystem: Acer Incorporated [ALI] TU116M [GeForce GTX 1660 Ti Mobile]
10:43fdobridge: <Sid> ...
10:43fdobridge: <Sid> Region 0: Memory at 40000000 (32-bit, non-prefetchable) [size=16M]
10:43fdobridge: <Sid> Region 1: Memory at 4110000000 (64-bit, prefetchable) [size=256M]
10:43fdobridge: <Sid> Region 3: Memory at 4100000000 (64-bit, prefetchable) [size=32M]
10:43fdobridge: <Sid> ```
10:43fdobridge: <Sid> ffs
10:44fdobridge: <Sid> what was the kernel version on jan 24th 🐸
10:44fdobridge: <Sid> 6.8 rc1
10:45fdobridge: <Sid> :\
10:46fdobridge: <Sid> why are the regions different 💢
10:47fdobridge: <Sid> oh I was on 6.7.0
10:47fdobridge: <Sid> great
11:15fdobridge: <tom3026> are you sure this ever worked on the blob?
11:16fdobridge: <Sid> yes
11:18fdobridge: <Sid> https://cdn.discordapp.com/attachments/1034184951790305330/1213082968084451339/rebar.png?ex=65f42e8d&is=65e1b98d&hm=14194b9dcf8c85214e20272b8a20c5026f7d392cd2f54b1a1f6a15cab007ee1f&
11:19fdobridge: <Sid> 25th jan
11:19fdobridge: <Sid> now I'm back on 6.7.0 (and still on 550.40.7) and it's still borken
11:19fdobridge: <Sid> meaning it's something with my BIOS settings, maybe
11:19fdobridge: <Sid> unless
11:20fdobridge: <Sid> my root partition switching from ext4 to xfs and dropping the swapfile makes a difference
11:21fdobridge: <rhed0x> Alan Wake 2 works on NVK?
11:21fdobridge: <rhed0x> Or is that just a wallpaper?
11:21fdobridge: <Sid> that's proprietary
11:21fdobridge: <rhed0x> oh
11:21fdobridge: <Sid> sorry 😅
11:21fdobridge: <Sid> not a wallpaper, no. game in windowed mode so I can have the terminal over it to monitor BAR usage
11:22fdobridge: <tom3026> from my little googling and understanding,
11:22fdobridge: <tom3026> ```
11:22fdobridge: <tom3026> [ 0.425255] pci_bus 0000:00: root bus resource [io 0-332 window]
11:22fdobridge: <tom3026> [ 0.425259] pci_bus 0000:00: root bus resource [io 332-65535 window]
11:22fdobridge: <tom3026> [ 0.425261] pci_bus 0000:00: root bus resource [mem 10-11 window]
11:23fdobridge: <tom3026> [ 0.425263] pci_bus 0000:00: root bus resource [mem 4096-4294967295 window]
11:23fdobridge: <tom3026> [ 0.425265] pci_bus 0000:00: root bus resource [mem 4096000000-549755813887 window]
11:23fdobridge: <tom3026> [ 0.425267] pci_bus 0000:00: root bus resource [mem 4244635648-4278190079 window]
11:23fdobridge: <tom3026> [ 0.425269] pci_bus 0000:00: root bus resource [bus 00-254]
11:23fdobridge: <tom3026>
11:23fdobridge: <tom3026> [ 0.603588] pci 0000:00:01.0: bridge window [mem 41943040-4278190079]: assigned
11:23fdobridge: <tom3026> [ 0.603595] pci 0000:00:15.0: BAR 0 [mem 42949672960-42949674007 64bit]: assigned
11:23fdobridge: <tom3026> [ 0.603867] pci 0000:00:15.1: BAR 0 [mem 42949674008-42949675055 64bit]: assigned
11:23fdobridge: <tom3026> [ 0.604140] pci 0000:00:1e.0: BAR 0 [mem 42949675056-42949676047 64bit]: assigned
11:23fdobridge: <tom3026> [ 0.604440] pci 0000:01:00.0: BAR 1 [mem size 214748364800 64bit pref]: can't assign; no space
11:23fdobridge: <tom3026> [ 0.604443] pci 0000:01:00.0: BAR 1 [mem 4294967295999990000-4294967295999999999 64bit pref]: failed to assign
11:23fdobridge: <tom3026> [ 0.604446] pci 0000:01:00.0: BAR 3 [mem size 33554432 64bit pref]: can't assign; no space
11:23fdobridge: <tom3026> [ 0.604448] pci 0000:01:00.0: BAR 3 [mem 4294967295990000000-4294967295999999999 64bit pref]: failed to assign
11:23fdobridge: <tom3026> [ 0.604451] pci 0000:01:00.0: BAR 0 [mem 41943040-42024959]: assigned
11:23fdobridge: <tom3026> [ 0.604457] pci 0000:01:00.0: ROM [mem 42024960-42268671 pref]: assigned
11:23fdobridge: <tom3026> [ 0.604459] pci 0000:01:00.1: BAR 0 [mem 42268672-42270719]: assigned
11:23fdobridge: <tom3026> [ 0.604465] pci 0000:01:00.3: BAR 0 [mem 42270720-42272767]: assigned
11:23fdobridge: <tom3026> ```
11:23fdobridge: <tom3026> converted hex to decimals :p, but yeah that BAR is going out of your root bus resource window, which is fetched from ACPI
11:24fdobridge: <Sid> yeah, I did get that myself too 😅
11:24fdobridge: <Sid> but now that I've gone back to the kernel/driver config that worked previously
11:24fdobridge: <Sid> and it still doesn't work
11:24fdobridge: <Sid> I can go poke around UEFI settings
11:25fdobridge: <tom3026> yeah acpi is more of bios/uefi problem
11:25fdobridge: <tom3026> or linux is just stricter with these things compared to windows
11:25fdobridge: <Sid> linux is actually more lax
11:25fdobridge: <tom3026> oh derp its going out of acpi allowed window DENIED
11:25fdobridge: <Sid> since we do ReBAR in kernel space
11:25fdobridge: <Sid> instead of blindly following UEFI
11:26fdobridge: <tom3026> question is why it worked at all before tho
11:26fdobridge: <tom3026> heh
11:26fdobridge: <Sid> heck, you can even boot with a modified DSDT without having to patch it into the driver
11:26fdobridge: <Sid> s\/driver/firmware
11:29fdobridge: <Sid> wtf
11:29fdobridge: <Sid> just comparing output from lspci is also so different
11:30fdobridge: <tom3026> or did your bios restore itself in some failsafe manner lol
11:31fdobridge: <tom3026> or "restore default settings" overwrote those https://github.com/xCuri0/ReBarUEFI things this does
11:31fdobridge: <Sid> I did do a manual reset
11:32fdobridge: <tom3026> ah okay
11:32fdobridge: <Sid> I'm using a different thing, but yeah, did set the vars again
11:32fdobridge: <Sid> https://github.com/terminatorul/nvstrapsrebar
11:32fdobridge: <Sid> bios hasn't changed, only bios config
11:32fdobridge: <Sid> and changing pci settings is scary D:
11:33fdobridge: <Sid> memory regions are different, and
11:33fdobridge: <Sid> interrupt pin A is being routed to a different IRQ
11:33fdobridge: <Sid> what
11:34fdobridge: <Sid> look
11:34fdobridge: <Sid> https://www.diffchecker.com/GIgAClVE/
11:34fdobridge: <Sid> left is old, right is new
11:35fdobridge: <Sid> bios hours
11:36fdobridge: <tom3026> yeah i would say something with that nvstrapsrebar didnt apply correctly or configured right
11:36fdobridge: <Sid> it's config'ed the same
11:37fdobridge: <tom3026> you should order a ada laptop
11:37fdobridge: <tom3026> 😄
11:37fdobridge: <Sid> see that's a good solution
11:37fdobridge: <Sid> but I do not have the money
11:38fdobridge: <Sid> and if I had the money I'd get a desktop anyway
11:39fdobridge: <!DodoNVK (she) 🇱🇹> https://gitlab.freedesktop.org/mesa/mesa/-/issues/10719 🤔
11:40fdobridge: <tom3026> "Most people should choose the first menu option and press E to Enable auto-settings BAR size for Turing GPUs. " sounds interesting. so which way did you choose auto setting or manually setting a bar size?
11:40fdobridge: <tom3026> is it auto setting it on each reboot and fails 9 out of 10 times
11:40fdobridge: <Sid> the recommended way, yeah
11:41fdobridge: <Sid> I could change to manual
11:45fdobridge: <tom3026> dont hold me accountable for bricking it tho :blobcatnotlikethis:
11:45fdobridge: <Sid> nah
11:45fdobridge: <Sid> iGPU drives display
11:45fdobridge: <Sid> I can always revert
11:55fdobridge: <Sid> blegh
11:55fdobridge: <tom3026> didnt help? 😦
11:58fdobridge: <Sid> nope
12:00fdobridge: <tom3026> i kinda cheated too btw https://i.imgur.com/uOjMG8v.png
12:00fdobridge: <tom3026> 😄
12:01fdobridge: <tom3026> did the resource regions change? tho
12:02fdobridge: <pixelcluster> anyone want to take bets on how terribly wrong the output is
12:03fdobridge: <tom3026> 😄
12:03fdobridge: <Sid> ....oh
12:03fdobridge: <Sid> fuck me
12:04fdobridge: <Sid> I'm so dumb
12:04fdobridge: <Sid> https://cdn.discordapp.com/attachments/1034184951790305330/1213094588940681326/Screenshot_20240301-173437.png?ex=65f43960&is=65e1c460&hm=20a4760bc0dbc6fbef193894b59979364de43e6d217193155178e63131e476fb&
12:05fdobridge: <Sid> guess what was happening...
12:06fdobridge: <tom3026> history is repeating itself
12:06fdobridge: <Sid> yes
12:06fdobridge: <Sid> I re-added the pci-pm config to minimize power draw a week or two ago
12:06fdobridge: <Sid> among a lot more pci-pm tweaks
12:06fdobridge: <Sid> among a lot more power related tweaks (edited)
12:07fdobridge: <Sid> and completely forgot about rebar breaking if I set pci-pm rules via udev...
12:07fdobridge: <Sid> I need to figure out a way to set those rules post-boot
12:08fdobridge: <Sid> my experiments with power management were very successful
12:09fdobridge: <Sid> I managed to take my laptop's idle drain down from I think ~13W to ~6W
12:09fdobridge: <Sid> even dipping as low as 5.3W at some points
12:09fdobridge: <tom3026> yeah i went from 20w according to powertop to like 10-12 doing so aswell
12:09fdobridge: <Sid> nono you don't get the sheer nonsense I pulled off
12:10fdobridge: <Sid> GPU before: `LnkCtl: ASPM Disabled; RCB 64 bytes, LnkDisable- CommClk+`
12:10fdobridge: <Sid> GPU after:
12:10fdobridge: <Sid> ```
12:10fdobridge: <Sid> LnkCtl: ASPM L0s L1 Enabled; RCB 64 bytes, LnkDisable- CommClk+
12:11fdobridge: <Sid> ExtSynch- ClockPM+ AutWidDis- BWInt- AutBWInt-```
12:11fdobridge: <Sid> *with working L1 substates*
12:11fdobridge: <Sid> ```
12:11fdobridge: <Sid> Capabilities: [258 v1] L1 PM Substates
12:11fdobridge: <Sid> L1SubCap: PCI-PM_L1.2+ PCI-PM_L1.1+ ASPM_L1.2+ ASPM_L1.1+ L1_PM_Substates+
12:11fdobridge: <Sid> PortCommonModeRestoreTime=255us PortTPowerOnTime=10us
12:11fdobridge: <Sid> L1SubCtl1: PCI-PM_L1.2- PCI-PM_L1.1- ASPM_L1.2- ASPM_L1.1-
12:11fdobridge: <Sid> T_CommonMode=0us LTR1.2_Threshold=0ns
12:11fdobridge: <Sid> L1SubCtl2: T_PwrOn=10us
12:11fdobridge: <Sid> ```
12:12fdobridge: <Sid> I also force enabled ASPM right on the pci bridge
12:12fdobridge: <Sid> and force enabled package c-states on the CPU
12:13fdobridge: <Sid> because apparently Acer is *terrible* at configuring their laptops' UEFI firmware
12:13fdobridge: <Sid> this is a *gaming laptop* that idles at ~5.5W
12:13fdobridge: <Sid> :D
12:14fdobridge: <Sid> ~~I'm basically treating this laptop like a desktop~~
12:15fdobridge: <tom3026> heh okay
12:15fdobridge: <tom3026> anyways udev rules can use GOTO
12:16fdobridge: <tom3026> ```
12:16fdobridge: <tom3026>
12:16fdobridge: <tom3026> ACTION=="add", SUBSYSTEM=="pci", ATTR{vendor}=="0x10de", GOTO="rule_end"
12:16fdobridge: <tom3026> ACTION=="bind", SUBSYSTEM=="pci", ATTR{vendor}=="0x10de", GOTO="rule_end"
12:16fdobridge: <tom3026> ACTION=="unbind", SUBSYSTEM=="pci", ATTR{vendor}=="0x10de", GOTO="rule_end"
12:16fdobridge: <tom3026>
12:16fdobridge: <tom3026> # Enable runtime PM for all pci devices
12:16fdobridge: <tom3026> SUBSYSTEM=="pci", ATTR{power/control}="auto"
12:16fdobridge: <tom3026>
12:16fdobridge: <tom3026> LABEL="rule_end"
12:16fdobridge: <tom3026> ```
12:16fdobridge: <Sid> I don't wanna do that though, I wanna enable pci pm for every pci device
12:16fdobridge: <Sid> just need a way to do it post boot 🐸
12:16fdobridge: <tom3026> and just manually set the things later in either a systemd service or just a bash script when booted in the /sys/blabla/power/control
12:17fdobridge: <Sid> ...yeah
12:17fdobridge: <Sid> but yes, like I said yesterday
12:17fdobridge: <Sid> I'm a cursed child
12:17fdobridge: <Sid> I do cursed things
12:18fdobridge: <Sid> the only reason I'm not soldering in a newer CPU/GPU on this laptop is because I can't afford to perma-brick it
12:19fdobridge: <tom3026> you could do an egpu setup tho :p
12:19fdobridge: <Sid> where's the fun in that :D
12:19fdobridge: <tom3026> i did that on my thinkpad x230
12:19fdobridge: <Sid> and my laptop doesn't have a thunderbolt port
12:20fdobridge: <tom3026> well got a spare m.2 slot?
12:20fdobridge: <Sid> both my m.2 slots are utilized, and attaching an eGPU to an m.2 will just turn it into a desktop
12:20fdobridge: <tom3026> yeah
12:21fdobridge: <Sid> because then I'll have to leave the bottom plate open
12:24fdobridge: <Sid> anyway, nice to see nouveau pick up rebar automatically on boot now
12:24fdobridge: <Sid> back in january it only did that if the module was loaded after pci was done allocating the memory
12:24fdobridge: <Sid> i.e if the module was loaded post boot
12:26fdobridge: <tom3026> oh well that was fun, what do we debug now?
12:26fdobridge: <tom3026> 😄
12:27fdobridge: <Sid> *sweat*
12:27fdobridge: <Sid> you're terrifying
12:38fdobridge: <Sid> GPL seems to be crashing a lot of games, hm
12:39fdobridge: <zmike.> fwiw nv blob has historically had the most fails with glcts tessellation tests
12:40fdobridge: <zmike.> they've fixed most of them over the years by now, but it used to be quite a lot
12:40fdobridge: <tom3026> got a link to that suspen/resume finalized patch?
12:41fdobridge: <tom3026> fixing up this pkgbuild
12:41fdobridge: <Sid> just a sec
12:41fdobridge: <Sid> https://gitlab.freedesktop.org/drm/kernel/-/commit/f6ecfdad359a01c7fd8a3bcfde3ef0acdf107e6e
12:42fdobridge: <tom3026> thanks
12:43fdobridge: <Sid> should be in rc7 :>
12:44fdobridge: <Sid> wow we're hitting real regressions now
12:45fdobridge: <Sid> even without GPL dirt rally crashes
12:47fdobridge: <Sid> blasphemous, how did this get acked! /j
12:47fdobridge: <Sid> https://cdn.discordapp.com/attachments/1034184951790305330/1213105255198232596/image.png?ex=65f4434f&is=65e1ce4f&hm=581d0fc8a9d00c55b08a49b74fe1b99c08191df99320d9f11b85d1fc659d0c3a&
12:57fdobridge: <marysaka> gotta getch 'em all
13:12fdobridge: <tom3026> https://patchwork.freedesktop.org/patch/580696/ gfxstrand seemed to get less issues in the CTS with this, no idea if its in any of the -RC yet but im building it now
13:15fdobridge: <Sid> not in the rc yet no
13:15fdobridge: <Sid> let me know how it goes for you
13:16fdobridge: <Sid> am gonna go read a book for a while
13:20fdobridge: <Sid> because freedesktop.org hates me and won't let me fetch commits any quicker than 25KiB/s
13:22fdobridge: <tom3026> @airlied is it actually this https://github.com/torvalds/linux/blob/master/drivers/gpu/drm/nouveau/nvkm/subdev/gsp/r535.c#L662 you meant is just a displayport "debug" message? because thats what im hitting, not the nvkm_debug(&gsp->subdev, "cli:0x%08x obj:0x%08x ctrl cmd:0x%08x argc:%d\n", client->object.handle, object->handle, cmd, argc); 20 lines below
14:12fdobridge: <tom3026> seems i can convert mom to nvk, sims4 runs rather well
14:12fdobridge: <tom3026> 😄
14:16fdobridge: <Sid> @gfxstrand with calibrated timestamps, how do we wanna handle the upgrade it got from EXT to KHR
14:16fdobridge: <Sid> am trying to implement it rn :>
14:19fdobridge: <redsheep> ^
14:19fdobridge: <mohamexiety> you enable both in the physical_device.c tables and implement the `KHR` version of the functions. the vulkan runtime handles the `EXT` <-> `KHR` thing when an app calls either of them
14:19fdobridge: <Sid> I see, thank
14:25fdobridge: <Sid> hmmmm I shouldn't have rm -rf'd the cts suite
14:44fdobridge: <tom3026> yeah runtime pm doesnt seem to work on nouveau
14:44fdobridge: <tom3026> 56w usage idling
14:44fdobridge: <tom3026> 😄
14:45fdobridge: <!DodoNVK (she) 🇱🇹> I got D3cold on nouveau
14:47fdobridge: <tom3026> how did you check that
15:04fdobridge: <!DodoNVK (she) 🇱🇹> By opening some sysfs nodes of course
15:11fdobridge: <Sid> works on my machine
15:28fdobridge: <gfxstrand> @tiredchiku , this ☝🏻
15:57fdobridge: <Sid> hm, this seems to require more understanding of the hardware than I have atm
15:57fdobridge: <Sid> I'll tackle it tomorrow
16:01fdobridge: <marysaka> How does people run Steam games with NVK atm? :aki_thonk:
16:02fdobridge: <redsheep> Setting the icd in launch options before %command%
16:03fdobridge: <Sid> which is an optional step if there's only one gpu on your system and is being driven by nouveau
16:03fdobridge: <prop_energy_ball> hit play 🐸
16:03fdobridge: <prop_energy_ball> I have nvk installed at system level
16:03fdobridge: <prop_energy_ball> but you can `VK_ICD_FILENAMES=blah %command%`
16:04fdobridge: <prop_energy_ball> I WILL **NEVER** USE VK_DRIVER_FILES
16:04fdobridge: <Sid> king
16:04fdobridge: <Sid> ...god damn it fdev
16:05fdobridge: <Sid> steam copy
16:05fdobridge: <Sid> https://cdn.discordapp.com/attachments/1034184951790305330/1213155050344550450/image.png?ex=65f471af&is=65e1fcaf&hm=7f6cdc5bd7200bbdb5fb7038879c0f84bf325e5340238e8e57192e1e1ce18e8c&
16:05fdobridge: <Sid> launched from steam
16:05fdobridge: <marysaka> Okay so nothing too fancy to do :SoniiPray:
16:05fdobridge: <Sid> yup
16:06fdobridge: <marysaka> (I'm not installing at the system level yet, and if I was I think I would make some package on copr)
16:06fdobridge: <Sid> gee I wonder how I played for so long without the fdev account not owning the damn game, ***FDev***
16:06fdobridge: <Sid> https://cdn.discordapp.com/attachments/1034184951790305330/1213155445045198899/image.png?ex=65f4720d&is=65e1fd0d&hm=42b93d71243dcb005b86e093b71b05ad0bb6ea0a68216bac88a3c939016b225f&
16:06fdobridge: <Sid> s\/not owning/owning
16:14fdobridge: <esdrastarsis> Why?
16:14fdobridge: <Sid> https://tenor.com/view/abe-simpson-abe-simpson-cloud-yelling-cloud-angry-gif-12558964
16:14fdobridge: <Sid> that's why
16:24fdobridge: <waelunix> Day 2 of trying to get NVK working on my machine. Hopefully i don't face anymore NixOS-isms 🐸
16:27fdobridge: <tom3026> @gfxstrand for what its worth, sims 4 max settings ran beautiful on this 4080
16:28fdobridge: <tom3026> we should make a nvkdb as protondb
16:28fdobridge: <tom3026> what currently runs or not 😄
16:28fdobridge: <Sid> it exists
16:28fdobridge: <Sid> it's called #g
16:28fdobridge: <Sid> uh
16:28fdobridge: <Sid> #gayming Ctrl + f has: image
16:29fdobridge: <Sid> :D
16:29fdobridge: <tom3026> 😄
16:29fdobridge: <Sid> me: ok I've had a long day figuring stuff out with my laptop, time to relax and play a game for a bit
16:29fdobridge: <Sid> also me:
16:30fdobridge: <Sid> https://cdn.discordapp.com/attachments/1034184951790305330/1213161413820944504/image.png?ex=65f4779c&is=65e2029c&hm=195eb600351b4e859367f2c42bebdcdb61ba2100e8330e102a8df5a47593213f&
16:31fdobridge: <gfxstrand> What even is `VK_DRIVER_FILES`? I know it's some new thing that's supposed to replace `VK_ICD_FILENAMES` but does it actually have different semantics?
16:31fdobridge: <Sid> no it's the same
16:31fdobridge: <Sid> they're both interchangable in use
16:31fdobridge: <prop_energy_ball> /shrug No, someone at LunarG woke up and decided to make an alias someday
16:31fdobridge: <prop_energy_ball> I don't know why, or the rationale other than they started randomly saying `VK_ICD_FILENAMES` is deprecated
16:32fdobridge: <prop_energy_ball> but VK_ICD_FILENAMES is a MUCH better name than DRIVER_FILES
16:32fdobridge: <prop_energy_ball> so I don't get it =(
16:33fdobridge: <Sid> elite dangerous does not seem to get past its planetary shader gen on nvk, regardless of GPL
16:34fdobridge: <Sid> wine log from when it crashes
16:34fdobridge: <Sid> https://cdn.discordapp.com/attachments/1034184951790305330/1213162336215638016/elitedangerousNVKcrash.log?ex=65f47878&is=65e20378&hm=8fe900a9bccbfe78f74e4e3a4a6cbd111f4855e13c033d5e42ce87ccbcf442fd&
16:34fdobridge: <tom3026> well VK_DRIVER_FILES can do wildcard matching on names
16:34fdobridge: <tom3026> "*NVIDIA*"
16:34fdobridge: <tom3026> uhm
16:34fdobridge: <Sid> something something vkCreateComputePipelines exception
16:34fdobridge: <gfxstrand> Yeah and why would anyone want that?
16:34fdobridge: <tom3026> discord ate my wildcard
16:35fdobridge: <Sid> to maybe pass both x86 and x86_64 libs while keeping it short, maybe
16:35fdobridge: <gfxstrand> Yeah, it's probably the compiler getting wedged on something. Annoying that it's not printing out an assert.
16:35fdobridge: <gfxstrand> You might be able to run with fossilize and extract the compute shaders and run outside of Wine.
16:35fdobridge: <!DodoNVK (she) 🇱🇹> Compile 64-bit NVK with debug symbols and take note of the library offset
16:35fdobridge: <gfxstrand> You might be able to run with fossilize and extract the compute shaders and compile those outside of Wine. (edited)
16:36fdobridge: <Sid> I was gonna try running it with damavand first 😅
16:36fdobridge: <!DodoNVK (she) 🇱🇹> Compile 64-bit NVK with debug symbols and take note of the library offset (after crashing the game with debug symbols of course) (edited)
16:36fdobridge: <gfxstrand> In case anyone wonders what the current level of troll user requests is...
16:36fdobridge: <gfxstrand> https://cdn.discordapp.com/attachments/1034184951790305330/1213162959518302248/image.png?ex=65f4790d&is=65e2040d&hm=ad820aa2566b8888d572a622d2e7c255cbe14de7c7878a1b5138fd9a910b949b&
16:37fdobridge: <tom3026> 2025, nvk is now the first vulkan driver on windows xp!
16:37fdobridge: <pac85> I'm afraid it wouldn't be the first
16:37fdobridge: <Sid> damavand just does not
16:37fdobridge: <Sid> oookay then
16:37fdobridge: <!DodoNVK (she) 🇱🇹> First Mesa XDDM driver (real)
16:38fdobridge: <Sid> time to swap proton experimental from bleeding-edge to bleeding-edge-debug
16:38fdobridge: <Sid> can someone go back and stop 2020 Sid from switching to linux 😅
16:39fdobridge: <Sid> I'm sitting here on a friday night, while there's a cultural fest going on at my uni, trying to get more info on a foss driver crash
16:40fdobridge: <Sid> would that be setting -Db_ndebug=true in the compile time options?
16:42cwabbott: "machine, pls make driver / real fast like / w/ BIG features too, / play all my fav games, / also fancy upscaling with woosh on / Thanks, human / PS no bugs :)"
16:43fdobridge: <tom3026> might need !strip if on arch and makepkg
16:43fdobridge: <Sid> did add that, yeah
16:51fdobridge: <!DodoNVK (she) 🇱🇹> I think that toggles the assertions
16:51fdobridge: <Sid> good thing I enabled it then
16:51fdobridge: <Sid> still waiting on proton to update though
16:51fdobridge: <Sid> so I can get dxvk debug logs too
17:01fdobridge: <Sid> well, I didn't get any assert (likely missing an env var for it), but I do have the last shader we tried to compile before crashing
17:01fdobridge: <Sid> or, will, the shader hash
17:02fdobridge: <Sid> ```
17:02fdobridge: <Sid> debug: Compiling shader CS_1ca9cf8c2e7ca6f96b40305816bf1652c7c85025
17:02fdobridge: <Sid> debug: Input Signature for - CS_1ca9cf8c2e7ca6f96b40305816bf1652c7c85025
17:02fdobridge: <Sid> debug: Output Signature for - CS_1ca9cf8c2e7ca6f96b40305816bf1652c7c85025
17:02fdobridge: <Sid> ```
17:02fdobridge: <Sid> no further info
17:02fdobridge: <Sid> let me try spoofing an amd card to see if those shaders work better
17:02fdobridge: <Sid> because afaik ED ships different shaders for nv and amd
17:04fdobridge: <Sid> nope
17:05fdobridge: <!DodoNVK (she) 🇱🇹> Do you still get the syscall_fault?
17:05fdobridge: <Sid> if there's anything specific I have to do to make the assert print out, let me know
17:05fdobridge: <Sid> yes
17:05fdobridge: <!DodoNVK (she) 🇱🇹> What's the library offset now?
17:06fdobridge: <Sid> https://cdn.discordapp.com/attachments/1034184951790305330/1213170491712479322/hereSeeForYourselfIDunnoWhatToLookFor.log?ex=65f48010&is=65e20b10&hm=f218991936a4a79d62348c40834578776437bdb117bc72a75fc4ad60c233d9ec&
17:06fdobridge: <Sid> 0x793557, I think?
17:06fdobridge: <Sid> or is it 0x7c78c3393557
17:07fdobridge: <!DodoNVK (she) 🇱🇹> Can you do `objdump -D -Mintel -l libvulkan_nouveau.so > libvulkan_nouveau.txt`?
17:08fdobridge: <Sid> sure can, just a sec
17:09fdobridge: <Sid> guessing it's meant to take longer than a sec
17:10fdobridge: <!DodoNVK (she) 🇱🇹> Yes
17:10fdobridge: <Sid> oh, spicy command
17:10fdobridge: <Sid> laptop fans ramping up and down
17:10fdobridge: <Sid> fun
17:22fdobridge: <Sid> heckin chonk
17:22fdobridge: <Sid> 340mb and still going
17:22fdobridge: <Sid> oh well
17:23fdobridge: <!DodoNVK (she) 🇱🇹> That's a lot of debug symbols
17:24fdobridge: <Sid> it's just your vulkan-nouveau-git pkgbuild but
17:24fdobridge: <Sid> modified
17:32fdobridge: <Sid> chonk
17:32fdobridge: <Sid> even gzip -9'd it's 54mb
17:33fdobridge: <Sid> 428mb uncompressed
17:33fdobridge: <tom3026> thats massive tho, using some high compression ratio on btrfs probably would reduce the gigabytes ~/dev uses of all the git repos lol
17:33fdobridge: <Sid> let's try zstd -22
17:34fdobridge: <Sid> I'm already using zstd -7 compression on zfs
17:34fdobridge: <!DodoNVK (she) 🇱🇹> Now search for 793557 in that file
17:36fdobridge: <Sid> let's see if vscode is up to the task...
17:36fdobridge: <Sid> oh
17:36fdobridge: <Sid> es
17:36fdobridge: <Sid> ` 793557: e8 d4 90 e2 ff call 5bc630 <_ZN4core4cell16RefCell$LT$T$GT$10borrow_mut17heca93d7b8ff06000E>`
17:37fdobridge: <!DodoNVK (she) 🇱🇹> Can you go to the line where you found this?
17:38fdobridge: <Sid> as in?
17:39fdobridge: <Sid> in the objdump generated txt?
17:39fdobridge: <!DodoNVK (she) 🇱🇹> Yes
17:39fdobridge: <Sid> already on it, yeah
17:39fdobridge: <!DodoNVK (she) 🇱🇹> What comes after and before it?
17:40fdobridge: <Sid> ```
17:40fdobridge: <Sid> 79354a: 00
17:40fdobridge: <Sid> 79354b: 48 8d 7c 08 30 lea rdi,[rax+rcx*1+0x30]
17:40fdobridge: <Sid> 793550: 48 8d 35 49 10 9e 00 lea rsi,[rip+0x9e1049] # 11745a0 <_ZN4core3fmt2rt12USIZE_MARKER17hdc7ca105c6554142E+0xa1a8>
17:40fdobridge: <Sid> 793557: e8 d4 90 e2 ff call 5bc630 <_ZN4core4cell16RefCell$LT$T$GT$10borrow_mut17heca93d7b8ff06000E>
17:40fdobridge: <Sid> 79355c: 48 89 94 24 30 01 00 mov QWORD PTR [rsp+0x130],rdx
17:40fdobridge: <Sid> 793563: 00
17:40fdobridge: <Sid> ```
17:45fdobridge: <Sid> I recognize
17:45fdobridge: <Sid> x86 instructions
17:45fdobridge: <Sid> but that's about it
17:45fdobridge: <Sid> also this is with sparse MR built 🐸
17:49fdobridge: <!DodoNVK (she) 🇱🇹> Can you provide more context for this line?
17:52fdobridge: <Sid> https://cdn.discordapp.com/attachments/1034184951790305330/1213182128364199986/context.log?ex=65f48ae7&is=65e215e7&hm=bb736a0679eb1b4b636d03cf167e9c62b650b752cf1d82d64589df5846ff726a&
17:57fdobridge: <!DodoNVK (she) 🇱🇹> So I guess the issue is in this line: https://gitlab.freedesktop.org/mesa/mesa/-/blob/main/src/nouveau/compiler/nak/repair_ssa.rs#L33
17:58fdobridge: <Sid> possibly
18:18fdobridge: <Sid> so
18:18fdobridge: <Sid> news
18:19fdobridge: <Sid> I can just disable start up shader warming for elite
18:25fdobridge: <Sid> nvm it still does planetary
21:02Sid127: ^C11h^C^C13e^C^C0l^C^C13l^C^C11o
21:02Sid127: aw
21:03Sid127: oh
21:04fdobridge: <gfxstrand> I think I figured out what's wrong with KHR-GL46.tessellation_shader.tessellation_control_to_tessellation_evaluation.gl_tessLevel and it falls thoroughly into the "thanks, I hate it" category
21:11fdobridge: <redsheep> Which end of things is the issue on then? Zink, NVK, CTS?
21:14fdobridge: <gfxstrand> I think I need to whack a magic shader control bit
21:14fdobridge: <gfxstrand> I just don't know what bit
21:14fdobridge: <gfxstrand> @karolherbst who did you get an answer from the last time we did this?
21:15fdobridge: <gfxstrand> And can we get the rest of the bits?
21:15fdobridge: <gfxstrand> Jeff got quickly cagey
21:18fdobridge: <karolherbst🐧🦀> John
21:18fdobridge: <karolherbst🐧🦀> and uhm...
21:18fdobridge: <karolherbst🐧🦀> Andy
21:19fdobridge: <karolherbst🐧🦀> I'd write Andy first.. that reminds me.. I have to ping on the other things 😄
21:21fdobridge: <gfxstrand> Okay, I'll write Andy
21:25fdobridge: <karolherbst🐧🦀> what evil bit do you try to figure out this time though?
21:33fdobridge: <gfxstrand> The one that disables OOB access exceptions
21:41fdobridge: <karolherbst🐧🦀> mhhh
21:42fdobridge: <karolherbst🐧🦀> yeah.. good idea to ask about it
21:42fdobridge: <karolherbst🐧🦀> I just suspect the answer will require cursed things 😄
21:54fdobridge: <gfxstrand> I expect it's a single bit we have to set through FALCON again
21:55fdobridge: <gfxstrand> I may just start whacking bits and see what happens
21:59fdobridge: <Sid> let's go! <https://github.com/torvalds/linux/commit/f6ecfdad359a01c7fd8a3bcfde3ef0acdf107e6e>
22:14fdobridge: <gfxstrand> @karolherbst What do the arguments to SET_PRIV_REG do? The first is the address, the third is a bit to set but what's the second?
22:14fdobridge: <gfxstrand> Or is the third a mask to modify and the second is the value to set?
22:14fdobridge: <karolherbst🐧🦀> mask
22:15fdobridge: <karolherbst🐧🦀> it's a mask to specify which bits to set
22:15fdobridge: <karolherbst🐧🦀> so you can also set 0
22:15fdobridge: <gfxstrand> Okay, so it goes addr, bits, mask?
22:15fdobridge: <Sid> out of sheer curiosity, I'm guessing we're working on getting nouveau overclocking/hwmon going over the sysfs interface?
22:16fdobridge: <karolherbst🐧🦀> the way it's implemented in NVK it goes: value, mask, addr
22:16fdobridge: <gfxstrand> right, okay
22:16fdobridge: <gfxstrand> that makes sense
22:17fdobridge: <karolherbst🐧🦀> the actual interface to the firmware is a bit more cursed
22:17fdobridge: <karolherbst🐧🦀> I reordered it from how nvidia done it, because I could safe a temporary this way :ferrisUpsideDown:
22:18fdobridge: <karolherbst🐧🦀> but anyway... the binary interface takes three values + the address as the trigger in `FALCON04`
22:19fdobridge: <karolherbst🐧🦀> and what the first value does is kinda unknown, because it's 0
22:19fdobridge: <karolherbst🐧🦀> but then you get the value and the mask
22:20fdobridge: <karolherbst🐧🦀> maybe the first one is a mask of values to keep?
22:20fdobridge: <karolherbst🐧🦀> *bits
22:20fdobridge: <gfxstrand> Yeah, that's fine
22:21fdobridge: <gfxstrand> Okay, so I've fuzzed it and it's none of the bits in that register
22:21fdobridge: <gfxstrand> Or at least it isn't just one bit in that register
22:22fdobridge: <karolherbst🐧🦀> tried setting and unsetting?
22:32fdobridge: <pavlo_it_115> Nothing interesting =)
22:32fdobridge: <pavlo_it_115> Gn
22:32fdobridge: <pavlo_it_115> https://cdn.discordapp.com/attachments/1034184951790305330/1213252438568472636/image.png?ex=65f4cc62&is=65e25762&hm=8b75a2b5c4e1b75a8997692360304afa235aa35013d4f11f7bd9648b1d904a7e&
22:32fdobridge: <huntercz122> now you have something interesting for your CV
22:32fdobridge: <pavlo_it_115> Nothing interesting =)
22:32fdobridge: <pavlo_it_115> Good night all! (edited)
22:32fdobridge: <pavlo_it_115> https://cdn.discordapp.com/attachments/1034184951790305330/1213252438568472636/image.png?ex=65f4cc62&is=65e25762&hm=8b75a2b5c4e1b75a8997692360304afa235aa35013d4f11f7bd9648b1d904a7e&
22:32fdobridge: <Sid> ...
22:32fdobridge: <Sid> ooookay then
22:33fdobridge: <gfxstrand> @karolherbst I think I know what I'm looking for: #define gr_gpc0_tpc0_sm0_hww_warp_esr_report_mask_r() (0x00504728U)
22:33fdobridge: <Sid> ah yes, moved 2 lines of code between functions on the linux kernel xD
22:33fdobridge: <gfxstrand> https://github.com/alliedvision/linux_nvidia_jetson/blob/4609206e6594f1eb21e43e69afa8974cf20cc096/kernel/nvgpu/drivers/gpu/nvgpu/include/nvgpu/hw/gv11b/hw_gr_gv11b.h#L1302C1-L1302C81
22:33fdobridge: <gfxstrand> So the question is where did they move the register...
22:35fdobridge: <marysaka> There is some file similar for TU104 :aki_thonk: https://github.com/alliedvision/linux_nvidia_jetson/blob/4609206e6594f1eb21e43e69afa8974cf20cc096/kernel/nvgpu/drivers/gpu/nvgpu/include/nvgpu/hw/tu104/hw_gr_tu104.h#L1008
22:35fdobridge: <gfxstrand> Yeah, I'm looking at that now
22:38fdobridge: <karolherbst🐧🦀> yeah.. that's part of the trap handler thing
22:38fdobridge: <karolherbst🐧🦀> it's all a bit cursed to set up
22:39fdobridge: <karolherbst🐧🦀> but this is also relevant for setting up the shader trap handler properly
22:39fdobridge: <karolherbst🐧🦀> the thing is just, that those masks are not documented
22:39fdobridge: <karolherbst🐧🦀> mhh...
22:39fdobridge: <karolherbst🐧🦀> maybe that one actually ise
22:39fdobridge: <karolherbst🐧🦀> *is
22:40fdobridge: <karolherbst🐧🦀> wtf...
22:40fdobridge: <karolherbst🐧🦀> @gfxstrand https://github.com/alliedvision/linux_nvidia_jetson/blob/4609206e6594f1eb21e43e69afa8974cf20cc096/kernel/nvgpu/drivers/gpu/nvgpu/include/nvgpu/hw/gv11b/hw_gr_gv11b.h#L1618 :ferrisUpsideDown:
22:41fdobridge: <karolherbst🐧🦀> I haven't found this definition in any other repo
22:42fdobridge: <karolherbst🐧🦀> can you whack it via the priv_reg macro?
22:43fdobridge: <gfxstrand> no
22:43fdobridge: <gfxstrand> At least it doesn't seem to do anything
22:43fdobridge: <esdrastarsis> Like I said, theres nothing to discuss...
22:45fdobridge: <karolherbst🐧🦀> I ask you to rethink how you interact with people there and to tone down your language
22:46fdobridge: <pavlo_it_115> 👍
22:46fdobridge: <karolherbst🐧🦀> mhhh... yeah.. I don't really know what the kernel needs to set here and what userspace does
22:46fdobridge: <karolherbst🐧🦀> you can look at priv_reg register maps somewhere in nvgpu
22:48fdobridge: <karolherbst🐧🦀> uhhh...
22:48fdobridge: <karolherbst🐧🦀> where was that..
22:49fdobridge: <karolherbst🐧🦀> @gfxstrand `gv11b_gr_init_get_access_map` or `ga10b_gr_init_get_access_map`
22:49fdobridge: <karolherbst🐧🦀> those are the regs we can probably all touch via the macro
22:49fdobridge: <karolherbst🐧🦀> `gr_pri_gpcs_tpcs_sms_hww_warp_esr_report_mask` is _probably_ the one you need here
22:49fdobridge: <karolherbst🐧🦀> which is `0x419ea8`
22:50fdobridge: <karolherbst🐧🦀> `0x4188fc, /* gr_pri_gpcs_zcull_ctx_debug */` :ferrisUpsideDown:
22:50fdobridge: <karolherbst🐧🦀> ` 0x419a04, /* gr_pri_gpcs_tpcs_tex_lod_dbg */`.. there are some goodies, if I just knew what they all do
22:53fdobridge: <gfxstrand> Yeah, I've tried `0x419ea8` bit 14
22:55fdobridge: <gfxstrand> I also tried `0x50472c`
22:58fdobridge: <pavlo_it_115> Am I pushing people too much, or am I being insolent?
22:59fdobridge: <pavlo_it_115> Am I pushing people too much, or am I being insolent?
22:59fdobridge: <pavlo_it_115> What exactly is the matter (edited)
23:02fdobridge: <gfxstrand> Annoying whoever NVIDIA has assigned to watch those issues isn't going to change their answer RE firmware and it is going to annoy one of the few people NVIDIA has facing the world of open-source.
23:03fdobridge: <pavlo_it_115> sorry..
23:04fdobridge: <karolherbst🐧🦀> you are just very rude
23:04fdobridge: <karolherbst🐧🦀> there
23:06fdobridge: <gfxstrand> Clearing most of the bits in that register doesn't seem to do anything. 😢
23:08fdobridge: <pavlo_it_115> https://cdn.discordapp.com/attachments/1034184951790305330/1213261486332313600/dc633f13e59990ab.png?ex=65f4d4cf&is=65e25fcf&hm=b52e23bcbe477f8b1a5d2dc068e340179df65cbf7f074253f2911ad1db0fcb52&
23:08fdobridge: <pavlo_it_115> https://tenor.com/view/facedown-face-down-watchdogs-frustration-gif-10398043
23:21fdobridge: <redsheep> Are you still up against this block on calibrated timestamps? I've also been attempting this, so far mostly just by reading the other implementations since there seems to be no real documentation
23:22fdobridge: <Sid> you can go ahead if you want to, I just had nothing to do and thought I'd take a crack at it
23:44fdobridge: <gfxstrand> I think I've got it!
23:45fdobridge: <gfxstrand> Yup. It was bit 14 after all
23:45fdobridge: <gfxstrand> I had a hack in the way
23:48fdobridge: <karolherbst🐧🦀> nice
23:48fdobridge: <karolherbst🐧🦀> via `0x419ea8` then?
23:48fdobridge: <karolherbst🐧🦀> or the other reg?
23:54fdobridge: <gfxstrand> MR incoming
23:55fdobridge: <redsheep> I wonder if that might fix heaven, it's uniquely heavy on tessellation. I will give it a try