01:13 Dezponia: Sweet! With the 4.4 git kernel my GTX TITAN Black no longer crashes when I run OpenGL. Havent managed to build a new mesa though so still on 11.0.7, llvm-libs-svn constantly fails when building.
01:13 Dezponia: Still its stabel and plays Xonitc on max everything at 40-60FPS :)
01:17 Dezponia: Guess whatever that crash was on 4.2.5 was solved later. Will try 4.3 when it hits arch stabel (currently in testing)
01:27 koz_: Dezponia: You clearly have all the moneyz for graphics hardware.
01:28 Dezponia: koz_: Not really, I'm just trying to bunker up on the last "good" freedom respecting hardware I can find, as the future looks bleak from both intel and amd and ARM is an unknown right now but knowing the embeded industry, it looks very bleak
01:29 koz_: Fair enough.
01:29 Dezponia: I might be interested in knowing what sort of hardware would be useful for nouveau devs to get their hands on though
01:30 Dezponia: RSpliet mentioned he had to borrow a fermi card for his project yesterday. Without access to hardware I can totally understand if development hits a few snags. But thats a problem that can be fixed and development eased (even if manpower/time will always be the bottleneck)
01:30 Yoshimo: im always told that they lack time not hardware
01:31 Dezponia: Thats harder to fix :) but for specific instances hardware can be fixed :)
01:33 Dezponia: I suspect I have to get a newer mesa compiled to really try out Heaven and such but I'm grabbing them regardless to see what does and doesnt work for now :)
01:34 Dezponia: Just super psyched the desktop doesnt freeze because something dared try opengl on kernel 4.4 :P
01:36 Yoshimo: thinking about yesterday Dezponia, mr. Spliet might make use of a couple of fermi cards if i read that right
01:37 Dezponia: Yoshimo: Thats what I was refearing to
01:37 Dezponia: Yoshimo: I'd assume it would be most useful to have a cross section of various cards from different "code names"
01:38 Dezponia: Like a GF104, GF110 and GF114 seems interesting
01:41 Yoshimo: fermi cards are very cheap these days indeed
01:42 Dezponia: Seems like it, even the top card, the 580 3GB edition is fairly cheap
01:42 Yoshimo: 20€ for a 560ti ;)
01:42 Dezponia: Only real question is would a dev prefer the stock reference pcb cards or those fancy dual fan super extreme oc ultra super edition cards?
01:43 Dezponia: I would guess that the stock pcb would be the most representative of the average card
01:44 Yoshimo: on the other hand the non.standard things might be the more intresting ones
01:44 Dezponia: Best to ask from case to case I guess :)
01:44 Dezponia: Either way I'm very partial to buying the ones with the most VRAM (surprise surprise)
02:10 Dezponia: karolherbst: Hai there. My card is no longer crashing when hitting opengl on the 4.4-git kernel :)
02:10 karolherbst: :D
02:10 karolherbst: nice
02:10 karolherbst: imirkin: if somebody wants to get 0f working, tell them to use the 4.4 kernel :D
02:11 karolherbst: none of my branches will compile with 3.16 anymore I think
02:11 Dezponia: karolherbst: Have not managed to build mesa because the llvm-libs-svn build wont work but whatever, it works, tesselation also works in heaven. No idea what clock I'm running at but its probably low.
02:11 karolherbst: Dezponia: boot with nouveau.pstate=1
02:11 Dezponia: oh right, hurr durr
02:11 karolherbst: then you get a file named pstate inside /sys/class/drm/card0/device
02:12 Dezponia: karolherbst: I have not tried 4.3 but its in Arch's testing repo so should hit stabel in a few days, will test agian then
02:12 karolherbst: mhhhh
02:12 karolherbst: you will need 4.4 for 0f pstate anyway
02:12 Dezponia: Oh right, nevermind then :P
02:14 Dezponia: Alrighty, time to see what explodes
02:14 karolherbst: well you won't be able to clock to 0f anyway, because it will get a volting error
02:15 karolherbst: thing is, we know a way to fix this, but this can lead to heat issues and nouveau doesn't do anything to prevent the gpu from overheating
02:15 karolherbst: well the gpu itself does, but currently the thresholds are just too high
02:15 karolherbst: maybe
02:15 karolherbst: maybe not
02:16 karolherbst: well there is also a downclock and max fan threshold, but if the temperature would increase further nouveau don't do anyhting itself
02:16 Dezponia: Well i see the pstate file, I assume the AC value is the current clock?
02:16 karolherbst: yeah
02:16 Dezponia: So lowest right now, no surprises
02:16 karolherbst: your card should reclock to 0a just fine though
02:17 Dezponia: What about 0d?
02:17 karolherbst: same as 0f I think
02:17 Dezponia: Ah
02:17 Dezponia: Looked like it, cant see the difference
02:17 karolherbst: most of the time there is none
02:17 karolherbst: I only found one card where there was one
02:17 karolherbst: and the memory clock was -1000MHz
02:17 karolherbst: :D
02:17 karolherbst: anyway
02:18 karolherbst: you want to run the unigine benchmark
02:18 karolherbst: and check the gpu temp with sensors
02:18 karolherbst: and check that your fan is working
02:19 Dezponia: karolherbst: temp 53C, 810rpm fan and I can see it spinning
02:19 Dezponia: No reclock yet though, not automatic?
02:19 karolherbst: nope
02:19 karolherbst: not yet
02:19 karolherbst: I have a working prototype though
02:20 karolherbst: but we have to deal with all the other issues first
02:20 Dezponia: Alrighty, so what do I do to test the reclock thingy then? :)
02:22 karolherbst: echo 0a into the pstate file :D
02:22 karolherbst: you can just select the pstates that way
02:23 Tom^: echo 0a | sudo tee /sys/class/drm/card0/device/pstate
02:24 Dezponia: Seems to have worked, screen flashed for a few frames and fps jumped by 15 or so
02:24 karolherbst: mhhh
02:24 Dezponia: temps 61 fan still 780-810 rpm
02:24 karolherbst: the screen shouldn't flash though
02:25 Tom^: wait Dezponia which kernel version?
02:25 karolherbst: yeah, your fan might not work yet, there is a patch though
02:25 Dezponia: Tom^: 4.4 git
02:25 karolherbst: Dezponia: you have envytools installed right?
02:25 Dezponia: karolherbst: yes
02:25 Tom^: karolherbst: hm didnt the fan patch land in 4.4 ?
02:25 karolherbst: Dezponia: do nvaforcetemp 89 and check sensors again
02:25 karolherbst: Tom^: don't think so
02:25 Tom^: ah that explains that then
02:26 karolherbst: Dezponia: fan still at 810?
02:26 Dezponia: karolherbst: Yepp
02:26 Tom^: http://lists.freedesktop.org/archives/nouveau/2015-November/023401.html
02:26 karolherbst: then do nvaforcetemp 91
02:26 karolherbst: this should boost your fan then :D
02:26 Dezponia: fan into overdrive
02:26 karolherbst: k
02:27 karolherbst: then nvaforcetemp 0
02:27 Dezponia: 5000rpm and now its going down
02:27 Dezponia: fan back down
02:27 karolherbst: yeah, this means you just need that patch Tom^ linked
02:28 karolherbst: without this nouveau will have a funny behaviour if you clock to max :D
02:28 karolherbst: like it waits until the temperature reaches 90°C
02:28 karolherbst: then it boost your fans to max
02:28 Dezponia: Pfff, thats finnneeeeee :P
02:28 karolherbst: and then after the temperature felt it sets the fan to min speed agian
02:28 karolherbst: I think it is just a bit annoying :D
02:28 Dezponia: Well yeah :P
02:29 karolherbst: Dezponia: then try out 0f, but I think this will just fail
02:29 karolherbst: well
02:29 karolherbst: the memory clock should go up though
02:29 karolherbst: :D
02:29 karolherbst: the core clock will just stay at 0a level
02:29 Tom^: and core, but you will make it freeze on load.
02:29 Dezponia: karolherbst: No need to adjust the fan in preperation or such first?
02:29 karolherbst: Dezponia: well with 90°C the fan will boost
02:29 karolherbst: your gpu shouldn't get any hotter than 90°C with that
02:30 Dezponia: Works fine for now, still running
02:30 Dezponia: Core has not moved though, just memory
02:30 karolherbst: yeah as I said :p
02:30 karolherbst: and if you check dmesg
02:30 karolherbst: there will be some volting issue -22 or something
02:30 Tom^: but didnt you change that? or has that also not landend :p
02:31 Dezponia: karolherbst: Correct
02:31 Tom^: im so spoiled running your branches. ^_^
02:31 Dezponia: Temp at 73 now, slowly climbing
02:32 karolherbst: Tom^: not yet
02:32 Dezponia: This is cool though, looking forward to seeing what 4.5 brings and such, perhaps then things will settle down :)
02:32 karolherbst: Tom^: we have to setup some overheating stuff first
02:32 karolherbst: :D
02:32 karolherbst: yeah maybe
02:33 Dezponia: 82C and rising :)
02:33 karolherbst: don't expect this to be running perfect before 4.6 though :D
02:33 Dezponia: Thats fine. I'm in no rush
02:34 Dezponia: Seeing some textures flashing red in heaven
02:34 Dezponia: Hit 90 and the fan kicked in
02:36 karolherbst: Dezponia: yeah I know, but this happens just like never, does it?
02:37 Dezponia: Super rare, seen it 1 time in the flyby mode and 1 time when I drove the camera into a wall and the floatin island sort of pushed it into the roof
02:38 Dezponia: Seems to only happen on the floating fortress for some reason
02:39 Tom^: J
02:41 Dezponia: Is there some special resolution or so you guys use for getting good FPS numbers to compare?
02:43 karolherbst: Dezponia: full hd fullscreen ultra settings + 4x msaa :D
02:43 Dezponia: karolherbst: Tesselation extreme?
02:47 karolherbst: Dezponia: why not, thought it looks a bit strange :D
02:48 Tom^: tesselation normal :p
02:48 Dezponia: Tom^: Thanks, just looking to get some numbers that you can compare with
02:48 Tom^: Dezponia: http://i.imgur.com/IIZVfd1.png
02:48 Dezponia: Running a benchmark set now with extreme to see what happens
02:48 Tom^: ;)
02:50 Dezponia: 11.8 average, 5.2 min and 24.7 max with extreme tesselation :P
02:52 Dezponia: Normal tesselation certainly does look more... normal :P
03:01 Dezponia: 10 min, 25.2 max, 13.7 average with Tom^'s settings
03:01 Dezponia: As it turns out core clock is important, who knew? :P
03:01 Tom^: yea im running at 0f :p
03:02 karolherbst: mhh
03:02 karolherbst: shouldn't matter that much though
03:02 karolherbst: I mean how many MHz are missing? 200?
03:02 Tom^: from my 0a to 0f around 800mhz
03:02 karolherbst: :O
03:02 Dezponia: I was running at 324 core
03:02 karolherbst: that many
03:02 karolherbst: ohhhh
03:02 Tom^: yes that many ;)
03:02 karolherbst: 324 core is a bit slow though
03:02 Dezponia: Normal "base" is 889 on this card
03:02 karolherbst: Dezponia: did you also get 324MHz on 0a?
03:02 Dezponia: Yepp
03:03 Dezponia: It never changed
03:03 Dezponia: voltage error thingy
03:03 Dezponia: -22
03:03 karolherbst: that's strange
03:03 karolherbst: let me check
03:03 karolherbst: ha :D
03:03 karolherbst: lol
03:03 karolherbst: all your pstate have the same core clock
03:03 karolherbst: except 07
03:03 karolherbst: that's unusual for high end cards
03:04 Dezponia:feels special
03:04 karolherbst: oh meh, you could compile nouveau yourself with one of my branches and the fan patch
03:04 karolherbst: but because I know that this will work, it is just a thing you should do if you are up to :D
03:05 Dezponia: karolherbst: I'd be glad to if I had any clue how. Also as I mentioned I'm still stuck on mesa 11.0.7 since I wouldnt not get the deps built to get 11.2
03:05 karolherbst: mesa doesn't have anything to do with that
03:05 karolherbst: just the nouveau kernel module
03:05 Dezponia: But mah fps!
03:06 karolherbst: the issue is this basically: the vbios tells us to clock to a cstate, which needs a voltage higher than 1.215V
03:06 karolherbst: 1.2125V
03:07 karolherbst: I don't know why the vbios does this, but it seems to be a common thing
03:08 Dezponia: Hmm, broke a fanblade on a case fan while poking around at it. Guess that gives me an excuse to upgrade that fan to a PWM one since it was the only one that was still voltage controllered
03:08 Dezponia: Breaking stuff for science!
03:09 karolherbst: :D
03:11 Dezponia: karolherbst: So about that nouveau kernel module
03:18 karolherbst: Dezponia: git clone https://github.com/karolherbst/nouveau.git -b stable_reclocking_kepler
03:22 Dezponia: Done
03:25 karolherbst: cd drm
03:25 karolherbst: make
03:27 Dezponia: That did not work :P modules recepie failed
03:29 Tom^: yea comment out that silly tegra include
03:29 Tom^: karolherbst: what provides that anyways? tired of having to comment it out. i want it provided. ;)
03:30 Tom^: is it some kern config not enabled in arch kernels?
03:30 Dezponia: Tom^: Where do I comment that out? :P
03:31 Tom^: Dezponia: the place it errors in, dont recall it but its in a .h file
03:31 Tom^: nouveau/include/nvif/os.h
03:31 karolherbst: Tom^: no clue
03:33 Dezponia: Think it might be a problem with the 4.4 kernel I'm using, you're on 4.3 right Tom^?
03:33 Tom^: Dezponia: pastebin the errors
03:35 Dezponia: https://paste.kde.org/ps2ijmvek
03:36 Tom^: i would say you are just missing your kernel headers
03:36 karolherbst: mhhh
03:36 karolherbst: yeah
03:36 karolherbst: most likely
03:36 Dezponia: Ah right, never built those. Could be useful for this I guess... :P
03:36 Dezponia: I'm not a developer if thats not clear :P
03:37 Tom^: dont worry im not that either.
03:37 Tom^: we are all in the same swedish #välfärd
03:37 Dezponia: Learn ALL the things!
03:37 Dezponia: Neat
03:38 Tom^: you can run LANG=C make , to make the make output more karolherbst friendly. :p
03:38 Tom^: i think
03:39 Dezponia: As it turns out the headers are quite large. Oh well I'll wander off and do other things while waiting for them to download
04:42 karolherbst: Dezponia: and?
04:45 Dezponia: karolherbst: Doesnt build
04:45 karolherbst: ohh wait
04:45 Dezponia: I mean the kernel headers :P
04:45 Dezponia: They dont build either :P
04:45 karolherbst: is there a makefile inside /lib/modules/4.4.0-rc4-g800f1ac/build?
04:46 karolherbst: you need the kernel headers for the kernel you are running
04:47 karolherbst: otherwise you could also switch to a 4.3 stable kernel
04:47 karolherbst: my branch has most of the 4.5 changes already anyway
04:49 Dezponia: We'll see, I'll poke around a bit more at it, see what explodes
05:04 karolherbst: :D
05:05 karolherbst: Dezponia: well I would suggest you to just use the normal kernel package and the headers
05:05 karolherbst: this will work for sure
05:11 Dezponia: Pulling in 4.3.2 from arch testing
05:15 Dezponia: karolherbst: Seems like 4.3.2 also works, no crash on login with KDE set to opengl mode
05:16 Dezponia: And so far its building your thingy
05:23 karolherbst: :)
05:23 Dezponia: karolherbst: Ok, built from the makefile in the DRM folder
05:23 Dezponia: Now what? :P
05:23 karolherbst: now you have to install the module
05:23 Dezponia: Where is the thing and where do I put the thing?
05:23 karolherbst: mabe make install works if you regenerate your initramfs
05:23 karolherbst: yeah we could try that out
05:23 karolherbst: make install
05:24 karolherbst: regenerate initramfs
05:24 Dezponia: From drm?
05:24 karolherbst: yes
05:24 Dezponia: To many makefiles :P
05:26 karolherbst: ?
05:26 Dezponia: back
05:26 Dezponia: Forgot to set pstate but whatever. Can I see if its properly runnng or reboot again with the pstate line?
05:26 karolherbst: you need the pstate line
05:28 Dezponia: Alrighty, back again
05:30 karolherbst: Dezponia: is there a line like "nouveau 0000:01:00.0: clk: base: 705 MHz, boost: 797 MHz" for you in dmesg?
05:31 karolherbst: if so, you can start testing stuff, if not, then you have to wait, cause i am hungry :D
05:31 Dezponia: Nopp
05:31 Dezponia: Not even sure I'm running the nouveau thing I built, seemed to install
05:31 karolherbst: mhh
05:31 karolherbst: well then you need to overwrite the nouveau.ko(.xz) file inside /lib/modules/
05:32 karolherbst: for the 4.3 kernel
05:32 karolherbst: inside kernel/drivers/gpu and so on
05:32 Dezponia: "Initialized nouveau 1.3.0" from the dmesg if thats helpful
05:32 karolherbst: it should be 1.3.1
05:32 karolherbst: or maybe 1.3.2
05:38 Dezponia: Done, works
05:38 Dezponia: [ 11.802032] nouveau 0000:01:00.0: clk: base: 888 MHz, boost: 980 MHz
05:39 Dezponia: [ 12.645525] [drm] Initialized nouveau 1.3.1 20120801 for 0000:01:00.0 on minor 0
05:41 Dezponia: karolherbst: Now then? :)
05:42 Dezponia: karolherbst: I changed the pstate and the screen still flashed black for a fraction of a second but now the fan kicks up
05:43 Dezponia: seems the fan is working as intended right now at any clock level
05:43 Dezponia: Core also reclocks properly at 0a
05:44 Dezponia: on 0f now, core and memory reclocking properly and heaven running well
05:44 Dezponia: Fan around 2300rpm at 61C
05:47 Dezponia: Its not turboboosting to 980 or above though, rock solid on 887-888
05:57 Dezponia: Not quite Tom^'s levels of good but certainly miles ahead of before. 36.4 frames average, 26.1 min and 67.8 max
05:59 karolherbst: Dezponia: now boot with nouveau.config=NvBoost=2
05:59 karolherbst: this should clock up to 980MHz then
06:00 Dezponia: Neat, will do
06:03 pmoreau: karolherbst: Would it be possible/interesting to change NvBoost similarly to pstate?
06:05 karolherbst: pmoreau: this is more of a hack
06:05 Dezponia: karolherbst: works, on 979 core
06:05 Dezponia: 978 even
06:06 karolherbst: Dezponia: would be nice to know how hot the gpu will get with unigine
06:07 Dezponia: karolherbst: Will check, doesnt seem unresonable so far
06:08 Dezponia: karolherbst: Running the benchmark now at fullscreen 1080 4xAA ultra and normal tess
06:21 Dezponia: karolherbst: New benchmark run, fan seems to top out around 2910rpm at 67C (never seen it go above this for either RPM or C). Results where 40.2fps average, 27.9 min and 75.0 max
06:22 Dezponia: I should probably get an Intel i7 3770 to pair this with, the i7 2700K does not have PCI-E (this computer was not built with this card in mind)
06:23 Dezponia: The GTX680 I previously had was right at the edge of what PCI-E 2.0 could handle and a faster card like this can make use of 3.0
06:24 karolherbst: pcie doesn't make much of a difference
06:24 karolherbst: nouveau sets your pcie card speed to 2.5 only anyway
06:24 Dezponia: karolherbst: I know :P
06:24 karolherbst: mhh
06:25 karolherbst: now you could boot with nouveau.config=NvBoost=3
06:25 Dezponia: Thats the 1000ish core mark?
06:25 Dezponia: 1050* ?
06:25 karolherbst: something like that
06:25 karolherbst: not quite sure
06:25 karolherbst: didn't checked carefully enough for that
06:26 Dezponia: Weill do, I'm happy already :)
06:27 karolherbst: pmoreau: NvBoost=3 will be the default later anyway, the only problems are heat management
06:27 karolherbst: and maybe sane reclocking
06:27 karolherbst: but also the blob uses the highest clock possible as long as the temperature stays inside valid ranges(and power usage)
06:27 Dezponia: temps seem good though, I knew Nvidia had improved their stock cooler with the 600 series and much more with the 700 series and that seems to be true
06:28 karolherbst: well 70°C is a bit high already
06:29 karolherbst: I think nvidia tries to target 75 for desktop cardS? maybe 80
06:29 karolherbst: ohh wait
06:29 karolherbst: I can check with your vbios :D
06:29 Dezponia: I would guess 80
06:30 Dezponia: 0f is now 1084mhz
06:30 karolherbst: 82.5 °C is the target
06:31 karolherbst: and 95°C is the "max"
06:31 karolherbst: and max means: downclock to lowest clock
06:31 Dezponia: Going to check what 0f brings me just sitting on the desktop first with these clocks
06:31 karolherbst: and between 82.5 and 95 downclock a little, but currently we don't know how exactly
06:31 karolherbst: :D
06:31 karolherbst: a lot of heat
06:32 Dezponia: karolherbst: Look I totally NEED 1000fps on my wallpaper or it wont feel smooth when I move the mouse
06:33 karolherbst: :D
06:33 Dezponia: Seems to calm down at 58C with 2040rpm fan on the desktop
06:34 Dezponia: Slowly going to 59c
06:35 karolherbst: we can reduce the powr consumption a bit on idle though, but I didn't care enough yet to write a mainline patch for that
06:36 Dezponia: Running another benchmark again to see how the temps look after it and how it performs (if stable)
06:37 Dezponia: karolherbst: BTW which results am I looking for here? nouveau+shaderpatch? http://i.imgur.com/IIZVfd1.png
06:37 Dezponia: Or just regular?
06:38 Dezponia: With the nouveau I built from you
06:38 karolherbst: the left one I think?
06:38 karolherbst: no idea what those shader patches are
06:38 Dezponia: Alrighty!
06:38 karolherbst: but you are pretty close already to those :D
06:38 karolherbst: with 1080MHz you should go easily over them
06:39 karolherbst: Tom^: sorry :D
06:39 Dezponia: So I assume boost 4 would be that 1202 state or whatever?
06:39 karolherbst: Tom^: seems like a Titan Black card will beat yours :p
06:39 karolherbst: Dezponia: there is no boost 4
06:39 Dezponia: Ah, hurr durr
06:39 karolherbst: 1 means base clock, 2 means use avg boost clock, 3 means use highest clock supported
06:39 Dezponia: I just remember you mentioned the vbios for this card had a cstate for 1202 which is weird :P
06:40 Dezponia: Neat
06:40 karolherbst: nouveau can't use higher clocks because that would mean using a higher voltage than the card is able to handle
06:40 Dezponia: So I guess 1084 is the highest this card will boost
06:40 karolherbst: you could modify your vbios though :D
06:40 karolherbst: yeah something like that
06:40 karolherbst: though the nvidia driver won't use this clock
06:41 Dezponia: karolherbst: I might do that. Gigabyte already sells this card in a gigahertz edition with only the difference being the base and boost clocks going up 13%. From 889/980 to 1006/1111
06:41 Dezponia: They bundled it with a windforce triplefan cooler since Nvidia did not allow any custom cards for the titan black
06:41 karolherbst: :D
06:42 Dezponia: All titan blacks use the same pcb and cooler. So they got creative and sold it as the a bundle :P
06:42 karolherbst: nice
06:42 Dezponia: Thats why I thought perhaps using the vbios from the gigaherts edition would work
06:42 karolherbst: I bet the blob only uses a clock near 1000MHz though
06:42 Dezponia: Same card but cherry picked
06:42 karolherbst: mhhh
06:42 karolherbst: could be
06:42 karolherbst: but you can easily modify your vbios yourself
06:42 karolherbst: there is really no magic to it
06:43 Dezponia: As far as I know the only partner that were allowed to change ANYTHING about the titan black was EVGA which is basically Nvidias best friend. They sold a "hydrocooler" edition that came pre-fitted for watercooling but thats it. It also ran at the gigahertz speed
06:43 Dezponia: karolherbst: I'll have to learn that some day then :)
06:44 Dezponia: I always get a weird drop to 8fps minimum the first run in heaven, doing run 1 now since I guess its some shader caching or whatever
06:44 karolherbst: well you can always just OC through the nvidia-driver anyway
06:44 Dezponia: run2*
06:44 karolherbst: +135MHz through coolbits
06:44 Dezponia: Never done that :)
06:44 Dezponia: Will try, if something explodes I blame Nvidia for shoddy engineering
06:45 Tom^: karolherbst: never.
06:45 Tom^: im still awaiting results. :P
06:46 Dezponia: Tom^: Is your 780Ti a reference card?
06:46 Tom^: a what now?
06:46 Dezponia: Tom^: Does it use the stock cooler or one of those fancy dual fan thingies?
06:46 pmoreau: karolherbst: Sounds good :-)
06:46 Tom^: fancy dual things, vbios OC'ed by MSI
06:46 Tom^: :P
06:46 Dezponia: Yeah, I figured :)
06:46 pmoreau: karolherbst: Don't you have patches to let Dezponia change the PCIe speed with Nouveau?
06:47 Tom^: Dezponia: does my world record of unigine and nouvea still stand or did you beat it?
06:48 Dezponia: Tom^: Still running but think it might stand :) You are a messurably better CPU and such as well
06:48 Tom^: good, good.
06:48 Dezponia: 29.9 min, 81.1 max, 43.5 average 1097 score :)
06:48 Dezponia: 3 score away :P
06:48 Tom^: not even close
06:49 karolherbst: :D
06:49 Dezponia: Not sure if mesa changes performance any but this is on 11.0.7
06:49 Dezponia: Couldnt build 11.2 for some reason
06:49 karolherbst: yeah
06:49 karolherbst: 11.2 will be better for sure
06:49 karolherbst: Dezponia: isn't there any newer package available?
06:50 karolherbst: pmoreau: yeah well, we can just add 5% with that, but that's all
06:51 Dezponia: karolherbst: there is in the AUR but I cant build llvm-libs-svn which is required to build it :P
06:51 Dezponia: Need it for llvm-svn which is needed by clang-svn which is needed by mesa-git 11.2 :P
06:51 Dezponia: Tom^: How did you build mesa 11.2? Just AUR?
06:51 Tom^: Dezponia: mesa-git from aur indeed.
06:52 karolherbst: Dezponia: llvm 3.7 is enough
06:52 Dezponia: Tom^: Weird, guess it might've broken between your build and now
06:52 pmoreau: Dezponia: You will get troubles with building llvm due to an update of gcc
06:52 pmoreau: Still trying to solve that… --"
06:52 Tom^: that could be that then
06:52 Tom^: ABI c++ change eh?
06:52 pmoreau: Yeah
06:52 Dezponia: karolherbst pmoreau: I can just just change the aur package to accept 3.7 then?
06:53 Tom^: yea i built it before that landed
06:53 pmoreau: Dezponia: Yeah, should work
06:54 Dezponia: pmoreau: Alrighty, will give it a try, brb with questions about why it wont build :P
06:54 pmoreau: There are two patches for LLVM to get around that issue, but… didn't solved the problem after applying them.
06:54 pmoreau: Let's see if downgrading to GCC 5.2 helps
06:54 Tom^: rewrite LLVM in C instead.
06:55 Dezponia: Rust!
06:55 pmoreau: :-D
06:55 Dezponia: Also relicense it as GPLv3 while at it because thats what I like
06:55 pmoreau: Eh! I want to continue hacking on Mesa/Nouveau, not drown myself into rewriting LLMV. :-p
06:55 Dezponia: Going to do a quick check what temps I get before I move to building mesa
06:56 Dezponia: pmoreau: Rewrite the kernel in rust
06:56 Dezponia: And bring the grsec guys with you, those guys are cool
06:57 pmoreau: ;_;
06:58 pmoreau: Dezponia: No OpenCL for you! :-p
06:58 Dezponia: :8
06:59 Dezponia: Seems the card runs at 1084 core 6999 memory with 3390RPM fanspeed and 71C temps in heaven
06:59 Dezponia: 1080p 4x AA normal tess ultra, the usual
06:59 Tom^: sounds about exactly the same as mine
06:59 Dezponia: Neat, except I have a regular Nvidia cooler and no fancy dual fan thing :P
06:59 Tom^: or well my fanspeed was quite lower
06:59 Dezponia: Yeah
07:00 Dezponia: Tom^: Whats your max clock though? Since it has a factory OC on it I would assume it could go a bit higher than regular cards?
07:00 Tom^: 1097 on blob when it boosts
07:01 karolherbst: pmoreau: just use the old c++ ABI ;)
07:01 Dezponia: Tom^: And nouveau with boost=3?
07:01 Tom^: that option didnt exist when i tested
07:01 Tom^: =D
07:02 karolherbst: Dezponia: for your interest: mesa is also not GPL, nor is the nouveau kernel module :p
07:02 Tom^: iirc i was on 1084 because it was the highest i was allowed by the cstate and volt limits
07:02 Dezponia: karolherbst: I know :)
07:02 Dezponia: Ah, same as me then I guess
07:06 pmoreau: karolherbst: How do you tell that to Mesa?
07:06 karolherbst: pmoreau: you have to tell that to gcc
07:07 pmoreau: If anyone wants to try running an OpenCL hello_world: https://phabricator.pmoreau.org/F15369 (I haven't had time to check the generated code for Fermi+, and now I can't, so it might not work for those cards)
07:10 Dezponia: Hmmm, still no joy building mesa with llvm 3.7
07:10 Dezponia: Complains it cant find the llvm libraries but llvm-libs is installed
07:11 pmoreau: It complains about not finding the shared llvm libraries I guess?
07:11 pmoreau: s/shared/*shared*
07:12 Dezponia: Found where it wants the llvm-libs-svn in the package build, dident change that first time
07:13 karolherbst: Dezponia: that's an pkgbuild issue isn't it?
07:13 karolherbst: just don't depend on llvm-libs-svn, because that's stupid
07:14 Dezponia: karolherbst: Changed it to normal llvm-libs and we'll see if it explodes
07:14 Dezponia: Still explodes it seems
07:14 pmoreau: Add --disable-llvm-shared-libs when building Mesa
07:22 Dezponia: pmoreau: Where do I stick that part in the packagebuild? :P
07:22 Dezponia: https://aur.archlinux.org/cgit/aur.git/tree/PKGBUILD?h=mesa-git
07:23 Dezponia: Under line 42?
07:24 Dezponia: There I see it actually, line 48
07:24 pmoreau: Oh right :-)
07:29 Dezponia: Now then, lets see if this tihng will work for once :P
07:31 karolherbst: Dezponia, Tom^: seems like your cards are basically the same anyway
07:31 Tom^: it should be, just his has more VRAM
07:31 Dezponia: Yepp
07:31 karolherbst: yeah and more DP perf
07:31 Dezponia: And the double float thingy
07:33 karolherbst: would be interresting to know if there is a way to just enable it on other gpus :D
07:35 Dezponia: Not sure how they lock it, its the same die on both the 780ti and black
07:36 Dezponia: vbios lock?
07:39 karolherbst: no idea
07:39 Tom^: nah doubt that
07:39 Dezponia: Magic then
07:39 Tom^: ever since radeon did so with their 6950 which could just be vbios unlocked to an 6970 i think ever vendor started doing HW locks.
07:39 Tom^: or simply not including the proper pins
07:43 Dezponia: Yay! Finally got mesa 11.2 :P
07:43 Dezponia: Now to see if it runs :P
07:55 Dezponia: Tom^: Score of 1101 ;)
07:55 Tom^: ;_;
07:55 Dezponia: Totally worth all that extra cash now
07:55 Tom^: haha
07:55 Dezponia: Justified!
07:55 Tom^: no. :p
07:55 Dezponia: Hush you
07:56 Dezponia: I'll have the last laught once SuperTuxKart upgrades to mega resolution textures and require 6GB of VRAM!
07:57 Tom^: you mean when you actualy have an 8k monitor?
07:57 Dezponia: Nope :P
07:57 Dezponia: I did run 1920x1080 in triple surround before though
07:58 Dezponia: That sort of counts, right?
07:58 karolherbst: :D
08:00 Dezponia: karolherbst: BTW with mesa 11.2 I have not seen any render issues like the red textures in heaven yet
08:04 karolherbst: seems like we really have to fix all those issues now for kepler :D
08:04 Tom^: yes. :p
08:05 karolherbst: meh :D
08:06 Dezponia: What issues? :P So far on 11.2 I have not seen anything weird (yet)
08:07 Tom^: dynamic reclocking, proper volt handling. power sensors so it doesnt go above power budget etc.
08:08 Dezponia: Ah, right, the things I've been doing manually now :)
08:42 chillfan: hey :)
08:51 chillfan: I sent over a v-bios file for the gtx 780ti, wondering if that on it's own is helpful for the reclocking support, when I have some time I could try to help a little more with that
08:51 imirkin: karolherbst: --^
08:52 chillfan: imirkin: cheers :)
08:54 chillfan: have some stuff that needs doing here, but i can try to install nouveau from git today for sure
09:00 karolherbst: yeah I know :D
09:00 karolherbst: chillfan: where did you send the vbios to?
09:00 chillfan: the mmio dumps email
09:00 karolherbst: usually with newest nouveau and newest mesa the 780 ti should run at least on default clocks
09:00 karolherbst: yeah, I have no access to that :D
09:01 chillfan: oh okay, somewhere else it should go?
09:02 karolherbst: well I won't have much time to look into the issue today anyway, so it would be a good idea to test with a newer software stack first
09:02 karolherbst: otherwise Tom^ or Dezponia can also help you, because they have practiacally the same gpu ;)
09:02 imirkin: karolherbst: http://filebin.ca/2PpOToo6k4gd/gtx780ti-vbios.rom
09:03 karolherbst: imirkin: do you want to add it to the repository? with the strap peek? I really have no time today for this :/ maybe in 4 or 5 hours
09:03 chillfan: will do, working on getting this system on the 'edge', should I clone a specific repo or just the one mentioned in InstallNouveau?
09:03 imirkin: karolherbst: not particularly... i'm way behind on that repo, updating would be a huge annoying download
09:08 chillfan: ok back in a few :)
09:19 pmoreau: karolherbst: I could put it in the repo
09:33 Dezponia: Ahahahaha, found a way to cheese the heaven benchmark max fps score :P
09:34 Dezponia: Bet I could beat the Nvidia windows drivers max fps with this method if I just wanted to
09:53 karolherbst: pmoreau: k, thanks :D
09:56 Dezponia: Tom^: Heaven benchmark results: https://seafile.dezponia.se/d/d4ff2e0114/files/?p=/Unigine_Heaven_Benchmark_4.0_20151213_1805.html&dl=1 :P
09:57 Dezponia: Tom^: You still hold the highest max FPS at least, I've yet to beat it :)
09:58 karolherbst: Dezponia: pro tip: skip through every scene with enter unitl you get to the first one again, then start the benchmark ;)
09:59 Dezponia: karolherbst: Just starting and stopping the benchmark with F9 and Esc works just as well, restarts at that point
09:59 imirkin: karolherbst: shouldn't matter... we precompile all shaders now
09:59 Dezponia: karolherbst: Or just take free-cam and point it into the clouds, then you'll start with 100+ max fps and it'll count towards your score
10:00 Dezponia: imirkin: For me I did see a difference on run 2 compared to run 1 in the minfps
10:00 Dezponia: imirkin: Every run 1 I got 8'ish FPS, every run after that I never went below 29 basically
10:01 imirkin: hm ok
10:01 imirkin: also, Tom^, you should probably redo yours... i've pushed some stuff wrt correctness that may slow you down :)
10:01 Dezponia: imirkin: There are certain spots that tank the FPS for a split second
10:02 Dezponia: On the nvidia blob heaven still tanks to 10 fps in some spots on the first run, also runs with a lower clock and higher temp
10:05 Dezponia: Interesting, sensors wont pick up the temp on the nvidia blob
10:28 karolherbst: imirkin: yeah I know, but unigine adds some shaders later
12:08 mupuf: imirkin: hey, for nouveau's shaderdb, wouldn't it make sense to try to track the number of register used? Or you can divide them in pools of PoT numbers that can be used by a thread
12:09 mupuf: The latter is of course better but may be unpractical
12:42 RSpliet: mupuf: I take it "gpr" is "general purpose register"
12:42 mupuf: that would be one way of putting it, I guess :D
12:43 karolherbst: mupuf: any idea how I can set the divider caused by the first threshold on the FSRM?
12:44 imirkin: mupuf: i do track the number of registers used
12:44 mupuf: karolherbst: 20074 does not allow you to do that?
12:44 karolherbst: only for the second threshold
12:44 karolherbst: didn't matter what I put it in there, the first one always caused 2
12:44 mupuf: ah, so it changed since fermi then
12:45 sarnex: have you guys heard of nouveau causing cpu core stalls?
12:45 mupuf: karolherbst: may the fuzz be with you!
12:45 karolherbst: :D
12:45 karolherbst: yeah fun
12:45 mupuf: that's how I found where the reg was
12:45 mupuf: force it to the first level
12:45 mupuf: and start fuzzing
12:46 mupuf: but start with 20074,
12:46 karolherbst: yeah, this will be fun :D
12:46 mupuf: it shouldn't take too long
12:46 karolherbst: also we have to figure out wich thresholds to use actually
12:46 RSpliet: sarnex: pipeline stalls? well, only if it's an in-order CPU that waits for the bus on every reg read/write
12:46 karolherbst: yeah I know, not that many regs in there
12:47 mupuf: karolherbst: the values? Yes, we need to either find them in the vbios or hardcode them, like the blob does
12:47 karolherbst: mupuf: also I didn't find any way to trigger the THRESHOLD_4_DIV
12:47 sarnex: RSpliet: im not sure what that means really, basically if i have nouveau loaded(my main gpu is amd, nouveau is used with PRIME sometimes), i get weird behavior then like 10 seconds latter i see cpu stall detected in dmesg andi have to reboot. it doesnt seem to happen if i rmmod nouveau
12:47 mupuf: when I say "like the blob does", I do not imply that the blob is hardcoding it
12:48 RSpliet: sarnex: oh those kind of stalls, sounds like nouveau is trying to access a GPU that is suspended?
12:48 karolherbst: sarnex: trace pls
12:48 RSpliet: karolherbst: hold your horses ;-)
12:48 sarnex: karolherbst: its unpredictable, but how can i get a trace of this
12:48 karolherbst: and with trace I mean stacktrace
12:48 RSpliet: sarnex: what kernel?
12:48 karolherbst: sarnex: find the stuck kworker
12:49 karolherbst: and cat the trace out of /proc/$pid
12:49 sarnex: 4.3.2 but i tried all between 4.1 and 4.3.2 and they all bad
12:49 karolherbst: there should be a dead kworker
12:49 karolherbst: or a stalled one or whatever
12:49 sarnex: i cant reproduce i just have to use the kernel for a day or two and see if it happens
12:49 karolherbst: ohh wait
12:49 karolherbst: the stall in dmesg should also print a stacktrace, right?
12:49 sarnex: yeah
12:49 karolherbst: I fear the worst, but you could post that one :D
12:50 karolherbst: I bet it is a wait call waiting on a kworker to finish
12:50 sarnex: ill modprobe nouveua and see if it happens
12:50 karolherbst: mhh
12:50 karolherbst: I bet you have a dmesg log somewhere?
12:50 sarnex: oh maybe
12:50 karolherbst: well with systemd you have one for sure :p
12:50 sarnex: yeah can you tell me how to find it with systemd
12:50 karolherbst: and without systemd you also should have one
12:50 sarnex: is it journalctl ?
12:50 karolherbst: yeah
12:50 sarnex: got it
12:51 karolherbst: mupuf: got it :D
12:51 mupuf: karolherbst: that was fast :p
12:51 karolherbst: my brain works like a stack :p
12:51 sarnex: karolherbst: http://pastebin.com/5kuFJbSa
12:52 sarnex: ignore the date all the stalls print the same thing
12:52 karolherbst: sarnex: yeah well that is somehow mhhh nouveau unrelated :D
12:52 sarnex: are you sure? rmmoding nouveau prevents it?
12:52 sarnex: no question mark at the end
12:52 karolherbst: well if I say it is nouveau unrelated I mean there isn't nouveau involved in the stack at least
12:53 karolherbst: isn't there more?
12:53 sarnex: negative
12:54 karolherbst: okay, it seems like the cpu is in a state where it can't put into a sleep state or whatever
12:59 sarnex: karolherbst: yeah heres one from yesterday http://pastebin.com/raw.php?i=vEpsd8Vz
12:59 sarnex: its upside down because journalctl -r
12:59 sarnex: i have no idea how to begin diagnosing this
13:00 karolherbst: mhh
13:00 karolherbst: there should be a dead or stalled kworker
13:01 karolherbst: htop can show you this easily if you let it display kernel threads
13:02 sarnex: yeah i dont see kworker in htop
13:03 karolherbst: you have to disable the option first
13:03 karolherbst: "Hide kernel threads"
13:03 sarnex: oh sorry i havent used htop before
13:03 sarnex: got it
13:04 sarnex: how can i find the stalling one with this
13:04 karolherbst: then the S coloumn may display D for the broken one
13:05 sarnex: operation not permitted to attach but im running as root
13:05 sarnex: wut
13:05 karolherbst: why attack?
13:05 karolherbst: *attach
13:05 karolherbst: it is a kernel thread
13:05 karolherbst: of course you can't attach to it ;)
13:05 sarnex: sorry lol
13:05 karolherbst: you have to get the stack from /proc/$pid
13:06 sarnex: ok thanks ill insmod and wait for it to die
13:19 karolherbst: mupuf: found it :) 2010c
13:19 mupuf: here you go, sir :p
13:19 mupuf: document that :)
13:19 karolherbst: yeah
13:20 mupuf: mwk: your server seems down: http://ng.0x04.net/~mwk/scans/
13:20 imirkin: iirc ng is down for good
13:20 imirkin: try www
13:21 imirkin: mupuf: http://0x04.net/~mwk/scans/
13:23 mupuf: imirkin: thx!
13:23 karolherbst: mupuf: I think with PTHERM.I2C_SLAVE.THRESHOLD_8 there can be a interrupt or something be triggered from the gpu to the host, but this is currently in a guessing state
13:23 karolherbst: I think with that we could remove that temp polling stuff
13:23 mupuf: I am sure you can, that's the spirit
13:23 karolherbst: maybe
13:23 karolherbst: :D
13:24 mupuf: I did this for nv50
13:24 karolherbst: ohh k
13:24 mupuf: it uses the interupts
13:24 karolherbst: how can I check if I get such?
13:24 mupuf: but I stopped doing it because I was lacking interupts or it was not super safe
13:24 karolherbst: or will the pmu send one?
13:24 karolherbst: I don't know from where to expect this interrupt so
13:24 mupuf: nope, ptherm sends it
13:24 karolherbst: ohh okay
13:24 mupuf: but please, don't go this way
13:25 karolherbst: :D
13:25 karolherbst: why not?
13:25 mupuf: because that's an optimisation and we have more important stuff to do
13:25 karolherbst: k
13:25 mupuf: in the end, the pmu will do the monitoring
13:26 mupuf: so, why bother implementing chipset-dependent logic when you can just poll every second on it?
13:26 karolherbst: okay, my road is something like that: 1. fsrm ground work 2. find out which thresholds to use 3. finalize fsrm 4. finalize my kepler reclock patches
13:26 karolherbst: ohh right
13:28 mupuf: 1-3 should be 1% of 4, but sure :p
13:28 mupuf: don't try to over-do the fsrm work, it is mostly trivial
13:28 mupuf: and even if we hardcode conservative thresholds, that would be enough
13:29 karolherbst: yeah I know
13:29 karolherbst: 95°C and 100°C? or is this too high already?
13:30 karolherbst: ohh wait, we have three values
13:30 karolherbst: blob uses 99/102/104 for me
13:30 karolherbst: low_fsrm/high_fsrm/crit
13:31 mupuf: on my kepler, it was only two thresholds
13:31 mupuf: 97 and 100
13:31 karolherbst: which ones? the high fsrm and emergency disconnect?
13:31 mupuf: no idea
13:31 karolherbst: the low_fsrm one sets a div between 0 and 7, the high one between 8 and f
13:32 mupuf: what is this low_fsrm you are talking about?
13:32 karolherbst: PTHERM.I2C_SLAVE.THRESHOLD_6 is the low one
13:32 karolherbst: PTHERM.I2C_SLAVE.THRESHOLD_2 the high one
13:33 karolherbst: the blob uses 99/102/104 °C with 2/e/off as the configured "effect"
13:33 karolherbst: well off maybe just means stop all work and disconnect from bus or something
13:40 karolherbst: I think my EC just shut down my laptop :O
13:43 karolherbst: mupuf: input clock: 405 MHz, div 2: 202MHz, div 3: 166MHz, div e: 50 MHz
13:53 Tom^: imirkin: im never upgrading then, ever.
13:53 Tom^: =D
13:54 imirkin: Tom^: but... elemental? :)
13:56 karolherbst: how does that looks like? http://plotshare.com/sessions/617433473/Plot1.png
13:58 karolherbst: ohh I think I got it
13:58 karolherbst: maybe not
14:02 mupuf: karolherbst: off = cut the voltage regulator
14:02 mupuf: this has altready been REed
14:02 karolherbst: k
14:02 mupuf: and it is documented and explained in ... the paper :D
14:02 karolherbst: okay, I got the formular to calculate the fsrmed freq
14:03 karolherbst: real_freq = set_freq / ((fsrm_div / 2) + 1)
14:03 karolherbst: means, if 862 MHz is set, and the divider is 15 (max) you get a real freq of 101MHz
14:13 mupuf: why / 2?
14:13 mupuf: that is weird
14:13 karolherbst: no idea, but this is what the data say to me
14:14 karolherbst: http://plotshare.com/sessions/617433473/ExtData1
14:14 mupuf: what would be the formula in this form? real = set >> div;
14:14 mupuf: find div :D
14:14 mupuf: that's what the hw does
14:14 karolherbst: well maybe nvidia-settings reports wrong clocks then?
14:16 mupuf: no, trust it more than my jibberish
14:16 karolherbst: k
14:16 karolherbst: if I divide real/fsrmed_clock I get stuf like: 1, 1.5, 2, 2.5, 3, 3.5, 4... and so on
14:16 karolherbst: so this is the only formular which would really make sense
14:16 karolherbst: but it is a bit strange though
14:17 mupuf: yep :s Maybe they used a DLL instead of divider by two
14:17 mupuf: but it definitely was one at the time I reversed it
14:17 mupuf: I used the perf counters
14:17 karolherbst: yeah maybe it is different with kepler now
14:18 karolherbst: I also only get 50% power consumption with the highest divider
14:18 karolherbst: well "only"
14:18 karolherbst: so that much doesn't change
14:18 karolherbst: allthough 50% less powe consumption is still a big win if the gpu gets too hot :D
14:18 karolherbst: but
14:19 karolherbst: it makes no difference if the fsrm_div is 2 or e for the performance if you set the threshold to 60°C and keep load running
14:19 karolherbst: without forcing a temp
14:19 karolherbst: it will make a difference though when the fan still can't keep up with that
14:20 karolherbst: but as long as the current temp stays with the thresholds, the performance loss is the same with all dividers
14:20 mupuf: ?
14:20 mupuf: it's not supposed to
14:20 karolherbst: why not?
14:21 karolherbst: higher div: less heat => faster cooling => faster clock restore
14:21 karolherbst: well for the eye there was a difference though, but no difference in avarage
14:21 mupuf: http://fs.mupuf.org/mupuf/nvidia/graphs/temperature_slow_down_fsrm.pdf
14:22 mupuf: the FSRM is ridiculous for power efficiency
14:22 mupuf: because the voltage is super high
14:22 mupuf: just downclock!
14:22 mupuf: the FSRM is a safety feature
14:22 karolherbst: yeah I know
14:23 karolherbst: I was just playing around to check what the real effect is of the FSRM
14:24 karolherbst: so I would say we would set two thresholds with two different dividers. Then we should dicide on how much power consumption we want to cut with the low one and set the high one to max
14:24 mupuf: don;t try to be smart
14:24 mupuf: use what nvidia uses :D
14:24 karolherbst: k
14:24 karolherbst: nvidia uses half clock on the low one :D
14:24 mupuf: I do not want us to tweak stuff per GPU
14:24 karolherbst: so 2
14:25 karolherbst: no, me neither
14:25 mupuf: 2 and e are what I have seen in nature, but check mmiotraces
14:25 mupuf: and then we need to check the thresholds
14:25 karolherbst: yeah I also have 2 and e
14:25 karolherbst: but we should use 2 and f I suppose
14:25 mupuf: if they are constant across a lot of boards, then let's just hardcode them
14:25 mupuf: if we can't find them in the bios, we might as well take the lowest values we find and put them there
14:26 mupuf: we should never reach this temp anyway
14:26 karolherbst: right
14:26 karolherbst: this is for cards with broken fans and stuff I guess
14:26 karolherbst: or worse
14:26 karolherbst: australian summer, broken fan
14:26 karolherbst: :D
14:28 mupuf: hehe
14:28 karolherbst: mupuf: nvkm_clk_read should be adjusted accordingly?
14:28 mupuf: Nah, ignore it
14:28 mupuf: devs can just peek the reg
14:28 karolherbst: okay, I will add a todo then :D
14:29 mupuf: which todo?
14:29 karolherbst: no idea
14:29 karolherbst: maybe we should create a list of task especially suited for beginners or something, this could be one of them
14:29 karlmag: .. there's always a todo... :-P
14:33 mupuf: karolherbst: you have edit rights on the trello, right?
14:45 karolherbst: yeah
14:45 karolherbst: right I could add it there then :D
14:46 karolherbst: mupuf: I suppose I should also do "PM: Add PCIe link management support" at some point then :D
14:46 karolherbst: meh documentation :D
14:47 karolherbst: k, will go to bed now anyway
16:42 imirkin: nice... got the buffer stuff into a much more working state
16:51 imirkin: pretty soon it'll be cleanup time for that feature
16:51 imirkin: maybe even tonight
16:51 imirkin: [this is ARB_shader_storage_buffer_object + ARB_shader_atomic_counters, for those following at home]
16:52 Tom^: im following!
16:52 Tom^: but i dont understand :(
16:53 imirkin: well, in addition to the functionality presented by those exts themselves, it will also make it possible to start working on ARB_compute_shader and ARB_framebuffer_no_attachments
16:55 Tom^: cool i soon have gl 4.2 then :p
16:55 imirkin: well, gl 4.2 has images too
16:56 imirkin: which is another biggie
16:56 imirkin: and also all the surface stuff needs to be worked out for GK110 (it kinda is for GK104 already)
16:56 imirkin: on the bright side, i have a GK208 so i'll be able to do it
16:57 imirkin: anyways, assuming other people pitch in on some work, it's conceivable that nouveau/radeonsi/i965 will all have GL 4.3 with the next mesa release
16:57 Tom^: now we just need some software actually using GL 4.3 ! :P
16:57 imirkin: i think unreal does
16:57 imirkin: and a bunch of the DX11 games coming out...
17:07 chillfan: directx :/
17:08 Tom^: lets hope they create more vulkan games then dx11.
17:08 chillfan: hm vulkan, I guess I'd need a new card to use it?
17:08 imirkin: perhaps i should have said "ports of DX11 games"
17:09 imirkin: vulkan's not a thing yet
17:09 chillfan: oh ports, over to opengl?
17:09 imirkin: yeah
17:09 chillfan: that's something eh, what games are they?
17:10 imirkin: i dunno... a bunch. GRID Autosport just came out
17:10 imirkin: (and had some issues with mesa)
17:10 imirkin: i fixed one of them, dunno if there are more
17:10 chillfan: ah will have to keep my eyes open
17:11 imirkin: there are a few groups doing porting
17:11 imirkin: feral, eon, and virtual programming
17:12 imirkin: they've all developed various wrapper tech, and are just wrapping games in it
17:13 chillfan: glad someone is doing something other than steam then
17:14 imirkin: huh? they're on steam...
17:14 imirkin: steam is just an application distribution thing
17:14 chillfan: no i mean presumably you can get the game without steam?
17:14 imirkin: usually yeah
17:14 imirkin: i dunno the specifics tbh
17:42 Tom^: ugh compiling llvm is slow. i need xeons.
19:53 Tom^: im ricing so bad, wasting hours on compiling stuff with optimization flags, even the kernel even tried 1000hz and i didnt gain or lose a single score or fps in unigine. xD
19:54 Tom^:facepalms
19:55 Tom^: Dezponia: you have me beaten, for now.
19:55 imirkin: i thought you were ahead
19:55 imirkin: did the mesa update make it that much worse?
19:56 Tom^: i thought he had me with like 3 points or so
19:56 Tom^: but nah the mesa update didnt affect it, and elemental demo runs flawlessly now :p
19:58 Tom^: yea he has me with 4 in total score but my max fps was 0.3 higher. ;)
19:59 Tom^: so i hoped my ricing could get him when it was so close, but as ive always assumed. ricing is placebo. :P
20:03 imirkin: lol
20:15 Tom^: imirkin: amazing work tho, all the UE demos seems to run fine now
20:16 imirkin: excellent
21:05 Tom^: imirkin: sadly its introduced somewhere in the -git and not in 11.0.7
21:06 imirkin: ?
21:06 imirkin: did i miss something?
21:07 Tom^: perhaps it gone lost in the DC i got. :P
21:07 Tom^: the longer im ingame in cs:go the more the fps drops from each shot fired i would assume from the muzzle fire.
21:07 Tom^: until im at the point of it being 120fps and drops to 5 everytime i shoot
21:07 imirkin: weird!
21:07 Tom^: almost like a cache is overfilling or similiar, and its not happening on 11.0.7
21:08 imirkin: yeah.... i can't think of anything offhand that'd be different there -- all of my resource handling stuff went into 11.0.x
21:08 imirkin: but there's probably something i'm forgetting
21:08 imirkin: or it could be something else entirely
21:08 imirkin: perhaps a bisect? :)
21:08 Tom^: sure thing, however im going to work in uh 15 minutes.
21:09 Tom^: gonna take a while to bisect since it isnt happening directly it takes a ~4 minutes of playing :P
21:09 Tom^: can i check how much vram is used?
21:09 imirkin: yeah, sad
21:10 imirkin: i don't think we expose that anywhere
21:10 Tom^: was just wondering, so i could see if it leaks somewhere :P
21:10 imirkin: i wonder if 53cbb117 manages to cause the regression
21:11 Tom^: how much has llvm in play in things like this?
21:11 imirkin: Tom^: check if 11.1-rc3 is affected
21:11 imirkin: absolutely none... nouveau doesn't use llvm
21:11 Tom^: because i sort of have to build llvm-svn to get mesa-git runnning and running repo llvm 3.7 for repo mesa
21:11 Tom^: ah ok
21:11 imirkin: [except in extreme cases where st/mesa falls back to software rendering, like for GL_SELECT... basically the only time]
21:12 Tom^: imirkin: oki il try rc3 when i get home otherwise il just start bisecting. :p
21:12 imirkin: awesome
21:34 gnurou: imirkin: good news! looks like I have permission to release the Maxwell texture headers documentation!
21:34 gnurou: imirkin: looking at the rnndb format, any particular advice as to how I should write it? Is direct XML ok?
21:35 gnurou: imirkin: also there are actually 5 versions of the header, with the fields changing depending on which version is used, how can I reflect that? Is the <stripe> tag the correct way to do it?
21:36 imirkin: gnurou: have a look at https://github.com/envytools/envytools/blob/master/rnndb/graph/g80_texture.xml for the current definition
21:36 imirkin: here is an example of how to do versions: <bitfield pos="9" name="CUBE_SEAMLESS" type="boolean" variants="GK104-"/>
21:37 imirkin: presumably there aren't really *5* versions and we haven't noticed -- just a couple of fields changing here and there
21:37 gnurou: imirkin: right, but these versions depend on the chip - for the Maxwell headers, the version used depends on the value of a particular field in the first word of the header
21:37 imirkin: that said i know that G80 is missing a bunch of stuff that G84+ has
21:37 imirkin: ohhh i see
21:38 imirkin: define an enum (much like chipset is defined), and then do variants on that
21:38 imirkin: and yeah, stripe is somehow related... look at other examples :)
21:39 imirkin: i don't really know the right way, i just copy stuff and make sure it works when i use 'lookup'
21:39 imirkin: gnurou: if you have trouble, do it in *some* way, and i can clean it up later
21:40 imirkin: what's important is having the right info... getting the xml all pretty can come later ;)
21:41 imirkin: anyways... i'm off to sleep
21:42 gnurou: imirkin: thanks - good night!
23:15 Dezponia: Tom^: A valiant effort, but non shall compare to my mighty 6GB of VRAM which is totally required at all times to gain better FPS! Non I say!
23:20 Dezponia: Tom^: It is interesting how consistant the score is in Heaven though, even when the FPS seemingly changes a fair bit from run-to-run the score almost always comes out the same, even accross both our cards which seems to indicate it really is a very accurate benchmark