00:10chillfan: So far for my 940M (maxwell) I have found a way to cause a full lock up (without using nvboost). Regardless of clock setting used, I just needed to run xonotic from the phoronix test suite on the "ultra" setting (1920x1080). Kernel 4.10-rc2, Xubuntu 16.10
00:10chillfan: sysrq doesn't work when it locks up
00:17chillfan: with DRI_PRIME set to the nouveau card
01:07chillfan: perhaps it's just xonotic though, still testing :)
01:11imirkin: skeggsb: any advice on how to get more info out of the lockup for chillfan? how do you debug this stuff?
01:45skeggsb: imirkin: usually ssh, or netconsole if that fails
01:51imirkin: ... and he's gone
01:52imirkin: skeggsb: you really just do everything with netconsole/ssh? impressive.
02:04skeggsb: beyond error messages in dmesg, it's trial-and-error.. which, is less fun
02:06imirkin: skeggsb: btw, it seems like pre-fermi hdmi audio got broken at some point
02:06imirkin: that point was long ago. somewhere in the 3.10 ... 4.4 range
02:06imirkin: so that's 3 rewrites to pick from ;)
02:54funfunctor: How do I use nvascan again exactly?
03:02nyef: skeggsb: In nvkm/engine/disp/disp/hdagt215.c, around line 62 (give or take), is an nvkm_wr32(device, 0x61c440 + soff, (i << 8) | args->v0.data);. The  should be [i]. It's clearly a necessary change, but I have yet to determine if it is sufficient.
03:04nyef: (Said change at least gets the ELD to come through correctly again.)
03:06imirkin: nyef: send patches :)
03:07nyef: Send where?
03:08nyef: I _have_ patches.
03:08nyef: That said, I still haven't heard any evidence that something else isn't wrong.
03:09imirkin: firstname.lastname@example.org + email@example.com
03:09imirkin: subscribing first will prevent them from having to be manually approved
03:10nyef: Are attachments okay, or should I find a non-gmail email client to use?
03:10imirkin: you can use gmail to send them properly with git send-email
03:11imirkin: nyef: this is what i use: https://hastebin.com/notuteyapa.pl
03:11imirkin: (in .gitconfig)
03:11nyef: And since this is a kernel thing, the nouveau list is more appropriate than the dri-devel list, right?
03:11funfunctor: imirkin: nvascan 0xaddr 0xsize should be the correct invocation right?
03:12imirkin: funfunctor: sounds right. note that this will perform writes to your device, so you have to use care.
03:12nyef: Hrm! I should probably find a way to test that outside of a mailing-list context.
03:12funfunctor: imirkin: yes it could brick the hw or smoke could come out I know! But thanks!
03:12imirkin: nyef: dri-devel is hooked up to a patchwork instance, nouveau@ isn't. either or both lists are ok.
03:13imirkin: nyef: yeah, just send it to yourself ;)
03:13imirkin: the envelopesender thing is just coz i use a different email address than @gmail.com one
03:13imirkin: normally that's not needed
03:13imirkin: make sure to have a S-o-b line and all the usual kernel patch items
03:14funfunctor: imirkin: $ sudo ~/envytools/build/nva/nvascan 0xfea00000 0x10000
03:14funfunctor: where BAR0 == 0xfea00000 and its size being 64K
03:14funfunctor: I just says 'R' for everything?
03:14imirkin: funfunctor: nvascan is only designed to work with nvidia stuff
03:14imirkin: funfunctor: i think it always maps one of the BAR's
03:14funfunctor: imirkin: I ported envytools to my device
03:14imirkin: the offset would be relative to that bar, usually
03:14imirkin: not absolute, since bar location isn't generally constant
03:15funfunctor: imirkin: https://cgit.freedesktop.org/~funfunctor/envytools/commit/?h=decklink&id=04fb5b9d608338752413ff8fb849bf9042184928
03:15funfunctor: ah could be the reason
03:15funfunctor: imirkin: no the BAR is read from PCI conf space
03:16imirkin: right, i know :)
03:16nyef: Okay, patch sending isn't going to happen tonight, for various reasons.
03:17funfunctor: imirkin: $ sudo ~/envytools/build/nva/nvascan 0 0x10000 just gives me a bunch of 0xfff's
03:17imirkin: that means that all the bits are writable i guess
03:17funfunctor: I think it caused the hardware to reset to its default state
03:17imirkin: could be
03:18funfunctor: well all the state that was there before got wiped out
03:18imirkin: that's what nvascan does...
03:19funfunctor: but I think it should restore the state after
03:19funfunctor: [ 2399.677265] BlackmagicIO: WARNING: The device "DeckLink Duo 2" has firmware that is newer than the version shipped with the driver (device: 0xff driver: 0x14)
03:19funfunctor: [ 2399.677285] BlackmagicIO: Enabled device "DeckLink Duo 2 (8)" x15/5 Gbps (ffffffff,510103) FW Date: ff-ff ff:ff
03:19funfunctor: [ 2399.681376] BlackmagicIO: DeckLink Duo 2 (8) as blackmagic!io3 [0000:01:00.0]
03:19funfunctor: oh dear
03:20funfunctor: It may of wiped SPI flash
03:20nyef: funfunctor: Weren't you planning to take a backup of that?
03:20funfunctor: nyef: I did yes
03:21nyef: Okay, so you're not completely sunk, then.
03:21funfunctor: a hard reboot resolved it any way
03:36nyef: aplay -L lists two HDMI outputs, "hdmi:CARD=NVidia,DEV=0" and "hdmi:CARD=NVidia,DEV=1". Playing to them with -D results in silence. It's not the sound file, because it plays fine with no -D parameter, just over the built-in speaker rather than the TV.
03:37imirkin: nyef: i think it's supposed to be like aplay -D hw0,1
03:37nyef: How do I know what numbers to use, then?
03:37imirkin: oh wait. do aplay -l
03:40nyef: Still no luck.
03:40nyef: Are both devices supposed to b "HDMI 0 [HDMI 0]", or should one of them have a different number
03:40nyef:obviously can't type well tonight. /-:
03:44imirkin: dunno, sorry
03:44imirkin: i don't have hdmi
03:46nyef: Hrm. There's some possible indication that one of them should be "HDMI 1", not "HDMI 0".
03:47funfunctor: Can nvawatch consider a range instead of a specific address?
03:47nyef: ... Which suggests that if I put a displayport->hdmi adaptor into the chain (thus using the mini-DP connector instead of the HDMI connector), it may work on both channels.
03:49nyef: And... it just locked up.
03:51nyef: Not even the magic sysrq seems to be working.
03:55nyef: ... Okay, so, hw:0,7 corresponds with an HDMI cable plugged into the displayport interface.
03:55nyef: ... And there are three ELDs, but only two HDMI devices?
03:56imirkin: you're way beyond what i know about how this all works
03:57nyef: Okay then.
04:04funfunctor: I think perhaps a good addition to `nvapeekstat` is a flag that allows you to watch over the whole BAR and look for registers that change very very frequently?
04:05funfunctor: to identify counters or whatever they are?
04:09nyef: I guess the other question is, why do I have three ELDs but only two devices?
04:10nyef: Or is one of them for the non-HDMI digital output?
04:18nyef: ... Turns out that the "dynamic minors" option is important.
04:21nyef: ... And turning off the audio infoframe doesn't seem to affect the sound output.
04:27nyef: Ahh... There's the DP output, the HDMI-over-DP output, and the HDMI output. That'd explain the three ELDs.
04:28nyef: And suggests something to try in terms of disambiguation.
07:24funfunctor: mwk: Hi
07:24funfunctor: I think i've found quite a number of things now but I need to discuss with someone
08:41karolherbst: interesting, a fermi card without the speedo value
11:18RSpliet: airlied: Could you chase whoever is necessary to get https://github.com/skeggsb/nouveau/commit/4cf443042eaaaf0520442d2580ac0e9a2d65828c upstream (and backported?) asap? It's a serious regression...
11:20RSpliet: skeggsb: thanks for chasing that bug further
11:26mwk: matching crazy hardware perfectly is never easy
11:27mwk:has extracted all possible results of the rcp instruction and is now trying to figure out how to compute these
11:33funfunctor: mwk: What interesting facts came you come up with about the number 0x40404040 ?
11:34mwk: umm... none really?
11:35funfunctor: dam ok
11:35funfunctor: mwk: i've started to identify some registers now
11:36funfunctor: mwk: I've noticed that most of the state seems to repeat itself over 0x200 block multiples?
11:36funfunctor: is that usual?
11:37funfunctor: mwk: like check this out https://paste.fedoraproject.org/523627/48396185/
11:41funfunctor: it is this pattern I am trying to make sense of https://paste.fedoraproject.org/523628/14839620/
11:41funfunctor: I don't understand the relevance of 0x404040..
11:49RSpliet: funfunctor: I think I've seen that in low-end hardware before... could be just a matter of having a few "don't care" bits in the address translation
11:50RSpliet: I mean, the "replicate over 0x200" thing
11:50funfunctor: it seems like one of those 0x55AA tricks
11:51airlied: RSpliet: chase skeggsb :-)
11:51RSpliet: although 0x224 appears to differ from 0x20, so could just as well be a replicated resource
11:51RSpliet: airlied: thanks, will try
11:51RSpliet: he's a busy man though :-)
11:52RSpliet: funfunctor: either way the 0x40404040 has too little context to make sense of it. What's interesting is that there's 10 of them...
11:52RSpliet: two-and-a-half 32-bit registers
11:52karolherbst: RSpliet: send email to linux-stable and CC ben and airlied
11:53karolherbst: and the drm ML in CC
11:53RSpliet: karolherbst: I think Ben's patches require manual labour before they can be merged into the kernel - some path adjustment I think
11:54Jeansf: karolherbst: actually as instructions with zero operands did not skip as nop's according to HetPas tests, i slightly changed the flow of the code, following the backup plan which is almost the same efficient
11:56karolherbst: RSpliet: you have to adjust the path a little
11:56karolherbst: it's no big deal
11:57Jeansf: i am kinda a ready to work that out, but lots of details can be done in several ways, but it looks as if 3-4x improvement on radeon is real, only thing that needs to be done is to get address space right with pointers so hw scoreboard would nop when needed, and then repartition the lane
11:57karolherbst: just appened "drivers/gpu"
11:57pmoreau: karolherbst: "drivers/gpu/" don’t miss the final "/" ;-)
11:58RSpliet: karolherbst: of course it isn't, but it doesn't feel like my end decision to wrap it up and push it forward, no matter how trivial the patch seems.
11:58karolherbst: I think skeggsb has a script for this, but quite sure
11:59Jeansf: but i allready described this base hack, and in future i can do that again if i get resources, like offer slides and talks about the base principals
11:59karolherbst: RSpliet: well, it's a regression
11:59karolherbst: if you are affected, it's already enough to push it forward
12:01Jeansf: i have docs also here, which say that swizzling the lanes and repartitioning would improve 2x , but i'd have enhanced version of that
12:01Jeansf: that is in particular some AMD employee on closed source drivers who stated this
12:01Jeansf: i later give you the pdf
12:03funfunctor: RSpliet: yes 48bits a interesting number to pop up
12:06Jeansf: anyways the biggest principal, is that when ds range function start, every contiguous address is pointed against the same variable, until regfile is being signalled, that some of the tid uploaded an alias to it
12:07Jeansf: then hw reschedules and executes that instruction, the trick is very simple, that you in regfile ask for current tid from the pointer
12:09Jeansf: it is almost the same as zero operanded instruction would be skipped, but it would need little bit more reg lanes to store the pointers
12:10Jeansf: but the cost should be neglible
12:14Jeansf: but nouveau sence, i'd offer the secondary better scheduling algorithm to the code, and fix multithreading at some point, other bugs i am not very current about
12:14Jeansf: with even
12:15Jeansf: things like HDMI audio, and reclocking and most the other ogl spec bugs, i know nothing about
12:19Jeansf: even if you were to try to stay current with them, those are no gimmes, cause the codebase and other peoples additions are continuosly so large and changing , i.e code is too large to monitor
12:24Jeansf: well i dunno maybe it can be done, but i have let it slide, imo it requires 100percent dedication daily basis, to be able to fix others bugs and stay always current with the code
12:26RSpliet: funfactor: I don't quite know what it even is you're REing, but for all I know it's "we have ten replicated resources, and bit 6 is the enable bit". Just looking at numbers hardly ever gets you enough information
12:26Jeansf: funfunctor: all that is with LLVM is imo particularly rough code in the core, sorry to have been annoying you if you follow this, but tom stellard has managed it, i feel no joy to conflict with any of the LLVM folks
12:27RSpliet: mmiotrace existing drivers to understand in what context these bits are touched, see what the mask is of these registers, have a specific goal you're searching for rather than trying to understand every bit
12:27funfunctor: RSpliet: right that is what I am doing
12:28RSpliet: good, sorry if I'm stating the obvious then :-)
12:28funfunctor: RSpliet: the hw is a blackmagic decklink capture card
12:28funfunctor: RSpliet: nar thats ok
12:32funfunctor: RSpliet: I am looking at per 0x200 offsets from BAR0 since it looks to me like the state is replicated more or less https://paste.fedoraproject.org/523533/48393718/
12:32funfunctor: with only a few exceptions
12:32funfunctor: that state transition is interesting
12:33RSpliet: It did look that way, but presumably a write in the 0x0-0x200 area is not reflected in the 0x200-0x400 area?
12:33funfunctor: see line 9, i've identified that 067f300f is a register value and can toggled it from a method call to the vendor SDK
12:34funfunctor: RSpliet: that's a good question I will try it tomorrow when I am next by the hw
12:34Jeansf: karolherbst: i head out now, but generally aside from all the details, the rough easier guideline of the magic bullet is ready, and as soon as it becomes possible it needs a bit work to comprehend all the details for integration with current code
12:35funfunctor: RSpliet: I am just trying to get enough of a picture to slice the problem up a little from a large 64K window
12:35Jeansf: although it touches very few lines, in parallel hw, a minor change could behave absolutely differently, and it is still hard to maintain it
12:38Jeansf: so if this method is needed to be there for good and stay there, everyone who commit should understand the underlying principles, otherwise it would be broken very quickly with dirty commits
12:40funfunctor: RSpliet: maybe its a joke 404 not found 404 not found
12:45Jeansf: karolherbst: and also remember when you have regfile entry alias capability in hw, which is very easily designable on gpu, this thing is awesomely powerful feature, and almost alone with that anthying can be done, i read nouveau hunks of this feature, and of course it has it very long time
12:45Jeansf: i mean nvidia has indirect load and stores and i looked at nouveau files where it was used
12:46Jeansf: little more difficult is to design the scoreboard in hw, but all they have that too
13:14Jeansf: right, so , injection time, cheers.
13:32pmoreau: stikonas: ping
14:53Jeansf: but needs some testing, i am thinking if memory operations in seperate elements, i.e on range , when other elements have pointers to the same first element
14:54Jeansf: will the hw be wise enough still to skip, i.e not running this same command/instruction serially all over again
14:59Jeansf: but definitely if we recurse a long latency memory write on the 1st element then it said, all the readers will be skipping to runahead mode, and rescheduled once the memory operation finnishes
14:59Jeansf: it is said
15:02Jeansf: karolherbst: but i think you understand the hack now, i was accounting with that possibility when i talked to mannerov some time ago, you alias in contiguous address space that of reg pointers, instructions dest operand against the hw internals method to query the current thread or lane
15:04Jeansf: so every time it is written it comes alive and mask is write after the dest in address space, and data share operations will capture it
15:04Jeansf: and unlock the blocked alus
15:23Jeansf: the thing is there is pointer and pointee, in estonia viitaja ja viidatav
15:23Jeansf: you poll the pointer
15:25nyef: Ah! Found out where I re-broke HDMI audio on my MCP89.
15:26Jeansf: nyef: nice, have you managed to resume your eDP from sleep too?
15:27nyef: Jeansf: Yes, but I haven't worked out a fix that I'm comfortable putting into the kernel.
15:27Jeansf: nyef: well, absolutely fanstastic then
15:28nyef: Turns out there's a GPIO that needs to be set during (or after) resume that powers the panel up properly.
15:28Jeansf: i read some of that before from the logs, ok, nice catch/find though
15:31Jeansf: on cpu too, i forget how the pointers work all the time, those are really powerful, i have a video on youtube that i use when i start to forget about them
15:34Jeansf: nyef: i was pinging Lekensteyn , Peter Wu, about it, you had some non-standart mix of hw, he confirmed that all external monitors had resumed with his nvidia hw
15:38Jeansf: you are thinking like which one of the dest pointers gets toggled and how do you know which is the pc program counter to get mask from
15:38Jeansf: well it's about alternating between ordered and unordered views of data share operations, you connect it to so that it keeps an offset
15:39Jeansf: and branches spawn new data share operetions
15:42nyef: It's not a non-standard mix of hardware, it's that there's a certain bandwidth level that you have to hit before it makes sense to switch from LVDS to eDP, which means things like retina, 4K, or 120Hz, so relatively recent stuff.
15:42nyef: (At least, relatively recent in terms of laptops.)
15:43Jeansf: hmm, what you mean to switch, ah of course it's common, that you only use one of the monitors right?
15:44nyef: "Switch" as in a hardware design. Apparently they make laptops with more than one screen, but they're a bit rare.
15:45Jeansf: ok i understood now, there is certain frequency feed crcuit on hw then
15:45Jeansf: or band or something i mean
15:46Jeansf: nyef: you are pretty intelligent, i mostly ruin all the logs, i am ok in theory, but i do not know how to express myself, not native english speaker either
15:48nyef: LVDS has a bandwidth cap. eDP has a higher bandwidth cap. You can do 1920x1080 60Hz on LVDS, and it's presumably cheaper than eDP, so most systems still use LVDS. The hardware that I have contains both an LVDS and an eDP connector, but can only have one connected at a time. The cheaper screen uses the LVDS connector, and the better (more expensive) screen uses the eDP connector.
15:49Jeansf: makes sense i suppose
15:50nyef: So, there's definitely some economic considerations involved in the design, which would mean that eDP would be relatively rare until not so long ago. Retina macbooks might also use eDP, but they might also do something else crazy with power control because they're Apple hardware.
15:52Jeansf: well i spotted from the dcb docs, was it display connector block or something, that it is sommon that even when there are many connectors that only one at time works
15:52Jeansf: i dunno little weird bit catchy
15:53Jeansf: oops device control block
16:01Jeansf: nyef: my friend gifted me an AMD low power micro pc, it's a size of a cigarette pack
16:02Jeansf: this one has two HDMI outputs , i can at some point look, how would they behave
16:04Jeansf: AMD A10 Micro-6700T Quad core 1.2-2.2 GHz + Radeon R6
16:05Jeansf: i will do some work with this one at some point too, but i guess , tomorrow or so i buy wireless keyboard for it
16:05Jeansf: i get back my nvidia kepler gt730 card for my main pc too at some point
16:06karolherbst: good nes
16:06karolherbst: atlassian bought trello, so I guess we will need an atlassian account soon for trello :D
16:08nyef: Hrm. Looks like the audio infoframe configuration is optional (supplied from the HDA device?), but one of the bits on the "generic" infoframe can kill it anyway?
16:10Jeansf: but this fitpc, is very interesting piece of hw, it has 4,5W tdp, and onboard SSD for my version
16:14Jeansf: my friend , old friend has supported me with hw, and others too, during the bad time, i owe them all my know how, allthough they gave things for me for free, i think i could help them back at one point
16:19Jeansf: so though some closest family members and huge amounts of other people have always tried to get rid of me, during the worse times of probably or possibly when starting to fallapart, i have always people who have helped me too
16:23Jeansf: but ok, i remember year 2004-2006 when there had to be a big success and they were very close to knocking me out entirely, and there was out of the blue when i was basically duying , a guy who just helped me to get through of those times
16:24pmoreau: karolherbst: Hum…
16:27Jeansf: battle is still on, but i look like having survivied most the attacks now, those were the roughest times, i received around 3000-5000 dollars help from one fellow
16:29NanoSector: hmm, since kernel 4.10 I think my laptop uses a lot more power in idle
16:29NanoSector: right now it's using 14W compared to ~6/7W on kernel 4.8
16:30NanoSector: Nouveau also keeps suspending and resuming my GPU at seemingly random so I think that might be related?
16:32pmoreau: Nouveau shouldn’t be resuming your GPU, unless told to.
16:33NanoSector: pmoreau: dmesg shows nouveau resuming my GPU and then putting it to sleep a second or two later at random
16:33NanoSector: I'm also getting ACPI errors on boot since kernel 4.10
16:33karolherbst: there are various applications which do silly things
16:33karolherbst: every lspci call will resume your gpu
16:33nyef: It... why?
16:34karolherbst: because it reads the pci config space
16:34nyef: Or, better question, how?
16:34NanoSector: well unless something in Plasma 5 is continuously forgetting my laptop's specs and needs to recheck I don't know what is
16:34NanoSector: I'll upload my dmesg somewhere
16:34NanoSector: well I'll wait a little actually
16:35pmoreau: Have you tried reverting back to an older kernel while keeping the same userspace stack?
16:35NanoSector: pmoreau: I think 4.8/4.9 was fine, I could check both tonight
16:36NanoSector: I'm pretty sure 4.8 was fine since I was here diagnosing an issue where my GPU would never sleep with the new sleep method and it was eventually fixed causing powertop to report 7W again
16:36pmoreau: But, since you tried 4.8, did you update Plasma 5 or some other tools?
16:37NanoSector: I honestly don't know if it still happens on 4.8 so I'dn eed to recheck
16:37pmoreau: If 4.9 was fine, it shouldn’t be too difficult (hopefully) to bisect.
16:37NanoSector: I only remember this 'breaking' since 4.10
16:37pmoreau: Though there was the atomic modesetting serie…
16:37NanoSector: launching QupZilla seems to trigger nouveau
16:38NanoSector: yup that's it
16:38NanoSector: I think QtWebEngine probes available acceleration methods
16:39pmoreau: I would imagine
16:39karolherbst: why would they do that
16:39NanoSector: still my laptop uses 14W even with QupZilla closed
16:40pmoreau: Wait a bit for the card to go back to sleep, maybe?
16:40NanoSector: I closed it more than a minute ago
16:40NanoSector: I'll check if vgaswitcheroo reports it off
16:41NanoSector: wait, vgaswitcheroo isn't in /sys/class
16:41NanoSector: or in lsmod
16:41nyef: Time to check your /proc/config.gz to see if it's enabled at all?
16:41pmoreau: cat /sys/kernel/debug/vgaswitcheroo/switch
16:41pmoreau: (as root)
16:41NanoSector: heh, it's there, thanks pmoreau
16:42NanoSector: 1:DIS: :DynOff:0000:01:00.0
16:42pmoreau: Should be good then…
16:42pmoreau: Still saying 14W?
16:42karolherbst: it could be that something else is drawing more power or the gpu isn't turned off properly
16:42NanoSector: dinner's up though, will play around tonight :)
16:43pmoreau: Either l1k or Lekensteyn was looking at one case where the GPU was reported off, but was still consuming power
16:43pmoreau: I *think* there was a bug report, let me check
16:46pmoreau: The certificate for bugs.freedesktop.org expired today.
16:46pmoreau: Like 24 minutes ago
16:46karolherbst: I assume there is no new one? :D
16:46karolherbst: of course not
16:47karolherbst: like this is the first time this happens
16:47Lekensteyn: letsencrypt time?
16:48Lekensteyn: NanoSector: if your gpu is waking at random times, it is time to patch libdrm
16:48RSpliet: Hahaha, I wonder if NVIDIA intentionally tried to make a reference to the 1988 "The Titan Graphics Supercomputer architecture" ( http://ieeexplore.ieee.org/document/14344/?arnumber=14344 )
16:48pmoreau: "This site uses HTTP Strict Transport Security (HSTS) to specify that Firefox may only connect to it securely. As a result, it is not possible to add an exception for this certificate." …
16:49karolherbst: pmoreau: we have a phrase for that in germany: "einmal mit profis arbeiten" what lieterally means: working with professionals
16:50karolherbst: maybe you could add a "once" to that
16:50Lekensteyn: oh they are already using LE, their cron is probably broken
16:50pmoreau: I didn’t know "profis"
16:50pmoreau: Lekensteyn: LE?
16:51Lekensteyn: Let's Encrypt
16:51Lekensteyn: I reported to #freedesktop now
16:53RSpliet: Who's coming to SHA2017?
16:54pmoreau: Most likely not :-/
16:54RSpliet: I'm unsure right now tbh, need to see where other activities go
16:55karolherbst: RSpliet: I am most likely
16:55RSpliet: but if I do, it'd be nice to organise a hackathon there ;-)
16:55karolherbst: we should create a nouveau tent
16:55karolherbst: any villages we could add outselves to?
16:56RSpliet: I'm likely to stay in the hypothetical "Hack In The Random 2600NL Data Box" village... who last year decided to sit inside the family corner for reasons of peace and quiet
16:57RSpliet: but we can contemplate something nicer... if I go that is
17:24nyef: Got both 3D output and working audio. Progress!
17:28nyef: ... Working volume controls on the audio would be nice, but I suppose that's what pulseaudio is for.
17:31Jeansf: nyef: yeah it ticks on top of some other api though, dunno why it's being used as default but, yeah i have HDMI controls in alsamixer and pulsaudio gui also has them
17:33Jeansf: i use MINT those days, have tried several distros and their package management code, and sometimes still intermix, but debian based stuff has it all the packages those days
17:33Jeansf: on cinnamon there is an applet i suppose, cinnamon is the one i use those days
17:36Jeansf: it has all the gui controls for any of the sound mixer entries, and also, it is very nice system overall, every single Xinput option is also in gui environment
17:36Jeansf: basically all tunables are under options for any of the hw
17:40Jeansf: matthew allum caught my eye with different Xephyr hacks and X hacks, very long time ago, apparently, he is very successful programmer behind the cinnamon and clutter projects
17:44NanoSector: hmm powertop lists my gpu with 0% power usage
17:47NanoSector: no it's definitely something from 4.10 as 4.9 has the 'normal' power usage of ~7W
17:47NanoSector: <Lekensteyn> NanoSector: if your gpu is waking at random times, it is time to patch libdrm
17:47NanoSector: missed that, sorry
17:48NanoSector: it's not random but it's when i launch qupzilla, i just found
17:49NanoSector: pmoreau: so 4.9 and 4.8 work fine, it's just 4.10
17:50Jeansf: sometimes it seems like some do envision a great hacker some sort of small geek, who is use to consuming computers help, but in most parts when there is a question of adding some large amounts of open source code, usually it seems strong men do that, who can resist a lot of pressure, i.e are more the less over the life itself
17:52Jeansf: well look at allum matthew or brian paul, or keith whitwell those are very strong fellows, big guys
17:53Jeansf: mostly such commit to the code
18:05Jeansf: or earlier on there was those swedish guys david reveman, mostly big time coding needs the body to resist let alone to be over all the other issues sametime
18:08Jeansf: i watched some video about the most popular fps games development team, and same thing there too, tall and strong fellows were programming the stuff
18:30Jeansf: i am average though myself, troubles are too big , cause i am just an average guy i am unable to be over this stuff, so i could code big projects
18:35Jeansf: but i am enough of a talent to be explaining how things work sometimes though, taking lots of shortcuts embarrassingly though, that is the fuzz, i get very violative attitude , but most of them have been proven to be idiots anyways, every decent moral man is on my side anyways
18:40imirkin: nyef: hdmi audio doesn't have volume controls. you're supposed to have a volume control on whatever the output source is.
18:40imirkin: nyef: separately, perhaps CEC is achievable on nvidia, in which case you can control your tv's volume
18:40imirkin: [assuming your tv talks CEC as well]
18:40imirkin: i'm not aware of anyone looking into this for nvidia hw.
18:42imirkin: however recently linux has gained support for a CEC framework + hw support on some SoC
18:44Jeansf: in conclusion they are searching beef and bully all the time and bluff that they are someone, humiliating me publicly but non of the experts have never bought that circuis anyways
18:45Jeansf: AFAIK, possibly they can continue doing this here, it's just bunch of incapable persons and so it is...
18:45nyef: imirkin: Okay, that makes a certain amount of sense. And pa should be able to pre-scale the data before it hits the HDMI channel.
18:47imirkin: nyef: alsa can too. pulseaudio has ~0 actual uses
18:47imirkin: unfortunately someone decided to ship it by default in popular distros, so users end up having to deal with it
18:47nyef: alsamixer doesn't seem to have any controls that affect the sound level other than a simple mute.
18:49imirkin: that's right
18:49imirkin: you have to set up a software thing in front of it
18:50imirkin: nyef: http://alsa.opensrc.org/How_to_use_softvol_to_control_the_master_volume
18:50imirkin: look at the softvol thing
18:50nyef: Fair enough. I think that my next step is to try and get some patches ready for submission or at least review.
18:50pmoreau: NanoSector: Well… I guess you now have the great privilege of bisecting the issue! :-D
18:50imirkin: and/or dmix
18:51pmoreau: NanoSector: If you feel like it, of course :-)
18:51NanoSector: pmoreau: but i iz noob :P
18:52imirkin: NanoSector: time to lose the training wheels
18:52NanoSector: noooo D:
18:52Jeansf: i have some HDMI controls, but i never wired up my tv, which has HDMI inputs, and tried to control the volume via main cpu's mixer
18:53Jeansf: it's due to the work i am trying to do, still some stuff to think about
18:54NanoSector: pmoreau: honestly though I'm willing to provide debug data and stuff, but I don't know how to bisect issues (except for endlessly recompiling different revisions of the 4.10 kernel)
18:54imirkin: nyef: nevermind - according to aplattner, desktop gpu's don't have the hw to support CEC: https://devtalk.nvidia.com/default/topic/642712/hdmi-cec-support-/
18:54imirkin: NanoSector: search for 'git bisect' on the google machine
18:55NanoSector: oh that's in git, derp
18:55NanoSector: should've known
18:55pmoreau: NanoSector: Bisecting would require to recompile different versions of the kernel, except if you do the bisection on the out-of-tree Nouveau repo, in which case fewer compilation of the kernel should be needed
18:56NanoSector: which repo do i take? skeggsb's?
18:56Jeansf: it is not much to think about, but 1year and counting i have delt with the same subject, only to merge the code which is still something i should do, to escape the troll status
18:57NanoSector: aight will see if i can get anywhere
18:57pmoreau: Thanks! It should make it easier to find out what broke it.
18:57RSpliet: NanoSector: the only special thing about bisect is that it cuts the search space in half for every iteration (by taking the middle git revision and ditching the irrelevant half on your feedback). So instead of compiling n kernels, you'll compile 2log(n) kernels. Adds up if you don't know where to search for it :-)
18:59RSpliet: (eg. if you think it's one of a thousand commits on nouveau, you'll have 10 compiles at most. Make sure to point git bisect to the nouveau dir to have it make better decisions on how to reduce the search space, you don't want it to pick irrelevant points :-) )
19:04Jeansf: i don't pretend that anyone would talk to me, maybe nyef here knows if nouveaus driver has some AA infrastructure that can be enabled
19:04haagch: gnurou: hi there. do you have a tegra k1 for testing and does kde plasma run on it without locking up?
19:04Jeansf: msaa for instance
19:04NanoSector: hmm, so if i don't know where to start, I can simply mark the last commit as bad and the first as good?
19:05NanoSector: Jeansf: i don't understand what you're trying to say so i can't respond, sorry :P
19:05RSpliet: NanoSector: make sure not to include commits for 4.9, but yes
19:05NanoSector: i'm on branch linux-4.10
19:05RSpliet: yeah, it has history that goes *way* back :-)
19:05NanoSector: so not a good idea? :P
19:05Jeansf: Nanosector: well opengl has thing called multisampled antialias, but it needs hw support
19:05NanoSector: Jeansf: no before that I mean
19:05Jeansf: because every stage in the pipeline is affected by this in hw
19:06Jeansf: NanoSector: aah ok
19:06RSpliet: NanoSector: https://git.kernel.org/cgit/linux/kernel/git/stable/linux-stable.git/log/drivers/gpu/drm/nouveau?id=refs/tags/v4.9 that's the last commit for kernel 4.9.0. If 4.9 was good, then that top one is the one you want to mark as good :-)
19:06Jeansf: as you know there is a thing called post processing aa which would not require much of hw support though
19:07NanoSector: RSpliet: ahh, thanks
19:07RSpliet: (and again: definitely limit search to drivers/gpu/drm/nouveau only - otherwise you'd be searching through way more commits than necessary :-) )
19:08imirkin: NanoSector: is there anything to suggest it's a nouveau issue?
19:08NanoSector: oh I'm looking at https://github.com/skeggsb/nouveau
19:08NanoSector: imirkin: when my GPU is turned on under 4.9 I'm observing the same power usage
19:09imirkin: ok, so you're sure it's nothin in, e.g., the pci subsystem
19:09imirkin: or the acpi subsystem
19:09imirkin: or who knows where
19:09NanoSector: i have no clue
19:09imirkin: which is my point - unless you're damn sure it's inside nouveau, don't restrict it to just nouveau
19:09NanoSector: 4.9 and up started spitting ACPI errors related to the GPU but 4.9 doesn't seem like it's affected
19:10NanoSector: actually 1 is related to Intel GFX and one to NVIDIA
19:11NanoSector: 1.209840] ACPI Error: Method parse/execution failed [\_SB.PCI0.RP05.PEGP.DD02._BCL] (Node ffff8802568e3690), AE_NOT_FOUND (20160831/psparse-543)
19:11nyef: ... Looks like a backlight control problem?
19:11NanoSector: but that works fine :x
19:11NanoSector: [ 1.209824] ACPI Error: [\_SB_.PCI0.GFX0.DD02._BCL] Namespace lookup failure, AE_NOT_FOUND (20160831/psargs-359)
19:12NanoSector: other one which is for Intel I think
19:13nyef: Brightness Control Levels or something like that.
19:13NanoSector: but anyway that's pretty much the reason I'm suspecting Nouveau, even more since it had more issues with powering down my GPU in the past
19:13nyef: Part of the ACPI backlight driver.
19:14nyef:was digging through some of this stuff for his own system two weeks ago.
19:14NanoSector:is digging in the dark
19:16Jeansf: actually there has been not too many interest in this subject by the community, aside from some cool #asm guys, but i understand it is diffuclt to comprehend my talks, but i try again, pointers is sometimes difficult, they are used with intuation by me, but to explain is bit more difficult
19:16Jeansf: i might had messed even up, i try another round
19:18Jeansf: but actually there was intel thread, where one guy explained very very well how pointers on gpu work, and also AMD code comments, but this thread was not meant to be sharing, cause it contained confidential information
19:19Jeansf: the idea is basically that you can have large virtual address space numbers, which are aliased to regfile
19:21Jeansf: unlike on cpu , where pointers go through the memory, on gpu they do not
19:26Jeansf: so for instance if you have an instruction that writes to dest register, and you make a point that dest register against pointee which is a register that stores
19:27Jeansf: internally a waveID or Thread id register, which hw responds to, it will be signalled so, that
19:27NanoSector: Jeansf: i'm not even sure who you are talking to at this point, except to a void maybe..
19:27Jeansf: the dest follows the current thread lane
19:28Jeansf: so when you poll the pointee, with data share operations
19:29Jeansf: it will then determine the pc in code where it happened, gets the masks and unblocks the idle lanes you know
19:30Jeansf: because where as the registers in code are randomly presenting themselves, the pointers to them are in contiguous address space
19:30Jeansf: 48bits is allowed to use, which covers all the possible biggest shaders address space
19:36Jeansf: how the hw behaves is bit more complex, as it starts off by aliasing that address to element nr one in data share, it would not always poll that number and waste warps, when suddenly it has changed it's value to real address that has thread id in it
19:36Jeansf: then hw rechsedules to quickly execute that offset
19:40Jeansf: when it reschedules it for execution the poller having done no real work, fetches the address with simple wiring the pc where it happens , the mask from vgpr from the virtual address space, and unblocks the unneeded idling warp lanes
19:41Jeansf: then a simple connection repartitions this to a new instruction according to inverted masks
19:42Jeansf: and such code will perform 2times faster then nvidias default scheduling policy which is pretty good, but ontop of radeons fabulous driver with almost no scheduling it will do like 5x better
19:44Jeansf: i see from comments that for instance tom really understands how to do it ontop of AMD open source drivers, but it is still work in progress at this stage, but the drivers cpu code is so good that it will do miracles probably
19:45Jeansf: but real work is solving the bugs and hangs though
19:45Jeansf: where i am not that competent
19:45Jeansf: and generally do not have time for this
19:53imirkin: Jeansf: your specialty appears to lie in annoying the bejesus out of people who *are* competent and actually *do* the work. please stop.
19:56Jeansf: well i do stop, but i might had done some mistakes on describing how that works, but it is conceptually clear to me in code how that works, and i tried my best, and definitely it was correct trip towards pefromance and still is
20:03Jeansf: all the concept of vliw vs. SIMD/SIMT i.e the later gcn design, is that one the later one scales in performance with when giving it more waves though, when you have minor load it will work, but if you need bigger load, you run out of waves, because threads in the same warp are idling , scratching nuts aimlessly
20:06airlied: imirkin: fyi msg chanserv quiet #nouveau Jeansf or stuff like that
20:06imirkin: airlied: he's got other nicks
20:06airlied: wow he went as far as using tor to get in
20:06airlied: yeah quieting lets him think he is talking to channel for a while before he notices
20:06NanoSector: imirkin: you just banned everyone with tor and sasl, I guess
20:06imirkin: NanoSector: indeed i did.
20:07NanoSector: intended? :P
20:07imirkin: indeed it was.
20:07airlied: seems like a fine plan, I doubt we have many ppl who use it
20:07NanoSector: imirkin: indeed you did fine