<bshah>
I am afraid I did not manage to bisect it (well, bisect is a strong word because I don't have any
<bshah>
erm
<bshah>
I don't have any good commit to begin with or which repo even
<bshah>
as I understand, previously the frequency of this was quite low, but now it's an almost instant GPU crash on rotation :(
<anarsoul>
so did it start with your compositor update?
<anarsoul>
or with mesa update?
<anarsoul>
or something else?
<bshah>
now that I think about it, I realize that pmOS, where I cannot reproduce this, has mesa 20.2, and the other systems where I can reproduce this have 20.3 or master
kaspter has quit [Ping timeout: 256 seconds]
camus has joined #lima
camus is now known as kaspter
<bshah>
hm or not
<bshah>
both had mesa 20.2.3 :(
<bshah>
20.3 was released just yesterday
Barada has joined #lima
Net147 has quit [Quit: Quit]
Net147 has joined #lima
kaspter has quit [Ping timeout: 265 seconds]
kaspter has joined #lima
Barada has quit [Quit: Barada]
Viciouss has quit [Ping timeout: 246 seconds]
<enunes>
bshah: can you upload all shaders you captured somewhere, just to take a look?
<enunes>
bshah: hmm nothing too weird, I don't remember seeing one with that many texture references
<enunes>
maybe trying to simplify the most complex ones and seeing if it makes a difference could be something
<enunes>
but kind of a blind shot
<bshah>
enunes: if I can somehow figure out exactly which shader is causing this then I can do something about it, currently I have almost 0 idea where to look :/
<enunes>
as in, finding wherever they are defined in Qt or something, patching the shader (mostly removing loops, conditionals, long calculation sequences and things that are hard to optimize), rebuilding that component, and running it to see if it makes a difference
<enunes>
I'd just try to grep some part of that shader in the sources for the involved components
<enunes>
again, I don't know if this will solve anything, but it would be interesting data to at least eliminate that
<bshah>
is there no way to add some debug in mesa or something to see what shader it is processing?
<bshah>
mostly because these 20-ish shaders are kind of spread across 5-6 different repos
<rellla>
you can try pp first, then gp, and see what the lima compiler does with the shaders and if there are any dubious things ...
<rellla>
thanks for the link
<enunes>
that probably requires some knowledge of how pp and gp work, I was thinking more of finding a way to help narrow the issue down without having to know that
<bshah>
huh, of course now with LIMA_DEBUG exported I somehow can't reproduce this
<bshah>
:'(
<bshah>
it has been running for like 4 mins, but that's already more than what I was able to get in the past
<enunes>
rellla: I'm a bit out of the loop in gpir, but it seems like it tries to schedule a node as it is, and if that's not successful it needs to insert a move for that node and try again later
<enunes>
in the loop at 1563, so the logic makes sense to me
e is now known as demiurge
<rellla>
true - i'm trying to figure out why we have an endless loop here ...
<enunes>
endless loop while compiling or endless loop in the generated code?
Viciouss has joined #lima
<rellla>
i tried the remaining deqp test, which hit the 512 limit, and if i drop sched_move i end up with a ready list which can't be scheduled any more and which generates no-op instructions until we reach 512
<rellla>
but i guess i have to re-add the sched_move again and see what it does...
<enunes>
yeah it is probably important, it is by design since the gp doesn't have registers, so those moves are needed to keep the values alive until their nodes can be scheduled
<rellla>
i guess the bug results in having (mul0|mul1|add0|add1|pass|cmpl) blocked with sched_moves, so we are not able to schedule the instruction anymore
<bshah>
sorry for the noob question ... so we have been trying to narrow down the issue, one thing I noticed is that this crash happens only when the device is locked, and after we stripped our lock greeter down to the bare bones it still caused the crash
<bshah>
now our current theory is that there is some issue wrt windows in the background stopping their rendering when the screen is locked
<bshah>
(well, not the windows, but the compositor deciding to not update any windows in the background)
camus has joined #lima
kaspter has quit [Ping timeout: 240 seconds]
camus is now known as kaspter
<enunes>
bshah: when you hit the issue, what does it say in status= ?