alyssa changed the topic of #panfrost to: Panfrost - FLOSS Mali Midgard & Bifrost - Logs https://freenode.irclog.whitequark.org/panfrost - <daniels> avoiding X is a huge feature
alyssa has quit [Remote host closed the connection]
raster has quit [Quit: Gettin' stinky!]
stikonas has quit [Remote host closed the connection]
alyssa has joined #panfrost
kaspter has joined #panfrost
camus1 has joined #panfrost
kaspter has quit [Read error: Connection reset by peer]
camus1 is now known as kaspter
vstehle has quit [Ping timeout: 258 seconds]
archetech has quit [Quit: Leaving]
kaspter has quit [Ping timeout: 260 seconds]
kaspter has joined #panfrost
icecream95 has joined #panfrost
ezequielg has quit [Read error: Connection reset by peer]
ezequielg has joined #panfrost
icecream95 has quit [Quit: leaving]
davidlt has joined #panfrost
kaspter has quit [Ping timeout: 265 seconds]
kaspter has joined #panfrost
Green has quit [Quit: ...]
Green has joined #panfrost
kaspter has quit [Ping timeout: 256 seconds]
kaspter has joined #panfrost
kaspter has quit [Ping timeout: 265 seconds]
kaspter has joined #panfrost
davidlt has quit [Ping timeout: 272 seconds]
kaspter has quit [Ping timeout: 258 seconds]
kaspter has joined #panfrost
vstehle has joined #panfrost
archetech has joined #panfrost
_whitelogger has joined #panfrost
davidlt has joined #panfrost
<bbrezillon> warpme_: would you mind adding your Tested-by on danvet's patch?
raster has joined #panfrost
<warpme_> bbrezillon: oh yes. please :-)
stepri01 has joined #panfrost
<bbrezillon> warpme_: nevermind, it's already there
stikonas has joined #panfrost
archetech has quit [Quit: Textual IRC Client: www.textualapp.com]
yann|work is now known as yann
stepri01 has quit [Quit: leaving]
gcl_ has joined #panfrost
gcl has quit [Ping timeout: 272 seconds]
gcl has joined #panfrost
gcl_ has quit [Ping timeout: 260 seconds]
stepri01 has joined #panfrost
camus1 has joined #panfrost
kaspter has quit [Ping timeout: 258 seconds]
camus1 is now known as kaspter
<bbrezillon> stepri01: so, on a second thought, I'm not sure having the drm_sched_start() call protected by the reset_lock makes things easier. We still have a race if a timeout handler is called before the reset_lock is release, and in that case we never reset the GPU.
stikonas has quit [Remote host closed the connection]
stikonas has joined #panfrost
alpernebbi has joined #panfrost
stikonas_ has joined #panfrost
stikonas has quit [Read error: Connection reset by peer]
archetech has joined #panfrost
stikonas_ is now known as stikonas
gcl_ has joined #panfrost
gcl has quit [Ping timeout: 240 seconds]
<stepri01> bbrezillon: yeah I think we need a better way of coordinating the two schedulers. Rather than assuming that if the mutex is held then nothing needs to be done, the code should somehow check if the reset is going to happen and either bail out (if the reset is going to happen), or block waiting for the mutex to trigger another reset if necessary
<stepri01> kbase has a kbase_prepare_to_reset_gpu() function (and friends) which keeps track of whether a reset is in progress or not and ensures the dance happens correctly. But equally kbase doesn't have to deal with two schedulers driving the same GPU which is Panfrost's problem
<bbrezillon> yep
<bbrezillon> and drm_sched_job_timedout() seems to expect the timeout handling to happen synchronously (it restarts the timer after calling ->job_timedout()) , which is another problem
<bbrezillon> otherwise we could schedule another work to do the reset
<stepri01> yeah - kbase mostly makes reset aynchronous - it's natural for the GPU design, but sadly doesn't fit well with the DRM architecture
<stepri01> the other option is to simply stop using the reset hammer and actually handle job failure properly ;)
<bbrezillon> so maybe the solution is to allow asynchronous timeout handling
<stepri01> but there might be some bugs hiding which still require resets to recover from
<bbrezillon> yep, that's what I was about to ask
<bbrezillon> I guess we sometimes need a reset
<stepri01> kbase effectively has a watchdog for if the GPU stops behaving
<bbrezillon> so the problem remains
<bbrezillon> this being said, I'd like to find a fix that does not involve invasive changes :)
<tomeu> can't we just mainline kbase? :p
<alyssa> please no
<alyssa> :p
<Lyude> no
<Lyude> my response means nothing but i don't like kbase :P
<alyssa> tomeu: A channel op said it, how can you disagree ? :p
<kinkinkijkin> im not entirely sure what kbase is in this context and that frightens me
<alyssa> kinkinkijkin: Arm's kernel driver for mali midgard+
<kinkinkijkin> oh THAT
<macc24> nope nope nope nope
<alyssa> Notoriously legacy code, that's why we're here ;p
<kinkinkijkin> might as well merge the libmali blob into mesa while you're at it
<kinkinkijkin> blobs*
<HdkR> #include <libmali.so>
<daniels> i mean that was basically lima
<macc24> daniels: huh?
<daniels> the original lima driver used kbase, did extremely primitive command-stream construction in standalone demo programs, and linked to Mali's offline shader compiler DSO to do all the compilation
<macc24> that's... cursed...
felipealmeida has quit [Ping timeout: 256 seconds]
felipealmeida has joined #panfrost
<narmstrong> Luc Verhaegen is a weird guy, he spent an infinite amount of time r-e, but when asked to make something that could useful for long-term, he says na I prefer hacking my stuff on my side and do weird q3 demo while adapting the GL api to meet my hacks
<narmstrong> we lost 5years until Qiang take over, we could have lima upstream in linux & mesa in 2012
<narmstrong> it's insane
<HdkR> There's no need to bang on history. Everyone has different motivations that may not necessarily align with what people want.
<narmstrong> yeah no offense, he really changed thing by r-e lima
* alyssa <-- certified weirdo
<narmstrong> alyssa: all people on this channel could be certified weirdos :-)
<narmstrong> I mean idling on IRC on a GPU r-e dev channel
<alyssa> well.. :p
<kinkinkijkin> don't look at me i failed my weirdo certification failing to meet my exam date
<alyssa> 16 files changed, 777 insertions(+), 618 deletions(-)
<HdkR> Lucky 777
<alyssa> Let's see what CI says..
<kinkinkijkin> two deletions too far to meet 777 616
zkrx has quit [Quit: I'm done]
sphalerite has quit [Quit: nixos 20.09, here I come!]
raster has quit [Quit: Gettin' stinky!]
davidlt has quit [Ping timeout: 240 seconds]
zkrx has joined #panfrost
zkrx has quit [Quit: I'm done]
zkrx has joined #panfrost
raster has joined #panfrost
sphalerite has joined #panfrost
archetech has quit [Quit: Leaving]
Elpaulo has quit [Read error: Connection reset by peer]
Elpaulo has joined #panfrost
mmind00 has quit [*.net *.split]
mmind00 has joined #panfrost
kaspter has quit [Ping timeout: 260 seconds]
kaspter has joined #panfrost
remexre has quit [Read error: Connection reset by peer]
remexre has joined #panfrost
raster has quit [Quit: Gettin' stinky!]
raster has joined #panfrost
zkrx has quit [Quit: I'm done]
zkrx has joined #panfrost
alpernebbi has quit [Quit: alpernebbi]
karolherbst has quit [Quit: duh 🐧]