#panfrost on 2019-12-18 — irc logs at freenode.irclog.whitequark.org

2019-09-06 11:20 alyssa changed the topic of #panfrost to: Panfrost - FLOSS Mali Midgard & Bifrost - Logs https://freenode.irclog.whitequark.org/panfrost - <daniels> avoiding X is a huge feature

00:01 <alyssa> First try will be to act on instructions instead of cycles, which is not really correct but should ballpark it and save some complexity

00:01 <alyssa> It's just a heuristic, so let's give it a try?

00:04 icecream95 has quit [Quit: leaving]

00:07 icecream95 has joined #panfrost

00:11 * alyssa is scared this heuristic is going to totally faceplant

00:17 stikonas has joined #panfrost

00:26 _whitelogger has joined #panfrost

00:39 rtp_ has quit [*.net *.split]

00:39 mmind00 has quit [*.net *.split]

00:39 rtp has joined #panfrost

00:41 mmind00 has joined #panfrost

00:47 * alyssa debugging heuristic

00:48 <alyssa> I have LDUBO(r) computed exactly, and I'm working on a spill heuristic

00:57 megi has quit [Ping timeout: 245 seconds]

00:59 <alyssa> https://people.collabora.com/~alyssa/take1

01:00 <alyssa> ^ Not quite there yet, but I think getting there!

01:07 <chrisf> is looking pretty good

01:11 nerdboy has quit [Ping timeout: 248 seconds]

01:53 stikonas has quit [Remote host closed the connection]

01:58 lmcloughlin has quit [Quit: Connection closed for inactivity]

02:02 <alyssa> Over dinner I had the realization that the #1 problem by far is just predicting if there could possibly be any spilling -- we're not terribly interested in gradiations of spilling

02:02 <alyssa> So that means we don't need to guess the cost of spilling, it's more of a boolean thing

02:03 <alyssa> (Or even better - a probabiliy that a program will spill)

02:03 <alyssa> Unfortunately, while I barely know enough calculus for the boolean heuristic, I definitely don't know enough statistics for a probabilistic approach. Alas.

02:05 robmur01_ has quit [Ping timeout: 260 seconds]

02:05 <alyssa> That's a very good thing, because modeling spilling correctly is expensive, and approximating involves a lot of guesswork

02:06 Stenzek has quit [Ping timeout: 265 seconds]

02:10 Stenzek has joined #panfrost

02:13 <alyssa> So the next question is how do we *accurately* predict whether we will spill <===> the maximum register pressure

02:14 <alyssa> In theorry liveness analysis ought to do that

02:14 <alyssa> In practice liveness analysis doesn't quite capture everything, because of vector registers (SIMD :V), pipeline registers (sometimes live things need not spill), non-work registers, and spilled non-work registers

02:15 <alyssa> A simple live channel counting algorithm will underestimate, overestimate, overestimate, N/A respectively

02:16 <alyssa> But an overestimate should be okay, since that will bias towards reducing spilling instead of reducing UBO traffic

02:17 nerdboy has joined #panfrost

02:18 <alyssa> --Indeed. If instead of counting channels you count entire vec4s (even for just a scalar), and then pay *only* attention to spilling with no regard for the UBO stuff (since that will naturally sort it outself out at the moment), again ignoring threading effects for now...

02:19 <alyssa> (99.9% reduction in spilling, which is what we're after)

02:38 * alyssa is now experimenting with a threading heuristic as well

02:39 <alyssa> Doing one well however ... may not be strictly simple ....

02:40 <alyssa> :f

02:46 nerdboy has quit [Ping timeout: 246 seconds]

02:48 <icecream95> Interesting... I tried darkplaces and it has the same flickering problems that quakespasm has. Xonotic (which uses darkplaces) doesn't, though.

02:48 <alyssa> icecream95: What is the flickering problem exactly/

02:48 <icecream95> There is an issue about it on Mesa gitlab

02:49 <alyssa> IIRC I couldn't see anything obviously wrong on the apitrace but maybe I'm mixing it up with a different issue

02:51 <icecream95> The apitrace trace for the other issue about quakespasm I made (textures missing) didn't show the flickering problem.

02:52 <alyssa> Ah

02:54 <icecream95> https://gitlab.freedesktop.org/mesa/mesa/uploads/b8aa4172034b9002971c5acef922321b/quakespasm.trace.xz

02:58 <alyssa> icecream95: Hmm, it's called quake*spaskm* =P

02:59 * alyssa tries a drastically simpler heuristic

03:07 nerdboy has joined #panfrost

03:59 nerdboy has quit [Ping timeout: 265 seconds]

03:59 vstehle has quit [Ping timeout: 265 seconds]

04:34 icecream95 has quit [Ping timeout: 268 seconds]

04:38 icecream95 has joined #panfrost

04:43 davidlt has joined #panfrost

04:47 <icecream95> alyssa: Here is an apitrace showing the currently broken shadows in darkplaces: https://gitlab.freedesktop.org/snippets/754

04:52 icecream95 has quit [Ping timeout: 252 seconds]

05:19 icecream95 has joined #panfrost

05:33 jolan has quit [Quit: leaving]

06:00 jolan has joined #panfrost

06:00 vstehle has joined #panfrost

06:09 icecream95 has quit [Quit: leaving]

06:18 icecream95 has joined #panfrost

07:13 megi has joined #panfrost

07:13 guillaume_g has joined #panfrost

07:42 nerdboy has joined #panfrost

08:27 yann has quit [Ping timeout: 246 seconds]

09:05 icecream95 has quit [Ping timeout: 258 seconds]

09:20 yann has joined #panfrost

10:55 raster has joined #panfrost

10:56 <bbrezillon> alyssa: would you be okay with such a change => http://code.bulix.org/rq6ss3-1021200 ?

10:57 <bbrezillon> I just resumed working on the vk implem, and knowing descriptor sizes is important (things are allocated from pools, and we need to reserve memory ahead of time)

10:58 <bbrezillon> this patch is not mandatory of course, but it makes job desc size calculation much easier

11:00 stikonas has joined #panfrost

11:01 abordado has joined #panfrost

11:01 abordado has quit [Remote host closed the connection]

11:02 abordado has joined #panfrost

11:15 abordado has quit [Quit: Leaving]

11:15 abordado has joined #panfrost

11:56 abordado has quit [Remote host closed the connection]

11:57 abordado has joined #panfrost

12:12 davidlt has quit [Ping timeout: 268 seconds]

12:29 abordado has quit [Ping timeout: 245 seconds]

12:35 abordado has joined #panfrost

12:39 abordado has quit [Ping timeout: 248 seconds]

12:43 abordado has joined #panfrost

12:48 abordado has quit [Ping timeout: 248 seconds]

13:22 flacks has quit [Ping timeout: 250 seconds]

13:23 TheCycoONE1 has quit [Ping timeout: 246 seconds]

13:23 EmilKarlson has quit [Ping timeout: 245 seconds]

13:23 thefloweringash has quit [Ping timeout: 246 seconds]

13:37 tgall_foo has quit [Ping timeout: 265 seconds]

13:48 youcai has joined #panfrost

13:50 youcai has left #panfrost [#panfrost]

14:04 davidlt has joined #panfrost

14:15 CrystalGamma has joined #panfrost

14:40 tgall_foo has joined #panfrost

14:50 <alyssa> bbrezillon: NAK. We don't use the next_job_32 fields on any platform to avoid duplicating code paths, and the blob doesn't use next_job_32 on 64-bit platforms (so as long as we have an aarch64 board+blob for a given hw, we can get dumps)

14:50 megi has quit [Ping timeout: 268 seconds]

14:50 <alyssa> So just drop next_job_32 entirely and have a single 64-bit next_job field and avoid all the indirection :)

14:52 TheCycoONE1 has joined #panfrost

14:58 <tomeu> alyssa: deqp-gles3 run :) https://lava.collabora.co.uk/scheduler/job/2127182

14:58 <tomeu> maybe we should run 1 in 10 tests, to keep the run time low

15:07 <bbrezillon> alyssa: ok

15:07 nerdboy has quit [Ping timeout: 250 seconds]

15:08 <bbrezillon> alyssa: I guess I can keep the offset calculation in decode.c without keeping the next_job_32

15:12 <alyssa> tomeu: How do other drivers cope?

15:12 <tomeu> alyssa: by sharding

15:12 <tomeu> we can do that as well once we get all the kevins online

15:12 <alyssa> Ah :|

15:13 <tomeu> but for now I think we could run 1 in 10 tests easily

15:13 <alyssa> Yeah, let's do that if that's doable at all

15:13 <tomeu> do you think that would be helpful atm?

15:13 <tomeu> cool

15:13 <alyssa> At least to get a sense of where we're at?

15:13 <tomeu> there's some crashes we should fix

15:13 <alyssa> Also I think some of the slowness is from failing

15:14 <tomeu> we don't seem to be that far

15:14 <tomeu> I expect above 90%

15:14 <tomeu> by running 1 in 10, I expect to have decent coverage at a low cost

15:14 <tomeu> as tests are grouped by functionality

15:15 abordado has joined #panfrost

15:18 <tomeu> deqp-gles3 is 44k tests, but I think deqp-gles31 is fairly small

15:19 <tomeu> maybe we can also run 1 in 2 of gles31 or so, not sure how useful that would be

15:20 <alyssa> tomeu: GLES31 would not be useful right now, no

15:20 <alyssa> GLES3 we're a lot closer to (I don't think there are any big things missing for GLES3, just a huge number of small things -- so basically where we were at for GLES2 in early 2019)

15:31 flacks has joined #panfrost

15:31 EmilKarlson has joined #panfrost

15:31 thefloweringash has joined #panfrost

15:44 stikonas has quit [Remote host closed the connection]

15:46 EmilKarlson has quit [Write error: Connection reset by peer]

15:46 TheCycoONE1 has quit [Remote host closed the connection]

15:46 flacks has quit [Write error: Connection reset by peer]

15:46 thefloweringash has quit [Remote host closed the connection]

15:46 stikonas has joined #panfrost

15:57 TheCycoONE1 has joined #panfrost

16:01 megi has joined #panfrost

16:03 abordado has quit [Remote host closed the connection]

16:04 abordado has joined #panfrost

16:05 <tomeu> alyssa: whole 1/10 run: https://gitlab.freedesktop.org/tomeu/mesa/-/jobs/1190572

16:05 <tomeu> there's a few crashes we should deal with

16:06 <tomeu> and towards the end stuff went bad

16:19 <alyssa> tomeu: Hmm. I remember this awkward growing phase with dEQP-GLES2 bringup as well

16:20 * alyssa wishes she remembered how we got to our first clean (no crashes, minimal faults) run..

16:29 yann has quit [Ping timeout: 245 seconds]

16:32 <daniels> alyssa: months which are now even more of a blur than they were at the time?

16:33 <daniels> as for big things missing, well, we still need xfb

16:41 <alyssa> daniels: We have XFB! ..in a branch

16:41 <alyssa> lfrb: :1

16:43 guillaume_g has quit [Quit: Konversation terminated!]

16:46 davidlt has quit [Ping timeout: 248 seconds]

16:47 abordado has quit [Ping timeout: 252 seconds]

16:51 <daniels> alyssa: oh, I'm very aware :)

16:53 flacks has joined #panfrost

16:53 EmilKarlson has joined #panfrost

16:53 thefloweringash has joined #panfrost

16:54 cowsay_ has quit [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.]

16:54 cowsay has joined #panfrost

17:07 gcl_ has joined #panfrost

17:56 gcl_ has quit [Quit: Lost terminal]

18:42 davidlt has joined #panfrost

19:03 raster has quit [Quit: Gettin' stinky!]

19:35 icecream95 has joined #panfrost

19:42 yann has joined #panfrost

19:45 raster has joined #panfrost

20:23 davidlt has quit [Ping timeout: 258 seconds]

20:47 TheCycoONE1 has quit [Quit: User has been idle for 30+ days.]

20:49 stikonas has quit [Remote host closed the connection]

20:51 stikonas has joined #panfrost

21:02 adjtm_ has joined #panfrost

21:04 adjtm has quit [Ping timeout: 265 seconds]

21:07 hanetzer has quit [Quit: ZNC 1.7.1 - https://znc.in]

21:08 hanetzer has joined #panfrost

21:31 stikonas has quit [Remote host closed the connection]

21:35 phh has quit [*.net *.split]

21:35 phh has joined #panfrost

21:41 CrystalGamma has quit [Quit: Leaving]

21:59 raster has quit [Quit: Gettin' stinky!]

22:01 stikonas has joined #panfrost

22:16 robertfoss has quit [Ping timeout: 245 seconds]

22:23 robertfoss has joined #panfrost

23:18 TheCycoONE1 has joined #panfrost

23:45 nerdboy has joined #panfrost