#lima on 2020-02-10 — irc logs at freenode.irclog.whitequark.org

2019-07-03 10:24 ChanServ changed the topic of #lima to: Development channel for open source lima driver for ARM Mali4** GPUs - Kernel has landed in mainline, userspace driver is part of mesa - Logs at https://people.freedesktop.org/~cbrill/dri-log/index.php?channel=lima and https://freenode.irclog.whitequark.org/lima - Contact ARM for binary driver support!

00:00 dllud has quit [Quit: ZNC 1.7.4 - https://znc.in]

00:00 dllud_ is now known as dllud

01:21 yuq825 has joined #lima

02:22 tlwoerner has joined #lima

02:55 dddddd has quit [Ping timeout: 265 seconds]

03:07 <anarsoul> yuq825: nice work with multiple pending submits (or rather jobs :))

03:08 <yuq825> thanks

03:09 <yuq825> extra free week due to the SARI

03:10 <yuq825> led to the work :)

03:11 <anarsoul> we should probably rename lima_flush_submit and lima_submit_flush since names are almost identical

03:14 <anarsoul> yuq825: I have suspicion that lima_flush_submit() doesn't work correctly for map_transfer() if transfer is read

03:15 <yuq825> rename lima_flush_submit to lima_flush_submit_for_bo?

03:16 <anarsoul> so in lima_transfer_map() we call lima_flush_submit() with write = true if transfer writes to BO and with write = false if transfer reads from BO

03:16 <anarsoul> but we need to flush any pending jobs that may write to BO if transfer reads from BO

03:17 <anarsoul> and probably do the same if transfer writes to BO just to be safe

03:18 <anarsoul> yuq825: something like 'lima_flush_jobs_writing_bo'?

03:18 <anarsoul> I have strong itch to rename submit to job :)

03:20 <yuq825> parameter write means the operation on the bo, not on the submit; when write=false, it will find submits that write to the bo and flush them, you may look at lima_submit_has_bo

03:22 <anarsoul> hm

03:22 <anarsoul> yeah, right

03:23 <yuq825> that's fine, I can change the name to resolve the ambiguity, but better to do it all at once in a last commit

03:23 <anarsoul> I wonder what's going on with FBO deqp tests in CI then...

03:23 <anarsoul> yuq825: yeah, either last or first commit should be fine

03:25 megi has quit [Ping timeout: 240 seconds]

03:27 <anarsoul> yuq825: do you have deqp locally to check what's wrong with regressed tests or you need some help with it?

03:33 <anarsoul> yuq825: don't we have to use "lima_flush_submit(ctx, bo, usage & PIPE_TRANSFER_READ);" in transfer_map() instead of '& PIPE_TRANSFER_WRITE'?

03:34 <anarsoul> we want to flush jobs that write to resource if usage is PIPE_TRANSFER_READ, and flush jobs that read the resource if usage is PIPE_TRANSFER_WRITE

03:34 <yuq825> I don't have deqp setup on my side, need some time to check it, help is appreciated

03:38 <MoeIcenowy> strange

03:38 <MoeIcenowy> this multi submit patchset contains a UnexpectedPass

03:39 <yuq825> anarsoul: so you mean change the parameter write's meaning to refer to the submit instead of bo?

03:39 <anarsoul> yuq825: no, I mean that we're currently passing incorrect parameter

03:40 <yuq825> MoeIcenowy: is it clear related?

03:40 <MoeIcenowy> yuq825: seems

03:40 <MoeIcenowy> dEQP-GLES2.functional.fbo.render.shared_colorbuffer.rbo_rgba4_stencil_index8

03:40 <MoeIcenowy> at least I think it's reload related

03:42 <anarsoul> MoeIcenowy: likely it skips unnecessary flush that we have with single job

03:42 <anarsoul> and thus doesn't need depth/stencil reload

03:42 <yuq825> MoeIcenowy: I changed the clear to optionally flush the submit, so two calls of glClear(color) and glClear(depth) will result in a single submit instead of two as before

03:56 <yuq825> anarsoul: "lima_flush_submit(ctx, bo, usage & PIPE_TRANSFER_READ);" will flush both GPU read/write jobs when CPU reads with PIPE_TRANSFER_READ and flush only GPU write jobs when CPU writes with PIPE_TRANSFER_WRITE

03:56 <yuq825> anarsoul: PIPE_TRANSFER_READ means CPU side

03:59 <anarsoul> right
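
The flush rule discussed above can be sketched as a small Python model (not the driver's C code). It assumes only what the conversation states: the `write` parameter describes the upcoming CPU operation on the BO, a CPU read has to wait only for pending jobs that write the BO, and a CPU write has to wait for jobs that read or write it. The `Job` class and the helpers `jobs_to_flush` and `transfer_map_flush` are hypothetical names for illustration, and the flag values are placeholders where only the bit test matters.

```python
from dataclasses import dataclass, field

# Illustrative flag values; the model only relies on them being distinct bits.
PIPE_TRANSFER_READ = 1 << 0
PIPE_TRANSFER_WRITE = 1 << 1


@dataclass
class Job:
    """Hypothetical stand-in for a pending submit/job."""
    name: str
    reads: set = field(default_factory=set)   # BOs the GPU job reads
    writes: set = field(default_factory=set)  # BOs the GPU job writes


def jobs_to_flush(pending, bo, write):
    """Return names of pending jobs that must be flushed before touching `bo`.

    `write` describes the upcoming operation on the BO:
    - write=False (CPU only reads): flush jobs that write the BO.
    - write=True  (CPU writes):     flush jobs that read or write the BO.
    """
    result = []
    for job in pending:
        if bo in job.writes or (write and bo in job.reads):
            result.append(job.name)
    return result


def transfer_map_flush(pending, bo, usage):
    """Model of the transfer_map() call site: pass `usage & PIPE_TRANSFER_WRITE`."""
    return jobs_to_flush(pending, bo, bool(usage & PIPE_TRANSFER_WRITE))
```

With one pending job that reads a BO and another that writes it, a read mapping flushes only the writer, while a write mapping flushes both. Under this model, passing `usage & PIPE_TRANSFER_READ` instead would invert that behaviour.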

04:06 buzzmarshall has quit [Remote host closed the connection]

04:13 <anarsoul> "dEQP-GLES2.functional.buffer.write.recreate_store.random_2" fails even with LIMA_DEBUG=singlesubmit

04:36 <anarsoul> "lima: use lima_submit_create_stream_bo for lima_ctx_buff" is first bad commit

04:50 <anarsoul> I've been eyeballing it for 15 mins but I don't see any obvious bugs in this commit :)

04:53 <anarsoul> commenting out "pipe_resource_reference(&pres, NULL);" in lima_submit_create_stream_bo() fixes it

04:53 <anarsoul> so apparently we're freeing resource too early or there's some other issue with BO ref-counting

05:05 <anarsoul> yeah, with LIMA_DEBUG=nobocache I get gpmmu fault on read

05:05 <anarsoul> darn

05:05 <anarsoul> :)

05:21 Barada has joined #lima

05:22 <anarsoul> yuq825: looks like it's some existing bug that was exposed by your MR

05:22 <anarsoul> we're freeing BO too early

05:57 <anarsoul> ouch

05:58 <anarsoul> yuq825: so the issue is that we're allocating gp uniforms via u_uploader

05:58 <anarsoul> but we allocate it and add it to submit only if it's dirty

05:58 <anarsoul> but then we're reusing va for each draw

05:59 <anarsoul> the issue is that BO may be gone by then

05:59 <anarsoul> same for pp uniforms

06:00 <anarsoul> I'd say we should probably use separate BOs for that

06:01 <anarsoul> one from u_uploader doesn't really fit this purpose

06:01 <anarsoul> and it makes no sense to hold 1mb BO if we haven't updated uniforms in a while

06:03 <anarsoul> same for textures

06:04 <anarsoul> *sigh*

06:08 <yuq825> anarsoul: oh right, I forgot another functionality of lima_ctx_buff which keeps BO across flushes, will drop this commit

06:09 <anarsoul> yuq825: we probably shouldn't keep BOs across flushes

06:09 <anarsoul> pp/gp uniforms should nicely fit into 4k (min BO size)

06:09 <anarsoul> same for texture descriptors

06:10 <anarsoul> we should use individual BOs for them and let BO cache do its job

06:10 <anarsoul> u_uploader uses 1M BOs and it's too wasteful to keep them around if we haven't updated uniforms

06:11 <anarsoul> yuq825: you can drop this commit for now and I can prepare an MR that uses individual BOs for uniforms and tex descriptors

06:14 <anarsoul> yuq825: also we don't re-add these BOs to submits even if you drop this commit

06:15 <anarsoul> and it means that these BOs may be freed before GPU reads from them

06:15 <anarsoul> i.e. consider following:

06:15 <anarsoul> allocate gp_uniforms (that adds them to submit)

06:15 <anarsoul> flush

06:15 <anarsoul> re-use gp uniforms (doesn't add them to submit)

06:15 <anarsoul> flush

06:16 <anarsoul> <GPU is busy with something else and hasn't processed last job yet>

06:16 <anarsoul> allocate new gp_uniforms, drop old BO

06:17 <anarsoul> <GPU starts processing the job, but BO is gone => MMU fault >

06:18 <yuq825> anarsoul: yeah, I forgot this too, that's why we add bo to submit in lima_ctx_buff_va before

06:18 <yuq825> not in alloc
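
The lifetime hazard walked through above can be modelled in a few lines of Python. This is a simplified sketch, not the driver's logic. It assumes a BO stays alive only while the context or a not-yet-finished job holds a reference to it, and that the first job has already completed by the time the context replaces its uniform BO (the log does not say so explicitly). `Job`, `run_scenario`, and the BO names are hypothetical.

```python
class Job:
    """Hypothetical queued job; `bos` are the BOs it keeps alive until it finishes."""
    def __init__(self):
        self.bos = set()


def run_scenario(add_on_reuse):
    """Replay the sequence from the log; return True if the GPU would fault.

    add_on_reuse=False: the BO is added to a job only when it is allocated.
    add_on_reuse=True:  the BO is added to every job that uses its address.
    """
    ctx_bo = "gp_uniforms_0"          # context allocates the uniform BO

    job1 = Job()
    job1.bos.add(ctx_bo)              # allocation adds the BO to the first job
    # flush: job1 is queued, and in this model it then completes

    job2 = Job()
    used_by_job2 = ctx_bo             # the second job reuses the same address
    if add_on_reuse:
        job2.bos.add(ctx_bo)          # re-adding keeps the BO referenced
    queued = [job2]                   # flush: job2 is queued but not started

    ctx_bo = "gp_uniforms_1"          # context allocates a new BO, drops the old one

    alive = {ctx_bo}
    for job in queued:
        alive |= job.bos              # unfinished jobs keep their BOs alive

    # When the GPU finally runs job2, it faults if the BO it reads is gone.
    return used_by_job2 not in alive
```

In this model `run_scenario(False)` reports a fault and `run_scenario(True)` does not, which matches the point that the BO has to be added to the submit wherever its address is used, not only where it is allocated.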

06:19 <anarsoul> ah, I see

06:19 <anarsoul> then yeah, dropping this commit should be fine

06:19 <anarsoul> and should fix regressions in CI

06:21 <yuq825> also this commit need change "lima: remove lima_ctx_buff_va submit flags"

06:26 <yuq825> after filtering out the buffers that exist across flushes, we can get these two changes back

08:42 yann|work has quit [Ping timeout: 240 seconds]

09:29 yann has joined #lima

09:42 yann|work has joined #lima

09:43 niceplace has quit [Ping timeout: 260 seconds]

09:45 niceplace has joined #lima

09:45 yann|work has quit [Client Quit]

10:58 megi has joined #lima

11:33 Barada has quit [Quit: Barada]

11:56 dddddd has joined #lima

12:36 wiewo has quit [Ping timeout: 248 seconds]

12:39 wiewo has joined #lima

12:41 Barada has joined #lima

13:01 deesix has quit [Read error: Connection reset by peer]

13:05 dddddd has quit [Ping timeout: 268 seconds]

13:11 deesix has joined #lima

13:11 dddddd has joined #lima

13:17 deesix_ has joined #lima

13:17 dddddd_ has joined #lima

13:20 deesix has quit [Ping timeout: 265 seconds]

13:20 dddddd has quit [Ping timeout: 240 seconds]

13:20 dddddd_ is now known as dddddd

13:21 deesix_ is now known as deesix

13:40 buzzmarshall has joined #lima

13:44 Barada has quit [Quit: Barada]

13:45 buzzmarshall has quit [Quit: Leaving]

14:10 yuq825 has quit [Quit: Leaving.]

15:00 buzzmarshall has joined #lima

15:41 buzzmarshall has quit [Ping timeout: 260 seconds]

17:05 yann has quit [Ping timeout: 272 seconds]

18:07 marex-cloud has joined #lima

18:12 yann has joined #lima

19:30 <anarsoul> dEQP-GLES2.functional.fbo.render.repeated_clear.tex2d_rgb passes with LIMA_DEBUG=singlejob :(

20:32 Xalius has joined #lima

21:44 Elpaulo has quit [Read error: Connection reset by peer]

21:45 Elpaulo has joined #lima

21:54 buzzmarshall has joined #lima

22:32 warpme_ has quit [Quit: Connection closed for inactivity]

23:32 warpme_ has joined #lima

23:39 Xalius has quit [Remote host closed the connection]

23:54 <anarsoul> OK, fixed dEQP-GLES2.functional.fbo.render.repeated_clear.tex2d_rgb and dEQP-GLES2.functional.fbo.render.repeated_clear.tex2d_rgba

23:55 <anarsoul> there's still flakes and failures in dEQP-GLES2.functional.fbo.render.shared_colorbuffer.*