#panfrost on 2019-08-27 — irc logs at freenode.irclog.whitequark.org

2019-02-15 17:52 alyssa changed the topic of #panfrost to: Panfrost - FLOSS Mali Midgard & Bifrost - https://gitlab.freedesktop.org/panfrost - Logs https://freenode.irclog.whitequark.org/panfrost - <daniels> avoiding X is a huge feature

01:04 vstehle has quit [Ping timeout: 272 seconds]

02:37 megi has quit [Ping timeout: 258 seconds]

02:38 rcf has quit [Ping timeout: 268 seconds]

02:53 davidlt has quit [Ping timeout: 248 seconds]

02:56 rcf has joined #panfrost

03:24 davidlt has joined #panfrost

03:33 hopetech has joined #panfrost

03:33 hopetech has quit [Client Quit]

03:34 hopetech has joined #panfrost

05:00 vstehle has joined #panfrost

05:35 davidlt has quit [Remote host closed the connection]

05:36 davidlt has joined #panfrost

05:43 davidlt has quit [Remote host closed the connection]

05:43 davidlt has joined #panfrost

06:51 pH5 has joined #panfrost

07:06 adjtm_ has quit [Ping timeout: 246 seconds]

07:06 megi has joined #panfrost

07:18 yann has quit [Ping timeout: 244 seconds]

07:49 adjtm_ has joined #panfrost

08:25 davidlt has quit [Ping timeout: 245 seconds]

08:48 yann has joined #panfrost

10:23 raster has joined #panfrost

10:26 raster has quit [Read error: Connection reset by peer]

10:30 raster has joined #panfrost

10:37 <bbrezillon> shadeslayer, alyssa, tomeu: is anyone working on the "allow job pipelining" task?

10:37 <bbrezillon> I was about to resume working on it

11:34 <robher> bbrezillon: Steven posted a patch on that (if using _NEXT registers is what you mean).

11:36 <bbrezillon> robher: not sure, I'll check. I was talking about the job serialization we have in mesa

11:37 <daniels> bbrezillon: no-one's picked that up, no

11:39 herbmilleriw has quit [Remote host closed the connection]

11:39 <bbrezillon> daniels: ok, thx

13:05 grw has joined #panfrost

13:16 <raster> oh dear

13:16 <raster> something regressed....

13:19 <daniels> raster: anything in particular?

13:19 <raster> buffer age/partial updates

13:19 <raster> weston works ok

13:19 <raster> but enlightenment is all messed up

13:20 <raster> nothing changed for us in that area

13:20 <raster> i just updated mesa+linux-next

13:21 <raster> and we havent added support for the new extension you pushed into mesa to pre-declare your update region before you draw

13:22 <raster> (forgot the name)

13:22 <raster> what i see seems definitely to be "buffer age is not agreeing with backbuffer data content"

13:22 <raster> so areas outside the update region are all black

13:23 <raster> tho it seems to leave some trails in some places -

13:23 <raster> anyway

13:23 <raster> time to hunt

13:23 <raster> unfortunately... i dont know if it was kernel or mesa that broke yet. have to figure it out. i guess mesa most likely

13:24 <bbrezillon> you can try to revert

13:24 <bbrezillon> 0c5633036195 panfrost: Workaround bug in partial update implementation

13:24 <bbrezillon> 65ae86b85422 panfrost: Add support for KHR_partial_update()

13:24 <raster> oh

13:24 <raster> that saves me time :)

13:24 <raster> i was going to start a bisecting run :)

13:25 <raster> my weston tho will be old-ish so not sure it even has support for this

13:25 <raster> so wondering why it didnt break

13:27 <raster> yup

13:27 <raster> those 2 reverted - it works again

13:27 <raster> :)

13:27 <bbrezillon> hm

13:27 <bbrezillon> can you try to revert only the first one?

13:27 <raster> that is my next port of call :)

13:28 <bbrezillon> alyssa: I wonder how we end up with damage_{width,height} = 0

13:28 <raster> gah

13:28 <raster> -Werror is causing it to not compile

13:28 <bbrezillon> what's the warning?

13:29 <raster> or is it?

13:29 <raster> let me do a clean build and see

13:29 <raster> i do get lots of warnings

13:30 <raster> hmm no - those arent causing it to err. gimme a min

13:32 <raster> oh damn what

13:32 <raster> i got a conflict?

13:32 <raster> how? ...

13:34 <raster> how on earth did i revert these before with no conflicts?

13:34 <raster> never mind i must have missed something

13:36 <raster> ummmm

13:36 <raster> it has reverted the wrong thing it seems... wth...

13:36 <raster> oh never mind

13:36 <raster> i have the revert still here

13:39 <raster> with only the earliest of those its still broken

13:39 <raster> so the first is the key problem

13:39 davidlt has joined #panfrost

13:40 <raster> ok

13:40 <raster> reading 65ae86b85422 i see the problem

13:40 <bbrezillon> raster: can you add traces printing the damage extent in panfrost_draw_wallpaper()

13:40 <raster> it ASSUMES apps are using khr partial

13:40 <raster> which they will not be because that extension didnt exist for a long time

13:41 <raster> yet buffer age did

13:41 <bbrezillon> normally no

13:41 <bbrezillon> I mean, I have a reset_damage_region() call

13:42 <raster> let's see

13:42 <raster> but my guess is this is wrong/off

13:42 <raster> let me undo my reverts and start printf debugging :)

13:43 <bbrezillon> I realize we don't do the reset when importing a resource

13:44 <bbrezillon> can you try with http://code.bulix.org/tsxqlv-849566 ?

13:46 herbmilleriw has joined #panfrost

13:52 <raster> yup

13:53 <raster> region reported is not "the whole buffer"

13:53 <raster> DMG REGION 72,104 -> 616,600

13:53 <raster> DMG REGION 568,560 -> 624,600

13:53 <raster> DMG REGION 576,544 -> 624,600

13:53 <raster> (minx/y -> maxx/y)

13:53 <bbrezillon> hm, so something is calling the ->partial_update() hook

13:54 <bbrezillon> raster: what's the res size?

13:54 <raster> 1920x1080

13:54 <raster> oh wait

13:54 <raster> wtf?

13:54 <raster> we DO have support for partial update?

13:54 <raster> i never thought we added it

13:55 <raster> wait

13:55 <raster> pause

13:55 <raster> this may be us

13:55 <raster> it was added in 2016...

13:55 <raster> well well

13:55 <raster> wth...

13:56 <raster> i didnt think we had it...

13:57 <raster> we've been doing this in x11, wayland clients and drm/kms ever since then

14:01 <bbrezillon> still doesn't explain why it doesn't work :)

14:02 <raster> i'm digging

14:05 <mmind00> urjaman: just saw in the logs that you seem to be doing x11 with panfrost ... any hints on what needs to be set? It seems to find panfrost for glamor, but things like glmark-es2 still seem to fall back onto llvmpipe ... glmark2-es2-drm works fine though, so it looks like some sort of x11 setting I'm missing

14:05 <alyssa> mmind00: What does es2_info say?

14:10 <urjaman> also i wouldnt know for i didnt try glmark (and i'm not running x11 w/ panfrost right now)

14:15 <daniels> raster: are you sure your partial_update use is correct? :)

14:17 <raster> i'm checking

14:18 <bbrezillon> daniels: I'm also not sure my partial_update() implementation is correct :)

14:18 <raster> it seems to be

14:18 <raster> i was just tracing our regions higher up the stack

14:18 <raster> they match the geometry that min/max x/y region give

14:18 <raster> a small snippet of debug:

14:19 <raster> BUFFER AGE: 2

14:19 <raster> MASTER CLIP 72 104 8x24

14:19 <raster> SWAP TIME...

14:19 <raster> REGION SET 72 104 8x24

14:19 <raster> DMG REGION 72,104 -> 80,128

14:19 <raster> the DMG REGION is from inside mesa

14:19 <bbrezillon> raster: did you try after adding the missing damage_reset() call in panfrost_resource_from_handle()?

14:21 <bbrezillon> raster: BTW, what are the symptoms?

14:21 <raster> oh no

14:21 <raster> not yet

14:22 <raster> umm

14:22 <raster> a black screen with some trails of update regions

14:22 <raster> with 1 frame seemingly mostly correct

14:22 <raster> (most of the screen redrew)

14:23 <raster> its just "buffer age/partial update" smells all over it :)

14:23 <bbrezillon> well, if reverting 65ae86b85422

14:23 <raster> gimme a bit

14:23 <raster> going to get more debug

14:24 <bbrezillon> makes it work, it's definitely related to partial-update :)

14:24 <raster> i wonder if perhaps we call eglSetDamageRegionKHR multiple times per draw/swap

14:24 <raster> let me check

14:24 <raster> like set it up for 1 region, then draw

14:24 <raster> then another, then draw

14:26 <raster> nope. we set them all up in advance in one go

14:26 <raster> would our scissor clips be the issue?

14:26 <raster> we specifically also set up scissor clips to be our update region

14:26 <raster> ?

14:28 <raster> interesting

14:29 <bbrezillon> can you check the fine rendering region (batch->{minx,maxx,miny,maxy}) in draw_wallpaper()?

14:29 <raster> when we set a single update region this code does the right thing for dmg region

14:29 <raster> when we setup > 1 ... it drops back to "full update"

14:29 <raster> https://pastebin.com/9sGFkCSN

14:30 <raster> was that the intent?

14:30 <mmind00> urjaman: but you didn't do anything specific I guess?

14:31 <bbrezillon> raster: nope

14:31 <mmind00> urjaman: I'm just guessing that I must be missing something very basic

14:31 <raster> :)

14:31 <daniels> raster: just to be clear, you know that partial_update and swap_buffers_with_damage take totally different regions, right?

14:32 <bbrezillon> raster: you mean several rects passed to a single partial_update() call?

14:32 <bbrezillon> or several calls to partial_update()?

14:32 <raster> the logic reading this does seem to only handle 1 region for update

14:32 <daniels> the partial_update spec has a pretty good visual explanation, and after writing a buggy Weston implementation, I went through and commented the hell out of all the surrounding code to explain the region juggling we go through

14:32 <raster> daniels: i know - for us they have always been 1:1

14:32 <daniels> ok, that's incorrect :)

14:32 <daniels> SwapBuffersWithDamage takes _surface-relative_ damage - the change relative to the last time you called SwapBuffers(WithDamage)

14:33 <daniels> SetDamageRegion takes _buffer-relative_ damage - the region you're intending to change relative to the last time you _used that buffer_

14:34 <raster> oh

14:34 <raster> for us i think that doesnt matter because we "accumulate"

14:34 <raster> let me mull it over but i think i've been here before

14:34 <daniels> so your swap region is going to be driven by only the area you're redrawing in that cycle; the partial-update region needs to look at buffer_age, and accumulate all the damage back to the last time that buffer was current

14:35 <raster> so if we're triple buffering our update region is this frame + last frame + frame before that

14:35 <raster> so we kind of redraw a moving window across N frames

14:35 <urjaman> mmind00: i did that revert that anarsoul linked above to get glamor (or even mpv) to eglInitialize() but that shouldnt have an effect if you have glamor working ...

14:35 <daniels> right

14:35 <raster> its not as efficient as it could be

14:36 <raster> but it kind of side-steps these issues .. i think...

14:36 <raster> (and if buffer age ever changes from being a constant value we throw our hands in the air and do a full redraw for that frame thus throwing in a full update in the pipeline if there is a hiccup)

14:37 <raster> so let me think

14:38 <raster> yeah a quick mull makes me think its moot due to the "over n frame accumulate" thing

14:40 <grw> hiya- is panfrost known to work/not work on amlogic s912?

14:40 <grw> i get this on boot- https://pastebin.com/xJ7JA3c7

14:40 <grw> using 5.3.0-rc6

14:42 <daniels> grw: unfortunately it is currently known to not work on T820/AmLogic

14:43 <grw> daniels: thanks :)

14:43 <grw> i remember seeing demo running on s912 a while back, guess there is some regression

14:44 davidlt has quit [Ping timeout: 246 seconds]

14:45 <daniels> yeah, unfortunately amlogic support has been a bit neglected over the past couple of months, so it's regressed pretty badly

14:45 <daniels> the only working platform we have right now is T860, as found in RK3399

14:45 <daniels> the T760 in RK3288 is semi-working, and the T820 in S912 should also be not _too_ far away from working

14:46 <daniels> we're hoping over the next few weeks to get T720, T760, and T860, into workable shape

14:46 <grw> hm i see, thanks. if i can help test anything let me know :)

14:46 <daniels> will do, but at this point the most help would be being adventurous and stepping in to do the fixing ;)

14:47 <anarsoul> urjaman: do clean rebuild, revert is not necessary

14:48 <urjaman> ok i'll try to remember that next time i test

14:48 <grw> will take a look to see where my error comes from but im afraid its beyond my abilities to fix

14:49 <raster> daniels: https://pastebin.com/zhnxbxAC

14:49 <raster> so let's assume 0 == actual updates for the current frame

14:50 <raster> thats where things changed this frame

14:50 <raster> 1 == what changes 1 frame ago

14:50 <raster> and 2 == 2 frames ago

14:50 <raster> every draw we actually have a sliding window and we actually draw the union of 0, 1, 2

14:51 <raster> but the 1, 2 regions have no actual changes in them - it's dumbly repainting what was drawn a frame ago

14:51 <raster> in general this isnt actually that bad in performance as most of the time those regions tend to be very close together or identical. but let's leave that aside

14:52 <raster> i dont think this would break the partial update as such if it was per buffer not per surface

14:52 <daniels> right, that's the only correct implementation if you don't have EGL_SWAP_BEHAVIOR_PRESERVED

14:52 <raster> in this case we can actually dumbly keep them 1:1

14:53 <daniels> you have to redraw the union regardless - you can optimise to only pass 0 to SwapBuffersWithDamage, but you have to pass (0 u 1 u 2) to both actual redraw as well as partial_update
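
The region juggling daniels describes can be sketched in C. This is a minimal illustration, not code from Mesa, Weston or Enlightenment: `struct damage_history`, `collect_buffer_damage()` and the two size caps are hypothetical names. It assumes the EGL_EXT_buffer_age reading of buffer age, where an age of N means the buffer holds the frame from N frames ago, so the current frame's damage plus the damage of the N-1 frames before it must be redrawn and passed to the partial-update call. Unioning more frames than that, as in the conversation above, stays correct and only costs extra redraw.

```c
#include <assert.h>
#include <stddef.h>

#define HISTORY_FRAMES 4 /* hypothetical: how many frames of damage we keep */
#define MAX_RECTS 16     /* hypothetical: cap on rects recorded per frame */

struct rect { int x, y, w, h; };

struct frame_damage {
    struct rect rects[MAX_RECTS];
    size_t count;
};

/* frames[0] is the current frame's damage, frames[1] the previous one, ... */
struct damage_history {
    struct frame_damage frames[HISTORY_FRAMES];
};

/*
 * Gather the rects to redraw (and to hand to the partial-update call) for a
 * buffer of the given age. Returns the number of rects written to out, or
 * -1 when the caller should fall back to a full redraw (age 0, age older
 * than the history, or not enough room in out).
 */
int collect_buffer_damage(const struct damage_history *hist, int age,
                          struct rect *out, size_t max_out)
{
    size_t n = 0;

    assert(hist != NULL && out != NULL);

    if (age <= 0 || age > HISTORY_FRAMES)
        return -1;

    for (int f = 0; f < age; f++) {
        const struct frame_damage *fd = &hist->frames[f];

        for (size_t i = 0; i < fd->count; i++) {
            if (n == max_out)
                return -1;
            out[n++] = fd->rects[i];
        }
    }

    return (int)n;
}
```

Only `frames[0]` would go to the swap-with-damage call; the list returned here is what both the redraw and the partial-update region need to cover.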

14:53 <raster> so as best i can tell here... we're submitting the right stuff if we are assuming regions 2, 1 are dumb "what was there before anyway" paints

14:53 <raster> yeah

14:53 <raster> we definitely could optimize the last bit for sure

14:53 <raster> but it's been kind of moot in real life data so never bothered out of simplicity :)

14:53 <daniels> (this is assuming that your buffer_age query returns 2)

14:53 <raster> so my mulling is... we're doing the right thing...

14:53 <raster> am i wrong?

14:54 <raster> i just want to not hunt in the wrong place

14:54 <raster> yeah

14:54 <raster> its returning 2

14:54 <raster> so my debug says

14:54 <raster> https://pastebin.com/xHQYGtni

14:54 <raster> the -- is my printf inside panfrost_draw_wallpaper()

14:54 <raster> anyway

14:55 <raster> the rest was from the gl engine code where its setting a bunch of regions (0, 1, 2) in one submission to eglSetDamageRegionKHR

14:55 <daniels> co-ord system is the right way up?

14:56 <raster> the master clip is our max possible scissor clip size per region we draw as we draw each

14:56 <raster> yeah

14:56 <raster> 0, 0 is top-left

14:56 <raster> :)

14:56 <raster> none of that gl goofiness :)

14:57 <raster> we set up transforms accordingly so we can think/work in "right way up" space

14:57 <raster> oh wait

14:57 <daniels> heh

14:57 <daniels> ...

14:57 <raster> i swapped damage and batch in my printf

14:57 <daniels> partial_update takes GL co-ord space, not display co-ord space

14:58 <raster> we convert

14:58 <daniels> to lower-left (0,0)

14:58 <raster> _glcoords_convert() does that

14:58 <raster> also handles rotations too

14:58 <daniels> for partial_update

14:59 <raster> we do that when sticking them on the rect list

14:59 <raster> actually i see the problem

14:59 <raster> --panfrost: rsrc: 0 0 1920x1080 | dmg: 0 0 1920x1080 | batch: 0 0 1920x1080

14:59 <raster> that does not match up with the small rects we submitted for our partial update rects

15:00 <daniels> rects

15:00 <raster> oh

15:00 <raster> hmm

15:00 <raster> then its right

15:00 <daniels> The list should consist of <n_rects> groups of four values, with each group representing a single rectangle in surface coordinates in the form {x, y, width, height}. Coordinates are specified relative to the lower left corner of the surface. It is not necessary to avoid overlaps of the specified rectangles.
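
The coordinate flip implied by the spec text daniels quotes can be sketched as below. This is an assumption-laden illustration: `rect_to_lower_left()` is a hypothetical helper, not the `_glcoords_convert()` raster mentions (which is not shown in this log and also handles rotation). It only converts an `{x, y, width, height}` rect from a top-left origin to the lower-left origin the spec asks for.

```c
#include <assert.h>

struct rect { int x, y, w, h; };

/*
 * Convert a rect given relative to the top-left corner of a surface into
 * one relative to the lower-left corner. Only y changes: the new y is the
 * distance from the bottom of the surface to the bottom edge of the rect.
 */
struct rect rect_to_lower_left(struct rect r, int surface_height)
{
    struct rect out = r;

    assert(r.y >= 0 && r.h >= 0 && r.y + r.h <= surface_height);

    out.y = surface_height - (r.y + r.h);
    return out;
}
```

For example, the 8x24 cursor rect at 72,104 on a 1080-line surface ends up at y = 1080 - (104 + 24) = 952 in lower-left coordinates.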

15:01 <raster> bbl

15:01 <raster> meeting

15:02 <daniels> don't worry if the damage co-ords are larger - for various reasons we don't submit each rect to reload as separate reload jobs, but we just take the largest possible union area and reload that

15:02 <shadeslayer> quick question, I noticed that we can have up to 4 color buffers attached to a panfrost job, but we check for one color buffer here https://gitlab.freedesktop.org/mesa/mesa/blob/master/src/gallium/drivers/panfrost/pan_context.c#L159-161

15:02 <shadeslayer> should that check be >=1 instead?

15:02 <bbrezillon> daniels: that's not entirely true

15:02 <shadeslayer> or, actually, <= 1

15:03 <bbrezillon> daniels: we actually pick the biggest damage region and reload around this damage box

15:04 <bbrezillon> so there might be up to 4 reload jobs (the 4 rects surrounding the biggest damage box)

15:07 <bbrezillon> and of course, we limit the rendering area to the union of all damage rects so we don't have to reload the whole FB
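
A rough C sketch of the "up to 4 reload jobs" idea bbrezillon describes: given an outer rendering area and the biggest damage box inside it, produce the rects that surround the box. This is not the code in `panfrost_draw_wallpaper()`; `struct box`, `box_area()` and `surrounding_boxes()` are hypothetical names, and the box layout (min inclusive, max exclusive) is an assumption made for the example.

```c
#include <assert.h>

/* Axis-aligned box; min is inclusive, max is exclusive. */
struct box { int minx, miny, maxx, maxy; };

int box_area(struct box b)
{
    return (b.maxx - b.minx) * (b.maxy - b.miny);
}

/*
 * Split "outer minus inner" into at most four non-overlapping boxes:
 * full-height strips to the left and right of inner, plus the pieces
 * directly below and above it. Returns how many boxes were written.
 */
int surrounding_boxes(struct box outer, struct box inner, struct box out[4])
{
    int n = 0;

    /* inner must lie within outer */
    assert(inner.minx >= outer.minx && inner.maxx <= outer.maxx);
    assert(inner.miny >= outer.miny && inner.maxy <= outer.maxy);

    if (inner.minx > outer.minx)
        out[n++] = (struct box){outer.minx, outer.miny, inner.minx, outer.maxy};
    if (inner.maxx < outer.maxx)
        out[n++] = (struct box){inner.maxx, outer.miny, outer.maxx, outer.maxy};
    if (inner.miny > outer.miny)
        out[n++] = (struct box){inner.minx, outer.miny, inner.maxx, inner.miny};
    if (inner.maxy < outer.maxy)
        out[n++] = (struct box){inner.minx, inner.maxy, inner.maxx, outer.maxy};

    return n;
}
```

When the damage box touches an edge of the outer area, the corresponding strip is empty and is skipped, so fewer than four boxes come back.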

15:07 <daniels> bbrezillon: yeah, correct, that was a glib and incorrect description

15:07 <daniels> the point being that we are very pessimistic at reload, so may reload a larger region than required

15:07 <bbrezillon> for sure

15:08 <daniels> but that's correct if not maximally performant

15:08 <narmstrong> trying to understand why `if (cfg->ias != 48 || cfg->oas > 40)` doesn't pass anymore on T820

15:08 <daniels> narmstrong: there's a thread about that

15:09 <narmstrong> daniels: oh

15:10 <narmstrong> must have missed it

15:10 <daniels> narmstrong: starting at https://lists.freedesktop.org/archives/dri-devel/2019-May/thread.html#218756

15:10 <shadeslayer> well, I guess what I'm asking is, a context can have more than one colour buffer attached right?

15:11 <narmstrong> daniels: oh, ok, but I don't understand why it worked without this at some point.... thanks for the pointer

15:12 <daniels> narmstrong: i've made many mistakes in my life, but knowing too much about MMUs isn't one of them

15:12 <daniels> shadeslayer: yeah, it can, but on the other hand Panfrost doesn't actually support simultaneous emission to multiple render targets

15:13 <daniels> the hardware doesn't support it, so we have to emulate it by running the fragment shader multiple times, masking the outputs so we write to one cbuf at a time

15:14 <narmstrong> daniels: no offense ! I only understand the basic concepts of IOMMU, I'm only wondering why it worked as-in on the initial panfrost patchset, and now no more, but it seems this `ias` check was there already...

15:17 <daniels> narmstrong: no no, absolutely no offence taken! :) i'm very happy & comfortable with not knowing the details of Arm MMUs. my brain is already full enough with random crap.

15:17 <daniels> narmstrong: so why it's regressed I have no idea ...

15:17 <shadeslayer> daniels: ok, so any reason we take 4 cbufs here https://gitlab.freedesktop.org/mesa/mesa/blob/master/src/gallium/drivers/panfrost/pan_job.c#L92-100

15:18 <narmstrong> I'll need a serious bisect now :-)

15:18 <daniels> shadeslayer: i couldn't tell you which part was incorrect

15:18 <shadeslayer> ok, so at least they're definitely at odds with each other?

15:18 <shadeslayer> alyssa: ^^ thoughts?

15:26 megi has quit [Ping timeout: 245 seconds]

15:37 <alyssa> shadeslayer: If there are multiple cbufs attached, it can't be scanout; MRT must be off-screen

15:38 <alyssa> So the check in context.c#L159 is right

15:38 <alyssa> The comment is misleading

15:38 <alyssa> daniels: "the hardware doesn't support it, so we have to emulate it by running the fragment shader multiple times, masking the outputs so we write to one cbuf at a time"

15:38 <alyssa> On T760+, I'm not sure if this is true

15:39 <narmstrong> ok I know now, the initial panfrost had :

15:39 <narmstrong> + .ias = 48,

15:39 <narmstrong> + .oas = 40,

15:39 <shadeslayer> alyssa: I see, any particular reason we only copy 4 cbufs then?

15:40 <alyssa> shadeslayer: We only support 4 cbufs

15:41 <shadeslayer> alyssa: but PIPE_MAX_COLOR_BUFS is 8 :S

15:42 <alyssa> shadeslayer: Yes, but we don't support more than 4 even on chips that do MRT

15:42 <alyssa> Desktop OpenGL requires 8 cbufs

15:42 <alyssa> OpenGL ES only requires 4, and Mali only does 4 (when it does MRT at all... T720 and T6xx don't do any and have to soft-emulate it)

15:42 <alyssa> Could we emulate 8 cbufs? Yes. Do we? No.

15:43 davidlt has joined #panfrost

15:43 <shadeslayer> alyssa: so maybe this loop can be shortened? https://gitlab.freedesktop.org/mesa/mesa/blob/master/src/gallium/drivers/panfrost/pan_job.c#L346

15:46 <alyssa> shadeslayer: I mean, it could be, but... it doesn't really matter, if that makes sense?

15:46 <alyssa> I would like to leave open the possibility of doing 8 MRT eventually; it's a lot of work but no need to make more

15:47 <shadeslayer> alyssa: fair enough

16:09 pH5 has quit [Quit: bye]

16:35 <mmind00> alyssa: es2_info says ... "libEGL_warning: DRI2: failed to create dri screen"

16:38 <alyssa> mmind00: That sounds bad

16:39 <alyssa> Are you sure you have glamor enabled?

16:40 <mmind00> at least X11 tells me in the log

16:40 <mmind00> [ 997.284] (II) modeset(0): glamor initialized

16:40 <mmind00> [ 997.284] (II) modeset(0): glamor X acceleration enabled on panfrost

16:40 <alyssa> Hmph.

16:41 <mmind00> but then the display is nicely non-standard on this scarlet ... with 1536x2048 pixels

16:42 <alyssa> scarlet? cute :p

16:42 <mmind00> glmark2-es2-drm needed the "--visual-config stencil=1:alpha=0" option though not sure if that figures into it

16:43 <mmind00> at least kde-plasma seems innocent ... lxde produces the same behaviour

16:46 <alyssa> https://people.collabora.com/~alyssa/scheduler.txt

16:46 <alyssa> So this will take the rest of eternity to implement.

16:46 <mmind00> "rest of eternity" doesn't sound thaaaat long

16:47 <raster> bbrezillon: yeah. the code looked pretty pessimistic reading it it seemed to want to do a bbox basically (well the inverse of one actually)

16:48 <anarsoul> alyssa: looks pretty much like utgard to me

16:50 <raster> bbrezillon: it seems the damage rect (the one NOT to copy in) doesnt match what we're submitting as regions

16:52 yann has quit [Ping timeout: 244 seconds]

16:53 <anarsoul> although we don't have condition register

16:54 <anarsoul> select takes two slots -- condition is output of smul; branch can consume condition directly (any reg)

16:57 <alyssa> anarsoul: Do you have the crazy scheduling reqs?

16:57 <anarsoul> like what?

16:57 <alyssa> All the VLIW stuff

16:58 <alyssa> Because of scheduling alone, the blob is getting better compiles than us

16:58 <alyssa> So I aim to fix that today

17:03 <alyssa> I feel

17:03 <alyssa> frozen not being able to start this um

17:03 <alyssa> time to put on some music and start breaking things.

17:03 <raster> bbrezillon: what damage_reset() call? cant find one in mesa grepping... ?

17:04 <anarsoul> alyssa: yeah, Utgard PP is also VLIW

17:04 <alyssa> anarsoul: Good luck.

17:04 <anarsoul> it's more or less figured out

17:04 <anarsoul> :)

17:04 <alyssa> The good luck was for writing a scheduler.

17:04 <anarsoul> we already do that

17:05 <anarsoul> I'm working on merging loads into instruction where it's used

17:05 <alyssa> :+1:

17:05 <anarsoul> well, "working"

17:06 <anarsoul> I know how to do that, but actual work hasn't been started yet

17:06 <alyssa> In other news I've seen all of "LUT" "VLUT" and "VSFU" used to refer to the fifth unit in the ALU... not sure which is the real Arm term, I think VLUT but who knows

17:07 <alyssa> (I've seen both VLUT and VSFU in public Arm stuff I think? I dunno)

17:08 <anarsoul> what is vlut?

17:08 <alyssa> anarsoul: vector lookup table

17:08 <anarsoul> load uniform/temporary?

17:08 <anarsoul> oh

17:08 <alyssa> The unit for sin/cos as well as an extra multiplier

17:10 <alyssa> The lookup tables are scalar but the multiplier/moves are vector so go figure

17:20 <anarsoul> same here

17:20 <anarsoul> :)

17:21 <anarsoul> we lower sin/cos to scalar for this reason

17:21 <anarsoul> but there's no dedicated unit for it in Utgard

17:22 <bbrezillon> raster: http://code.bulix.org/tsxqlv-849566

17:22 <raster> bbrezillon: ooh that...

17:22 <raster> sorry

17:23 <raster> skipped that

17:23 <raster> was going over scrollback

17:25 <raster> manually did that

17:25 <raster> lets see

17:27 <raster> hmm no

17:27 <bbrezillon> raster: can you be more specific about the damage rect mismatch?

17:27 <raster> doesnt help

17:27 <raster> ok

17:27 <raster> take a look at this

17:27 <raster> i'll explain

17:27 <bbrezillon> is it bigger? smaller? completely different?

17:27 <raster> https://pastebin.com/MkAZt1js

17:27 <raster> so the first few frames its all right

17:28 <raster> setting update region == what panfrost draw wallpaper thinks it has to do

17:28 <raster> the damage area is the whole buffer

17:28 <raster> then some smaller rects come through the pipeline

17:28 <bbrezillon> can you share the diff?

17:28 <raster> on screen u see a mouse cursor appear

17:28 <raster> and a cursor is blinking

17:29 <raster> also the scrollbar appears

17:29 <raster> thus those 3 regions as those went from not there to visible etc.

17:29 <raster> then after that the 8x24 is just the cursor blinking

17:29 <raster> so ok - the batch of 3 regions ends up with the damage rect being everything

17:29 <raster> odd that its that pessimistic... but ok

17:30 <raster> but the frame after that... its wrong

17:30 <raster> its the last region only from the frame with 3 regions

17:30 <raster> not THIS frame which is 8x24 etc.

17:31 <raster> so what we're pushing in via api is not matching what panfrost is thinking it should be internally frame by frame

17:31 <raster> well that frame there is a hiccup and is a sign of some issue

17:31 <raster> anyway

17:32 <bbrezillon> raster: again, it's hard to debug without looking at the code changes (those adding traces)

17:32 <raster> let me undo my reset dmg changes and share

17:32 <raster> sure

17:32 <raster> yup

17:32 <raster> the only trace IN mesa is --panfrost

17:32 <raster> the rest are from higher up

17:32 <raster> the EEE:

17:32 <bbrezillon> you should also print traces at the partial_update() level (dumping all rects) to make sure they match what the app expects

17:32 <raster> its debug in the code that calls egl/gles

17:33 <raster> that is indeed a good thing to do

17:33 <raster> tho my printfs like REGION[x] are like in the code right above the egl call

17:34 <raster> so i am pretty sure its not being messed up on the way in within the calling code

17:34 <raster> not sure my 1 line of debug is useful to you though :)

17:35 <bbrezillon> raster: indeed, I need more :)

17:35 <bbrezillon> first thing to do, dump the rects past to panfrost_set_damage_region()

17:35 <bbrezillon> *passed

17:36 <bbrezillon> and compare them to the REGION[x] traces

17:37 chewitt has joined #panfrost

17:37 <bbrezillon> I meant panfrost_resource_set_damage_region()

17:38 <raster> so happy to see someone else lazy enough to use x, y, w, h like me

17:38 <raster> and not width, height :)

17:38 <raster> well regions match on the way in

17:38 <raster> panfrost_resource_set_damage_region()

17:39 <raster> wait a sec....

17:39 <raster> i smell badnesses

17:39 <raster> this isnt inverse

17:39 <raster> its a bounding box

17:39 <raster> which is the INVERSE of what you want

17:40 <raster> so hmm 1 line high display. i provide 2 boxes

17:40 <raster> | [1] [2 ] |

17:40 <raster> if u take a bounding (min/max bounds as i read)

17:41 <raster> u'll create

17:41 <raster> | [bbox ] |

17:41 <raster> right?

17:41 <raster> but i as the caller have NO intention of updating the area BETWEEN [1] and [2]

17:41 <raster> because its not in my update rects....

17:42 <bbrezillon> there's 2 different things

17:42 <bbrezillon> the damage extent, and the biggest damage box

17:43 <bbrezillon> damage extent is used to restrict the rendering area

17:43 <raster> but the comments at least in panfrost_draw_wallpaper() tell me that you're going wallpaper boxes 1, 2, 3, 4 surrounding the damage rect

17:43 <bbrezillon> the biggest damage rect, yes

17:43 <bbrezillon> not the damage extent
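
The two quantities bbrezillon distinguishes here can be illustrated with a short C sketch. It is an illustration only, with hypothetical names (`struct box`, `damage_summary()`), not the actual `panfrost_resource_set_damage_region()` code: the damage extent is taken to be the bounding box of all submitted rects, and the biggest damage box is the single rect with the largest area.

```c
#include <assert.h>
#include <stddef.h>

/* Axis-aligned box; min is inclusive, max is exclusive. */
struct box { int minx, miny, maxx, maxy; };

static int box_area(struct box b)
{
    return (b.maxx - b.minx) * (b.maxy - b.miny);
}

/*
 * From a non-empty list of damage rects, compute:
 *  - extent:  the bounding box of all rects
 *  - biggest: the single rect with the largest area
 */
void damage_summary(const struct box *rects, size_t n,
                    struct box *extent, struct box *biggest)
{
    assert(rects != NULL && n > 0);

    *extent = rects[0];
    *biggest = rects[0];

    for (size_t i = 1; i < n; i++) {
        if (rects[i].minx < extent->minx) extent->minx = rects[i].minx;
        if (rects[i].miny < extent->miny) extent->miny = rects[i].miny;
        if (rects[i].maxx > extent->maxx) extent->maxx = rects[i].maxx;
        if (rects[i].maxy > extent->maxy) extent->maxy = rects[i].maxy;

        if (box_area(rects[i]) > box_area(*biggest))
            *biggest = rects[i];
    }
}
```

With raster's one-line example, boxes [1] and [2] give an extent spanning both, including the gap between them, while the biggest box is just [2]. Going by the explanation above, the extent restricts the rendering area and only the biggest box is skipped when reloading, so the gap is still reloaded from the old buffer contents.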

17:44 <raster> hmmm though damage extent is wrong

17:44 <raster> oh wait sorry

17:45 <raster> if its a bounds it isnt never mind

17:45 <raster> i have something at 0,0

17:45 <raster> actually no wait...

17:45 <raster> it is wrong as that should be the extents a frame earlier

17:45 <raster> its like its 1 frame out of sync

17:46 <bbrezillon> then it's likely the problem reported by daniels

17:46 <raster> what is submitted via panfrost_resource_set_damage_region doesnt match then the rects we get in panfrost_draw_wallpaper

17:46 <raster> my debug definitely seems to indicate that its an off-by-1 in time

17:46 <bbrezillon> can you paste the dump?

17:47 <raster> let me add some more for you and get u a patch :)

17:50 <TheCycoTWO> weston is failing to start on the latest mesa master - core dump

17:51 <daniels> TheCycoTWO: which platform, and can you please give a backtrace?

17:51 <daniels> if it's line 909 of kms.c, revert to 7.0.0 and it'll be fixed tomorrow

17:52 <TheCycoTWO> kevin (rk3399). I don't have symbols in my build - guess I should change that. It's calling munmap_chunk

17:53 <alyssa> My beautiful scheduler algorithm is becoming a lot less beautiful the more non-SSA SIMD quirks I have to deal with :c

17:54 <raster> bbrezillon: just adding some debug in getting buffer age and swap buffers as those are the important "clamping pieces" of the frame draw

17:57 <alyssa> Can reproduce Weston crash

17:57 <alyssa> bbrezillon: ^^

17:59 <raster> bbrezillon: https://pastebin.com/iQBnfxpC

17:59 <raster> sorry didnt get the buf age+ swap

17:59 <raster> looking

17:59 <raster> quick patch so far

17:59 <bbrezillon> alyssa: one of my patches?

17:59 <bbrezillon> weird that it passed CI?

17:59 <raster> but u will see that the input regions if slightly "exotic" dont match the regions decided on later in panfrost_draw_wallpaper

18:00 <alyssa> bbrezillon: 5882e0def97a47aff050f5a3f412b97a7f440e27

18:02 <bbrezillon> alyssa: is CI working?

18:02 * alyssa shrugs

18:02 <alyssa> Should I revert the patch?

18:02 <bbrezillon> yes please

18:03 <bbrezillon> actually, I can do it

18:03 <raster> bbrezillon: oh noes!

18:03 <raster> damage_extent->minx = 0xffff;

18:03 <raster> damage_extent->miny = 0xffff;

18:04 <raster> what will you do when we have backbuffers > 64k in size! :)

18:05 <bbrezillon> raster: I think the scissor state is also using u16

18:05 <raster> it probably is. was just joking :)

18:05 <bbrezillon> alyssa: just let me know if you want me to do the revert

18:05 <raster> just am reading code with my "looking for bugs" glasses on

18:05 <alyssa> bbrezillon: Yes, please.

18:05 <alyssa> (Either fix it or revert.)

18:06 stikonas has joined #panfrost

18:06 <bbrezillon> alyssa: I'll revert for now

18:07 <raster> i wonder if the panfrost_resource is even the same one

18:08 <raster> nope. it isn't

18:08 <bbrezillon> alyssa, TheCycoTWO: done

18:08 <TheCycoTWO> \o/

18:09 <bbrezillon> sorry for the mess

18:15 <raster> panfrost isnt even getting the same resource from the same texture

18:15 <raster> oh wait

18:15 <raster> no

18:15 <raster> it is

18:16 <alyssa> Okay, bite size task -- let's schedule without any of the register pressure estimation bits (no S-U)

18:16 <raster> same ptr anyway

18:16 <raster> but a different rsrc

18:16 <raster> oh no wait

18:17 <raster> sorry mixed up frames

18:17 <raster> yeah

18:17 <raster> pipe resource changes

18:25 <raster> bbrezillon: i wonder if.... somehow the update regions are being stored on an fbo we have bound in context instead of the left backbuffer?

18:30 adjtm_ has quit [Ping timeout: 245 seconds]

18:39 chewitt has quit [Read error: Connection reset by peer]

18:39 chewitt has joined #panfrost

18:42 <bbrezillon> raster: https://gitlab.freedesktop.org/mesa/mesa/blob/master/src/gallium/state_trackers/dri/dri2.c#L1875

18:42 <raster> yeah

18:42 <raster> found that already

18:43 <raster> and that resource != the one found in panfrost_draw_wallpaper

18:43 <raster> for now i don't know why (just don't know mesa code well enough to just know where to look)

18:43 <raster> but i'm looking about

18:43 <raster> i just printf'd the ptr for that and it differs

18:43 <raster> thus why the rects differ

18:45 <raster> ctx->pipe_framebuffer.cbufs[0] <- is my current "culprit" i'm looking into

18:47 <bbrezillon> raster: you might want to look at panfrost_set_framebuffer_state()

18:48 <raster> hmmm

18:50 <bbrezillon> you might be able to track when new FBs are bound

18:50 <raster> that was just what i was trying to do

18:51 <raster> i was beginning to suspect that the fb is bound later than expected

18:51 <raster> why - don't know

18:52 <bbrezillon> and we indeed execute the WP job on the currently bound FB

18:54 <raster> but the current bound fb->texture != the dri left backbuffer->texture

18:54 <raster> well not at the time the regions are set vs when the buffer is swapped

18:54 <bbrezillon> yes, maybe we shouldn't use this FB

18:54 <bbrezillon> let me check

18:55 <raster> the next frame when we start adding regions its the same fb as in the previous swap

18:56 <raster> i am not sure my userspace code is wrong as reading the specs quickly this only applies to backbuffer(s)

18:56 <raster> not fbo's etc.

18:56 <raster> well it keeps talking about backbuffers

18:56 <raster> (any postable surface)

18:58 <bbrezillon> can you try with that => http://code.bulix.org/66sy81-849758

19:00 <bbrezillon> sorry, http://code.bulix.org/myun77-849761

19:01 <raster> some typo fixed

19:01 <raster> yeah

19:01 <raster> job->batch

19:01 <raster> :)

19:01 <raster> still problems

19:02 <raster> didnt fix it

19:02 <bbrezillon> hm, that's really weird

19:02 <raster> it changed which buffers are "Wrong"

19:02 <bbrezillon> the res pointers still don't match?

19:06 <raster> yup

19:06 <raster> still not

19:06 <raster> https://pastebin.com/LaBKKdGG

19:06 <raster> so u can see the buffer age query

19:07 <raster> then submit a bunch of regions with pres=0xaaaaf2979fb0

19:07 <raster> then.... stuff... (rendering)

19:07 <raster> then swap and in that ... pres=[0xaaaaf23efd60]

19:07 <raster> not a match

19:08 <raster> and of course thus rects dont match etc.

19:08 <raster> those master clip printfs mean we're going to set a scissor clip of at least that OR more restricted if we have to

19:09 adjtm_ has joined #panfrost

19:10 <bbrezillon> raster: can you add traces in panfrost_set_framebuffer_state()

19:11 <raster> sec

19:11 <bbrezillon> ideally with "%s:%i", __func__, __LINE__

19:11 <raster> ga

19:11 <raster> ok
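
The `"%s:%i", __func__, __LINE__` suggestion above can be wrapped in a small macro so that every trace line records where it came from. This is only a sketch: `TRACE` and `trace_rebind` are hypothetical helpers written for illustration, not existing Mesa macros or functions.

```c
#include <stdio.h>

/* Hypothetical helper, not part of Mesa: prefixes each message with the
 * calling function and line number. Needs at least one format argument. */
#define TRACE(fmt, ...) \
    fprintf(stderr, "%s:%i: " fmt "\n", __func__, __LINE__, __VA_ARGS__)

/* Example use: log the old and new resource pointers when a framebuffer
 * is rebound. Returns the number of characters written by fprintf. */
static int trace_rebind(const void *old_res, const void *new_res)
{
    return TRACE("[pres=%p] -> [pres=%p]", (void *)old_res, (void *)new_res);
}
```

Calling `trace_rebind(old, new)` from a hook such as panfrost_set_framebuffer_state() would print one line per rebind, tagged with the function name and line.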

19:11 mps has joined #panfrost

19:11 <raster> panfrost_get_job_for_fbo is called a lot :)

19:11 <alyssa> This is true.

19:12 <raster> shall look for gold in less noisy functions :)

19:12 <raster> bbrezillon: so yup

19:13 <raster> it's called just after we set our first master clip

19:13 <raster> scissor clip

19:14 <raster> lets see if its the wp batch

19:14 <bbrezillon> did you print the new/old resource addr?

19:14 <raster> not yet

19:14 <raster> its not the wp batch. ok

19:15 <alyssa> Who knew that rewriting history would be so hard

19:16 <alyssa> Anyway, I'm now able to traverse blocks backwards by source via the worklist so

19:16 <alyssa> that was like 1% of the work

19:17 <bbrezillon> raster: can you paste the new dump?

19:17 <mps> are there any guide for newbie to run panfrost on samsung chromebook one plus with T860 gpu

19:18 <alyssa> What distribution?

19:18 <mps> I have kernel 5.2.10 with panfrost enabled

19:18 <alyssa> OK

19:18 <mps> alyssa: Alpine linux

19:18 <alyssa> Not sure if anyone has tried on Alpine

19:18 <alyssa> But https://panfrost.freedesktop.org/building-panfrost-mesa.html is the usual guide

19:19 <alyssa> Er that's outdated

19:19 <mps> I built mesa with panfrost enabled and I have /usr/lib/xorg/modules/dri/panfrost_dri.so

19:19 <alyssa> Don't use the linked tree; your 5.2 kernel with panfrost enabled is perfect.

19:19 <alyssa> Alright.

19:19 <alyssa> What's the issue, then? It sounds like you should be panfrosting all over the place! :p

19:20 <mps> from Xorg.0.log 'falling back to /sys/devices/platform/display-subsystem/drm/card0'

19:20 <alyssa> Try with weston first.

19:20 <mps> no, I don't use weston

19:21 <alyssa> Please try.

19:21 <mps> never tried

19:21 <mps> uhm, ok will do

19:21 <mps> have no idea how to use it, TBH

19:22 <mps> but will look

19:22 <raster> bbrezillon: just extracting the surf->texture in an ugly way

19:24 <raster> so yes

19:24 <raster> this "overwrites" the current fb state with the new one

19:25 <bbrezillon> and I guess it triggers a flush

19:26 <raster> ok

19:26 <raster> one of them is in drawarrays

19:26 <raster> the other in swapbuffers

19:26 <raster> i'm grabbing backtraces

19:28 <bbrezillon> one of these set_fb_state() is caused by the u_blitter call done in panfrost_blit_wallpaper()

19:28 <bbrezillon> but this one should not trigger a flush

19:28 <raster> https://pastebin.com/jFV29D57

19:29 <raster> actually the first one in drawarrays screws it up as u can see the pres changed at that drawarrays

19:30 <raster> so the first bt is the problem

19:31 <bbrezillon> when is set_damage_region() called?

19:32 <raster> before the drawarrays, after getting of buffer age

19:32 <raster> EEE: BUFFER AGE: 2

19:32 <raster> EEE: REGION[0] SET 72 104 8x24

19:32 <raster> --panfrost: [pres=0xaaaaab0073d0] in -> region[0] 72 104 8x24

19:32 <raster> those tell me that :)

19:34 <raster> so the blit code here is doing the right thing

19:34 <bbrezillon> and the draw arrays is targeting this buffer, right?

19:34 yann has joined #panfrost

19:34 <raster> yup

19:35 <raster> well ok wait

19:35 <raster> the drawarrays COULD target an fbo

19:35 <raster> but i doubt it in these circumstances

19:35 <raster> but i can check what we bindframebuffer to

19:35 <bbrezillon> yep

19:37 <raster> but it shouldnt matter what fb we have bound or not, region stuff should always apply to the backbuffer here

19:37 <raster> we just use fbo's for offscreen rendering

19:37 <raster> not other buffers

19:38 <raster> yup

19:38 <raster> we bindefb to 0 at the start

19:38 <raster> and we dont ever change it ever again

19:43 <raster> double chedked

19:43 <raster> double checked

19:43 <raster> no other calls

19:43 <bbrezillon> raster: I think damage region is applied to the proper resource

19:43 <raster> i threw in printfs in the only others (they are used for special circumstances and not this basic stuff)

19:44 <raster> hmm

19:44 <raster> why does drawarrays mess it up?

19:44 <raster> or... is drawarrays doing the right thing?

19:44 <bbrezillon> it's the resource we get when doing the wallpaper blit that's wrong

19:44 <raster> and when we set region we should "push along some new cbuf fb's" ?

19:44 <raster> hmmm

19:44 <raster> is it?

19:45 <raster> i am not so sure

19:45 <raster> we attach our regions to a panres

19:45 <raster> but drawarrays uses a different panres

19:46 <raster> that is right?

19:46 <raster> (forget the wallpaper thing for now) ?

19:46 <bbrezillon> depends which FB the draw arrays is targeting I guess

19:47 <raster> well in this case there is only one "presentable" one

19:47 <raster> the backbuffer

19:49 <bbrezillon> so, we should have setDamage(backFB, ...), drawArrays(backFB), swapBuffers()

19:49 <raster> yup
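
The symptom being chased (damage rects attached to one resource while the swap-time lookup goes through another) can be modelled in a few lines. This is a toy model under stated assumptions, not the Mesa or panfrost code path: `struct res`, `struct ctx`, `set_damage`, `bind_fb` and `damage_at_swap` are all hypothetical names, and the fallback to the full surface is assumed from the "damage: 0 0 1920x1080" line in the traces.

```c
/* Toy model only; every name here is hypothetical. */
struct rect { int x, y, w, h; };

struct res {
    int width, height;
    int has_damage;          /* non-zero once a damage rect was attached */
    struct rect damage;
};

struct ctx { struct res *bound; };

/* Attach a damage rect to a specific resource. */
static void set_damage(struct res *r, struct rect d)
{
    r->has_damage = 1;
    r->damage = d;
}

/* Rebind the framebuffer to a (possibly different) resource. */
static void bind_fb(struct ctx *c, struct res *r)
{
    c->bound = r;
}

/* At swap time the damage is looked up on whatever is bound; a resource
 * that never received a rect falls back to its full extent. */
static struct rect damage_at_swap(const struct ctx *c)
{
    const struct res *r = c->bound;
    if (r->has_damage)
        return r->damage;
    struct rect full = { 0, 0, r->width, r->height };
    return full;
}
```

In this model, setting damage on one resource and then binding another before the swap yields the full-surface rectangle, which is the shape of the mismatch the pointer traces show.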

19:50 <alyssa> ../src/util/u_dynarray.h:171:59: error: invalid initializer

19:50 <alyssa> #define util_dynarray_append(buf, type, v) do {type __v = (v); memcpy(util_dynarray_grow_bytes((buf), 1, sizeof(type)), &__v, sizeof(type));} while(0)

19:50 <alyssa> Oy

19:50 * alyssa was missing an asterisk

19:51 <alyssa> Okay... I have instructions emitted in the right order, at least.

19:54 <bbrezillon> raster: can you print the old and new pres in set_fb_state()?

19:56 <raster> ...

19:57 <mmind00> alyssa: found my issue ... "someone" forgot to add my user to the render usergroup

19:58 <raster> let me not segv

19:59 <raster> bbrezillon: as expected

19:59 <raster> --panfrost: [pres=0xaaaad1852f70] in -> region[2] 952 528 40x40

19:59 <raster> EEE: MASTER CLIP 0 0 32x32

19:59 <raster> --panfrost_set_framebuffer_state [pres=0xaaaad1852f70] -> [pres=0xaaaad12c92b0]

19:59 <raster> ..

19:59 <raster> --panfrost: pres=[0xaaaad12c92b0], rsrc: 0 0 1920x1080 | batch: 0 0 1920x1080 | damage: 0 0 1920x1080

20:00 <raster> set fb state "loses" our rects

20:00 <raster> at least by the time we get to a swapbuffer its not the same panres

20:00 <bbrezillon> and the swap happens before the first trace?

20:00 <raster> and thus not the same rects

20:01 <bbrezillon> can you print the old -> new backbuffer res pointer in the swapBuffer call?

20:01 <raster> https://pastebin.com/7vR5rNjn

20:02 <raster> the ones that do the wallpaper batchie

20:02 <raster> i.e

20:02 <raster> if (ctx->wallpaper_batch) {

20:02 <raster> that if

20:02 <raster> dont change the fb state

20:02 <raster> well same texture to same texture

20:06 <raster> bbrezillon: should we continue tomorrow?

20:14 <alyssa> Argh, I gotta do a whole bunch more on RA now... just when I thought I was making forward progress... well I still am but

20:14 <raster> RA?

20:15 <alyssa> register allocation

20:15 <raster> aaaah

20:15 <raster> why bother?

20:15 <alyssa> Huh?

20:15 <raster> just use 1 register and swap in and out

20:15 <raster> al good

20:15 <raster> all good

20:15 * alyssa blinks

20:15 <raster> :)

20:15 <raster> ok. 2. luxury. src and dest operands

20:15 <raster> :)

20:16 <raster> no one ever needs more than 2 registers!

20:16 <raster> :)

20:17 * raster mumbles something about 640k...

20:17 <alyssa> 6502 had Y in addition to A and X

20:17 <alyssa> was a real luxury.

20:17 <raster> see? luxury!

20:17 <raster> superfluous registers

20:19 <raster> ok

20:19 <raster> i really do need to go

20:19 <raster> bbrezillon: we'll catch up tomorrow

20:19 <raster> i need dinner

20:19 <raster> already 9:20pm

20:20 <raster> time to roll -> my stomach is telling me :)

20:20 <raster> nite!

20:20 raster has quit [Remote host closed the connection]

20:26 * mmind00 was a total rebel and removed the blacklist for chrome from panfrost :-D

20:27 * mmind00 was a total rebel and removed the blacklist for chrome from panfrost :-D

20:27 <mmind00> whops ... wrong keyboard

20:36 chewitt has quit [Read error: Connection reset by peer]

20:37 chewitt has joined #panfrost

20:43 chewitt has quit [Quit: Adios!]

21:05 <alyssa> With out-of-order scheduling, most of glmark is working

21:05 <alyssa> Discards and csels are broken, so I guess fixing that is next.

21:12 <anarsoul> alyssa: don't forget about nir regs :)

21:14 <anarsoul> (or rather about write after read deps for nir regs)

21:22 <alyssa> anarsoul: What about them?

21:23 <anarsoul> you probably have to add them. Otherwise your out-of-order scheduler can reorder something like: r0 = r1; r1 = r1 + 1;

21:24 <alyssa> I think I handle that

21:30 <anarsoul> cool
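
The write-after-read hazard anarsoul describes can be checked with a simple pairwise test: two instructions must keep their order if the later one reads what the earlier one writes (RAW), writes what the earlier one reads (WAR), or writes the same register (WAW). The sketch below assumes a simplified instruction with one destination and two sources; `struct ins`, `reads` and `must_stay_ordered` are hypothetical names and do not reflect the Midgard compiler's actual data structures.

```c
#include <stdbool.h>

/* Hypothetical instruction form: dest = f(src0, src1); -1 means unused. */
struct ins { int dest, src0, src1; };

/* Does instruction i read register reg? */
static bool reads(const struct ins *i, int reg)
{
    return reg >= 0 && (i->src0 == reg || i->src1 == reg);
}

/* True if `later` must not be moved above `earlier`. */
static bool must_stay_ordered(const struct ins *earlier,
                              const struct ins *later)
{
    bool raw = reads(later, earlier->dest);   /* read after write  */
    bool war = reads(earlier, later->dest);   /* write after read  */
    bool waw = earlier->dest >= 0 && earlier->dest == later->dest;
    return raw || war || waw;
}
```

For `r0 = r1; r1 = r1 + 1;` the second instruction writes r1, which the first one reads, so the WAR term is true and the pair has to stay in order.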

21:31 <alyssa> Gah who knows rewriting so much code would break so much

21:32 <alyssa> Accomplishments for the day:

21:32 <alyssa> - Broke a bunch of shaders

21:32 <alyssa> - Caused performance to drop by 4x

21:35 <anarsoul> :(

21:35 <alyssa> Sounds like a good day to me!

21:35 <xdarklight> the good thing about this: if someone says that you "didn't do anything today" then you have numbers to prove them wrong

21:35 <anarsoul> (everything is fine meme here)

21:35 <alyssa> xdarklight: Yup!

21:43 megi has joined #panfrost

21:57 stikonas has quit [Remote host closed the connection]

21:58 <bbrezillon> daniels: are we sure drawable->textures[ST_ATTACHMENT_BACK_LEFT] is up-to-date when dri2_set_damage_region() is called?

21:59 raster has joined #panfrost

22:01 <bbrezillon> https://gitlab.freedesktop.org/mesa/mesa/blob/master/src/gallium/state_trackers/dri/dri2.c#L1875

22:02 raster has quit [Remote host closed the connection]

22:02 mps has left #panfrost [#panfrost]

22:03 raster has joined #panfrost

22:09 <bbrezillon> raster: you might want to try with http://code.bulix.org/jcn8f9-849883

22:10 <raster> bbrezillon: i'll try that tomorrow. dont have my panfrosty board accessible now

22:11 <raster> bbrezillon: but this seems odd that you have to dig into common infra here that drivers share....

22:12 <bbrezillon> raster: the generic partial_update() have been added for panfrost

22:12 <bbrezillon> *generic partial_update() bits

22:12 <raster> oooh

22:12 <bbrezillon> so it's not been heavily tested

22:12 <raster> i didnt look

22:12 <raster> got it

22:13 <raster> i kind of dismissed that code as being "heavily/well tested and thus the issue must be inside panfrost"

22:13 <raster> well then thanks for that background... very useful :)

22:13 <raster> efl may have been helpful in finding a bug... let's see tomorrow :)

22:16 <alyssa> Well, I have csel working.

22:16 <alyssa> It breaks a ton of other stuff apparently

22:18 <alyssa> Oh, nah

22:28 <alyssa> grmb

22:34 raster has quit [Remote host closed the connection]

22:52 <alyssa> Anyway, got csel working correctly.

22:52 <alyssa> Next up, branching

23:05 <alyssa> Glump

23:29 tlwoerner has quit [Read error: Connection reset by peer]

23:45 tlwoerner has joined #panfrost