#panfrost on 2020-05-06 — irc logs at freenode.irclog.whitequark.org

2019-09-06 11:20 alyssa changed the topic of #panfrost to: Panfrost - FLOSS Mali Midgard & Bifrost - Logs https://freenode.irclog.whitequark.org/panfrost - <daniels> avoiding X is a huge feature

00:20 <HdkR> Heads up to the people using equipment hosted at my location. This weekend the network will be going down for about an hour or two

00:21 stikonas has quit [Remote host closed the connection]

00:22 <HdkR> Moving networking gear to a server rack and potentially stripping ATT's modem off my network (if eap_proxy works correctly)

01:08 <alyssa> HdkR: glhf :P

01:09 <alyssa> My instructions are just... vanishing

01:11 <HdkR> Sounds like a good optimization :P

01:11 <alyssa> lol

01:12 <alyssa> HdkR: Oh hey, that was the bug, thanks :)

01:12 <alyssa> was stuck for a while

01:13 rhyskidd has quit [Read error: Connection reset by peer]

01:14 rhyskidd has joined #panfrost

01:26 vstehle has quit [Ping timeout: 256 seconds]

01:43 kaspter has joined #panfrost

02:24 kaspter has quit [Remote host closed the connection]

02:24 kaspter has joined #panfrost

04:04 davidlt has joined #panfrost

04:11 _whitelogger has joined #panfrost

04:16 cowsay_ has joined #panfrost

04:17 cowsay has quit [Ping timeout: 256 seconds]

04:31 NeuroScr has quit [Quit: NeuroScr]

05:00 vstehle has joined #panfrost

05:06 davidlt has quit [Ping timeout: 256 seconds]

05:08 davidlt has joined #panfrost

05:10 cowsay has joined #panfrost

05:12 cowsay_ has quit [Ping timeout: 272 seconds]

05:19 buzzmarshall has quit [Remote host closed the connection]

06:07 kinkinkijkin has joined #panfrost

06:11 bnieuwenhuizen has quit [Ping timeout: 246 seconds]

06:43 <la-s> so I'm running weston with panfrost, but it's quite slow with a 4k resolution, is that expected, or is it just me that fucked something up? It's still faster than llvmpipe, but it's not that different.

06:47 <kinkinkijkin> which board?

06:48 <HdkR> That's like what, minimum 1GB/s memory bandwidth?

06:48 <HdkR> if 4k60
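
The bandwidth estimate above can be checked with a few lines of arithmetic. This is only a rough sketch: it assumes a 3840x2160 mode, 4 bytes per pixel (a 32-bit format such as XRGB8888) and exactly one full-screen pass per frame, none of which is stated in the chat. The function name is made up for illustration.

```python
def scanout_bytes_per_second(width, height, bytes_per_pixel, refresh_hz):
    """Bytes moved per second for one full-screen pass per refresh.

    Assumption: every pixel is touched exactly once per frame. A real
    compositor also reads client buffers and writes the framebuffer,
    so actual traffic is a multiple of this figure.
    """
    return width * height * bytes_per_pixel * refresh_hz


if __name__ == "__main__":
    rate = scanout_bytes_per_second(3840, 2160, 4, 60)
    print(f"{rate / 1e9:.2f} GB/s per full-screen pass")
```

Under these assumptions a single pass already comes to roughly 2 GB/s, so a lower bound on the order of 1 GB/s is, if anything, conservative.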

06:48 <kinkinkijkin> firstly panfrost is relatively young, secondly 4k60 is a *lot* of work, regardless of maturity of driver

06:50 <kinkinkijkin> on most boards, weston runs pretty slow even at 1080p with gpu accel

06:50 <kinkinkijkin> if you can, try with gnome3

06:51 <kinkinkijkin> ironically long-hated as being super slow, it's the only wayland compositor fast enough to maintain 1080p60 constantly on the majority of boards that can launch it

06:53 vstehle has quit [Quit: WeeChat 2.7.1]

06:53 vstehle has joined #panfrost

06:58 NeuroScr has joined #panfrost

07:01 <la-s> odroid n2

07:01 <la-s> huh, thanks!

07:01 <la-s> This works surprisingly well honestly

07:05 <HdkR> Wait, when did N2 start working in Panfrost? I thought it was still busted

07:08 <la-s> I don't know

07:08 <la-s> I literally just got it working

07:09 <icecream95> kinkinkijkin: Maybe we need to blacklist weston to stop people from using it

07:10 <la-s> using 5.7rc4 with the patches in github.com/superna9999/linux's amlogic/v5.8/bifrost branch along with adding arm,mali-bifrost or something to the compatible dt thing in the panfrost drm driver and mesa is just latest master with gitlab.freedesktop.org/tomeu/mesa's bifrost branch's patches.

07:11 <la-s> Anyway, why is weston so bad?

07:11 <la-s> I thought it was supposed to be good since it's kind of like a reference implementation of the protocols

07:12 <icecream95> la-s: Weston is pretty slow even on Midgard, but optimisation work hasn't started yet for Bifrost so everything is going to be very slow

07:12 <la-s> huh

07:12 <kinkinkijkin> because it doesn't see as much development as gnome 3, weston has stayed pretty much the same thing since it was originally put out

07:13 <kinkinkijkin> there's also sway and way cooler, but I haven't tried them on a non-x86 device yet

07:13 <la-s> going to try sway now since I know how to get that working easily

07:13 <kinkinkijkin> sway might not work, or will require a good amount of extra work nobody's done yet to get it working

07:14 <kinkinkijkin> like plasma 5 was when I got it working on my board

07:15 * tomeu cannot think of a good reason why weston would be slower than mutter

07:15 soldi_ has joined #panfrost

07:16 <soldi_> mutter saw some gpu magic happening to it some point in i think late 2018

07:16 <soldi_> weston hasn't yet had similar improvements last I checked

07:16 <icecream95> glmark2-es2 scores with Weston are half of what they are with Sway

07:18 kinkinkijkin has quit [Disconnected by services]

07:18 soldi_ is now known as kinkinkijkin

07:20 <la-s> related: is there any way to use mutter without the gnome shell?

07:56 <daniels> kinkinkijkin: weston has seen a surprising amount of development and optimisation, and we can do 4k60 even on terrible imx boards, so it's not an architectural issue

07:57 <daniels> kinkinkijkin: as a quick fix, try putting '[core]\nrepaint-window=15' in weston.ini

07:57 <kinkinkijkin> that wasn't me who needed that help

07:57 <kinkinkijkin> was la-s

07:57 <daniels> icecream95: ^ also try that

07:57 <daniels> the default repaint window aims to give you sub-frame latency, so clients can present in the same vblank cycle as they render

07:58 <daniels> but this is too aggressive for a lot of cases; 15 means that you always present in the next frame (like GNOME), but also gives you the whole refresh cycle to render, instead of half
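
Written out as a file, the setting daniels suggests would look roughly like the fragment below. The section and key names are taken from his message; the value is in milliseconds, so 15 takes up most of a 60 Hz refresh cycle (about 16.7 ms). Where weston.ini lives on a given system is not stated in the chat.

```ini
[core]
repaint-window=15
```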

08:00 <kinkinkijkin> I'd try that out but my only personal board is currently set up for some other stuff

08:00 <kinkinkijkin> no wayland for me yet

08:12 <icecream95> daniels: That doesn't seem to have any effect

08:14 <tomeu> icecream95: does top suggest that CPU is the bottleneck?

08:14 <tomeu> and if so, which process takes the most?

08:17 rhyskidd has quit [Remote host closed the connection]

08:19 <icecream95> tomeu: For glmark2-es2-wayland -b pulsar, weston uses 70% CPU, with 40% of that in the kernel

08:20 <tomeu> icecream95: wonder what perf top says then

08:20 <icecream95> v7_dma_clean_range is at 10% in perf top

08:31 raster has joined #panfrost

08:39 <daniels> yeah, that's super broken and sounds to me like a lot of texture uploads and/or ReadPixels fun

08:39 <daniels> can you please send your weston log, and also ideally the output of running `weston-debug timeline` from when you've started the compositor with --debug, and it is running slowly?

09:02 kinkinkijkin has quit [Read error: Connection reset by peer]

09:16 stikonas has joined #panfrost

09:21 * tomeu suspects it is related to missing modifiers support

09:24 <la-s> fwiw doing that doesn't fix the issue for me, but it might not be anything being slow honestly; there's just a big delay when I type something until it appears on the screen

09:31 <daniels> la-s: still shouldn't be the case ...

09:33 <icecream95> la-s: How responsive is using the cursor keys to rotate in es2gears_*?

09:34 <la-s> dunno, will try now

09:34 <la-s> I couldn't even get es2_info to work though I think

09:34 <la-s> display not found or something (WAYLAND_DISPLAY is set, does it use X?)

09:35 <daniels> ... is your session even accelerated?

09:36 <la-s> It says it is using panfrost and the g52 gpu

09:36 <la-s> daniels: can it still somehow not be using acceleration even if it says that?

09:36 <icecream95> la-s: Try passing --xwayland to weston

09:38 <la-s> es2gears_wayland works for me, but what do you mean using the cursor keys to rotate icecream95?

09:38 <icecream95> la-s: That only works in es2gears_x11

09:38 <la-s> lol

09:40 <icecream95> (and glxgears, but I'm not sure if GL_QUADS is supported on Bifrost yet)

09:41 <la-s> well, running with --xwayland causes it crash on startup

09:41 <la-s> * well, running with --xwayland causes it to crash on startup

09:42 <daniels> la-s: capturing the output of WAYLAND_DEBUG=client es2gears_wayland would say for sure

09:45 <la-s> daniels: well I captured it, but how can I tell?

09:49 <daniels> la-s: send me a URL and I'll look at it for you :)

09:54 <la-s> daniels: http://[240d:1a:164:8700:21e:6ff:fe42:3663]:8000/debug

09:56 <daniels> yeah, the fact it's using zwp_linux_dmabuf_v1 means that it's using hw accel

09:56 <la-s> thanks

09:56 <la-s> well, I'll try using sway now and see if it works

09:58 <daniels> [2673232.554] wl_callback@11.done(14232868)

09:58 <daniels> [2673233.467] wl_buffer@9.release()

09:58 <daniels> [2673233.853] wl_callback@12.done(14232868)

09:58 <daniels> there's some _really_ weird and broken client behaviour going on here

09:58 <daniels> it's asking for two frame callbacks in the same frame, and one isn't being processed for ... far too many milliseconds
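
One way to spot the pattern daniels describes in a WAYLAND_DEBUG capture is to group wl_callback.done events by their argument and flag any value delivered to more than one callback object. The sketch below is illustrative only: the regular expression is modelled on the three lines quoted above, and the function name is invented here rather than taken from any Wayland tool.

```python
import re
from collections import defaultdict

# Matches lines such as "[2673232.554] wl_callback@11.done(14232868)".
DONE_RE = re.compile(r"\[\s*(\d+\.\d+)\]\s+wl_callback@(\d+)\.done\((\d+)\)")


def duplicate_frame_callbacks(lines):
    """Return {done_argument: [(callback_id, log_time_ms), ...]} for every
    done argument that was delivered to more than one wl_callback object."""
    by_frame = defaultdict(list)
    for line in lines:
        match = DONE_RE.search(line)
        if match:
            log_ms, obj_id, frame_time = match.groups()
            by_frame[int(frame_time)].append((int(obj_id), float(log_ms)))
    return {t: cbs for t, cbs in by_frame.items() if len(cbs) > 1}


if __name__ == "__main__":
    sample = [
        "[2673232.554] wl_callback@11.done(14232868)",
        "[2673233.467] wl_buffer@9.release()",
        "[2673233.853] wl_callback@12.done(14232868)",
    ]
    for frame_time, callbacks in duplicate_frame_callbacks(sample).items():
        print(frame_time, callbacks)
```

Run on the quoted lines, this reports callbacks 11 and 12 both completing with the same argument, which is the "two frame callbacks in the same frame" behaviour being discussed.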

09:59 <daniels> i did at some point start untangling this, but the pseudo-winsys-abstraction-layer inside the Mesa demos didn't make it easy

09:59 <daniels> are you able to try with another client, like weston's simple-egl?

09:59 <la-s> sure

10:14 icecream95 has quit [Quit: leaving]

10:28 <la-s> daniels: http://[240d:1a:164:8700:21e:6ff:fe42:3663]:8000/

10:30 kinkinkijkin has joined #panfrost

10:55 NeuroScr has quit [Quit: NeuroScr]

11:33 <la-s> welp, sway doesn't really work. It launches, but the output is way too garbled and is mostly black. There are a lot of errors saying "Failed to add atomic DRM property: Timer expired" and "Plane 0 doesn't support format 0x34325258"

11:39 <robmur01> hmm, apropos of that, I had been wondering what the deal with atomic DRM is - last time I checked, `kmscube -A` refuses to work on RK3399, but it does on RK3328 with lima, so I assume it's not the display driver

11:46 raster has quit [Quit: Gettin' stinky!]

12:00 karolherbst has quit [Quit: duh 🐧]

12:10 stikonas has quit [Remote host closed the connection]

12:11 stikonas has joined #panfrost

12:44 raster has joined #panfrost

13:01 kinkinkijkin has quit [Read error: Connection reset by peer]

13:11 <alyssa> la-s: ...wait, GNOME works on Bifrost? what?

13:12 kinkinkijkin has joined #panfrost

13:12 <alyssa> not complaining but

13:17 karolherbst has joined #panfrost

13:19 <la-s> alyssa: sorry, haven't actually tried GNOME yet, but weston works, and is actually usable

13:19 <alyssa> la-s: neat :o

13:19 <alyssa> I mean, I know weston starts and gears go, but I didn't try actually using it since I don't have my n2 setup

13:19 <la-s> if you change ttys it fucks up a bit, and windows have to be a specific size to not be garbled

13:19 <la-s> I got firefox working too

13:20 <la-s> text input on higher resolutions is hard because of it's performance though

13:20 <la-s> its*

13:24 <alyssa> "windows have to be a specific size to not be garbled" This is interesting, I remember seeing this on t860 when first bringing it up

13:24 <alyssa> tomeu: ^ Could you look into it sometime maybe? I suspect stride issues.

13:24 <alyssa> actually wait

13:26 <alyssa> la-s: https://people.collabora.com/~alyssa/0001-panfrost-Align-widths-when-calculating-tiled-strides.patch

13:26 <alyssa> ^ Not even compile tested but should fix it :p

13:28 <la-s> thanks, will try

13:28 <alyssa> +1

13:32 * tomeu is happy to see that he's not needed :p

13:32 <alyssa> :P

13:32 <alyssa> let's see if it worked =P

13:33 <tomeu> alyssa: want me to clean up some of the patches you have accumulated in your branch?

13:33 <alyssa> tomeu: yes, please! :)

13:33 <tomeu> ok, I will find time for it tomorrow

13:37 buzzmarshall has joined #panfrost

14:17 <la-s> alyssa: it works!

14:17 <la-s> without any problems at all

14:17 <la-s> I could actually use this as my daily driver now I think

14:21 <la-s> there is this black border around windows though, and the background for the mouse pointer is also black

14:21 <la-s> minor inconvenience

14:35 <alyssa> la-s: hehe, nice :)

14:35 <alyssa> tomeu: ^ that'd be blending

14:36 <alyssa> la-s: I remember taking in this machine to school with early Panfrost + Weston... teachers never understood why I had to recompile mesa during class but maybe they never noticed either ^^

15:42 <la-s> alyssa: wow, you're still a student?

15:43 <alyssa> la-s: yeah, or at least, I was pre-COVID :p

15:43 <la-s> lol

15:44 <alyssa> Ostensibly studying at University of Toronto. currently University of Internet.ca

15:56 cwabbott_ has joined #panfrost

15:57 cwabbott has quit [Ping timeout: 240 seconds]

15:57 cwabbott_ is now known as cwabbott

16:15 <rando25902> sudo wpa_cli

16:15 <rando25902> whoops

16:22 <HdkR> sudo woodo

16:30 <rando25902> those were for library times

16:30 <rando25902> mostly

17:00 nerdboy has joined #panfrost

17:24 nerdboy has quit [Ping timeout: 256 seconds]

17:32 <daniels> la-s: ha, glad it works! :)

17:33 <la-s> yeah, it's really nice, just wish it would work faster on 4k, but it seems to me that that might be an issue with panfrost, oh well, at least it works pretty fine on lower resolutions

17:41 adjtm_ has joined #panfrost

17:45 adjtm has quit [Ping timeout: 246 seconds]

17:52 <daniels> yeah, Bifrost support is ... not quite perfectly optimised yet ;)

17:53 <Lyude> i'm surprised it works at all w/ weston o:

18:00 <daniels> Lyude: weston doesn't even use fbos!

18:00 <Lyude> oh, huh

18:01 <daniels> yeah, doing multiple render -> texture -> render passes is just too inefficient

18:21 stikonas has quit [Remote host closed the connection]

18:22 stikonas has joined #panfrost

18:23 bnieuwenhuizen_ has joined #panfrost

18:23 bnieuwenhuizen_ is now known as bnieuwenhuizen

18:33 raster has quit [Quit: Gettin' stinky!]

18:46 raster has joined #panfrost

19:58 bbrezillon has quit [Ping timeout: 272 seconds]

20:08 bbrezillon has joined #panfrost

20:09 davidlt has quit [Ping timeout: 246 seconds]

20:31 bbrezillon has quit [Ping timeout: 260 seconds]

20:51 icecream95 has joined #panfrost

21:08 bbrezillon has joined #panfrost

21:52 <icecream95> Problem: NetworkManager crashes after connecting to wifi. Solution: Don't let systemd restart NetworkManager. ++hacks;

22:00 <alyssa> icecream95: \o/

22:00 <alyssa> icecream95: BTW, I'm not sure derivatives were working before either..

22:00 <alyssa> https://people.collabora.com/~alyssa/0001-pan-mdg-Set-types-for-derivatives.patch does fix the crash but deqp tests still fail more often than not

22:01 <alyssa> (dEQP-GLES3.functional.shaders.derivate.*)

22:02 NeuroScr has joined #panfrost

22:07 <icecream95> alyssa: Some of the tests are failing register allocation, for example dEQP-GLES3.functional.shaders.derivate.fwidth.texture.basic.vec3_mediump

22:09 <alyssa> Uh oh.

22:10 * alyssa was hacking on the midgard RA just now actually, though not for that

22:10 <alyssa> (Hoping to improve perf a bit.)

22:13 <alyssa> Compile-time perf, that is.

22:15 <alyssa> (With the goal of making shader-db useable again, skia shaders being added makes things a lot slower.)

22:16 <alyssa> In particular, we don't *really* need per-byte.

22:16 <alyssa> Since it doesn't make sense to alloc a single byte on midg... 16-bit is the minimum that's properly atomic.

22:17 <icecream95> alyssa: Speaking of skia shaders, someone needs to implement blend_equation_advanced

22:18 <alyssa> Ah, yeah, the "let's stick Photoshop in OpenGL" extension

22:18 <alyssa> Nobody has a hw impl afaik

22:18 <alyssa> GLSL IR has the lowering in lower_blend_equation_advanced, but..

22:19 <HdkR> Nvidia is the only company that has a fast implementation. So it's pretty yolo everywhere else

22:19 <alyssa> 1) That should really be in NIR, GLSL is not the right place for it. Original author (Kayden) agreed iirc that rewriting in NIR is sane.

22:19 <alyssa> 2) For us, we generate blend shaders fresh from NIR without hitting GLSL at all, so.

22:19 rando25902 has quit [Ping timeout: 240 seconds]

22:20 <alyssa> I'm not sure if #2 is a hard constraint for blend_equation_advanced, though.

22:20 <alyssa> Traditionally it gets lowered to tilebuffer reads within the fragment shader (there's also a Mali extension for this unrelated to blend shaders), I'd assume blend shaders are the right place for us to do it but who knows

22:20 <alyssa> I haven't looked what the blob does

22:21 <alyssa> Almost certainly perf would be better with blend shaders since then we can key them appropriately.

22:24 <icecream95> alyssa: Possibly, but only once someone changes the work_count = 16 in panfrost_frag_meta_blend_update

22:24 <alyssa> Oops.

22:24 <alyssa> I keep forgetting that's still there :p

22:26 <alyssa> IIRC it should be MAX2(work_count_frag, work_count_blend)

22:27 <alyssa> ISTR the blob forces blend shader pressure <= 4 so you can probably get away with simpler but yeah

22:27 <alyssa> (4 is the first thread break, 8 is the other one)
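
The rule alyssa recalls can be sketched as below. This is a hypothetical illustration in Python, not Mesa code: both function names are invented, the MAX2 combination and the thresholds 4 and 8 come straight from the messages above, and what each bucket means for the hardware is not spelled out in the chat.

```python
def combined_work_count(work_count_frag, work_count_blend):
    """Work-register count when a fragment shader runs together with a
    blend shader, i.e. MAX2(work_count_frag, work_count_blend) as
    recalled in the chat, rather than a hard-coded 16."""
    return max(work_count_frag, work_count_blend)


def thread_break_bucket(work_count):
    """Classify a work-register count against the two "thread breaks"
    mentioned in the chat (4 and 8). Bucket 0 is at or below the first
    break, bucket 1 is at or below the second, bucket 2 is above both."""
    if work_count <= 4:
        return 0
    if work_count <= 8:
        return 1
    return 2


if __name__ == "__main__":
    count = combined_work_count(3, 7)
    print(count, thread_break_bucket(count))
```

With a blend shader kept at 4 registers or fewer, the combined count is driven by the fragment shader alone whenever the fragment shader uses at least 4, which is presumably why that limit lets one "get away with simpler".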

22:34 <icecream95> robmur01: kmscube -A requires PIPE_CAP_NATIVE_FENCE_FD, but other clients can do atomic DRM without it

22:39 <robmur01> icecream95: ah, I guess fences are a thing the render node has to be involved in too, so that's logical enough for my tiny brain - thanks!

22:51 <icecream95> alyssa: The register allocation failures in derivate tests were with an ancient distro Mesa - I had set LD_LIBRARY_PATH to somewhere that didn't exist

22:59 <alyssa> icecream95: Ah.

23:00 <alyssa> Ok, I think I just did a big loop and rederived LCRA albeit in a much saner way.

23:00 <alyssa> oops?

23:26 gcl_ has quit [Ping timeout: 240 seconds]

23:29 gcl has joined #panfrost

23:40 <alyssa> ...but short of a total rewrite I'm not sure how to benefit

23:41 <alyssa> Turns out - yes, there is a massive constant factor improvement possible by vectorizing the solver with bitsets

23:41 <alyssa> But I'd rather not be tied up debugging a new RA impl for the next indefinite

23:44 <alyssa> I guess shelving for if I figure out another ah-ha

23:46 <alyssa> But it would speed up shader-db considerably to fix

23:49 icecream95 has quit [Remote host closed the connection]