stikonas has quit [Remote host closed the connection]
<Lyude>
lvrp16: got my system, thanks a ton!
<lvrp16>
Lyude: speak of the devil
<lvrp16>
ask and your wish is granted by the USPS
<Lyude>
Hehe
<Lyude>
alyssa: ^ that's my rk3399
<alyssa>
Lyude: Awesomesauce :)
<anarsoul>
Lyude: so now you're working on midgard? :)
<tomeu>
o/
<hanetzer>
o/
<Lyude>
anarsoul: a bit! I really needed a reference system for it since a lot of the work that's getting done right now with winsys and others is happening there
<Lyude>
hanetzer: also re chromeos using xwayland: Huh.
<Lyude>
Huh.....
<Lyude>
so are they using glamor with gbm?
<hanetzer>
Lyude: unsure about the specifics, but if you install their linux-on-chromeos thing (it's some form of aarch64 VM, while the real system is compiled in 32-bit arm mode with some flags that make it work better on aarch64) and check the env, it's full of Wayland stuff :)
<tomeu>
hopefully I will find time to replace that custom driver with vsock and out-of-band buffer passing
<tomeu>
alyssa: what's the rationale for pre-baking stuff instead of waiting until we emit?
<tomeu>
it's being a bit problematic when trying to replace the GPU addresses that we are adding to the cmd stream with relocs
<tomeu>
if it was done all during the emit phase, we would have a single point of divergence between the DRM and non-DRM drivers
<tomeu>
having it all together in the same place would also help with readability
<tomeu>
alyssa: what do you think of moving closer to what other drivers do and have a panfrost_emit.c with all the emission code, with macros such as OUT_RING and OUT_RELOC?
<tomeu>
the non-DRM backend would emit a GPU address with OUT_RING, but the DRM one would instead emit an OUT_RELOC
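For reference, OUT_RING/OUT_RELOC are the command-stream helpers used by drivers such as nouveau and freedreno. A rough sketch of how the split tomeu is proposing could look; every name here (pan_stream, pan_reloc, panfrost_bo, PAN_USE_RELOCS) is invented for illustration, not actual panfrost code:

    #include <stdint.h>

    struct panfrost_bo {
            uint64_t gpu;      /* fixed GPU VA, only meaningful without relocs */
            uint32_t handle;   /* GEM handle, used by the DRM backend */
    };

    struct pan_reloc {
            unsigned idx;      /* dword index to patch at submit time */
            uint32_t handle;
            uint32_t offset;
    };

    struct pan_stream {
            uint32_t *map;     /* CPU mapping of the command buffer */
            unsigned cur;      /* next dword to write */
            struct pan_reloc relocs[256];   /* no overflow handling in this sketch */
            unsigned nr_relocs;
    };

    static inline void
    OUT_RING(struct pan_stream *s, uint32_t dword)
    {
            s->map[s->cur++] = dword;
    }

    static inline void
    OUT_RELOC(struct pan_stream *s, struct panfrost_bo *bo, uint32_t offset)
    {
    #ifdef PAN_USE_RELOCS
            /* DRM backend: record where the address goes; it gets patched
             * once the BO has actually been placed. */
            s->relocs[s->nr_relocs++] =
                    (struct pan_reloc){ s->cur, bo->handle, offset };
            OUT_RING(s, 0);    /* lo placeholder */
            OUT_RING(s, 0);    /* hi placeholder */
    #else
            /* Non-DRM backend: the BO already has a fixed GPU address. */
            OUT_RING(s, (uint32_t)(bo->gpu + offset));
            OUT_RING(s, (uint32_t)((bo->gpu + offset) >> 32));
    #endif
    }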
pH5 has quit [Quit: _]
cwabbott has quit [Quit: cwabbott]
cwabbott has joined #panfrost
<cwabbott>
tomeu: why do you want relocs at all? all the other major drivers have ditched them
<cwabbott>
intel is currently rewriting their driver so that they can do exactly what alyssa's code is doing
<tomeu>
cwabbott: oh, do you have any pointers to any discussions on this?
<tomeu>
well, or to code, I wasn't aware of the switch away from relocs
<cwabbott>
tomeu: anholt has a blog post on how he decided to go userptr-only for v3d
<cwabbott>
that kernel driver is upstream afaik
cwabbott has quit [Client Quit]
cwabbott has joined #panfrost
<cwabbott>
it turns out that relocs are a huge source of draw-time overhead, and extra complexity in the userspace driver
<cwabbott>
and on modern hardware with an mmu, that's almost never worth it
<cwabbott>
in terms of drivers to emulate, v3d is probably better than etnaviv or lima since it's newer, written by someone with experience, and also only has to deal with modern hw
<tomeu>
cwabbott: hmm, just checked emit_one_texture and cl_address returns a v3d_cl_reloc
<tomeu>
which is placed in the cmd stream instead of a gpu address
<cwabbott>
tomeu: I don't see that, there's no v3d_cl_reloc in the uapi
<tomeu>
hmm, so maybe relocations are resolved at a later stage in userspace instead of in the kernel?
<cwabbott>
from drm_v3d_create_bo: "Returned offset for the BO in the V3D address space. This offset is private to the DRM fd and is valid for the lifetime of the GEM handle."
<cwabbott>
tomeu: it might just be leftovers from vc4
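For context, this is roughly what the reloc-free model looks like from userspace on v3d: the kernel assigns the BO an address in the fd's private GPU address space at creation time, and userspace writes that offset straight into the command stream. A sketch based on the v3d uapi (the struct and ioctl are real, but the field details are from memory and the header path depends on your libdrm/kernel headers setup):

    #include <stdint.h>
    #include <xf86drm.h>
    #include "drm-uapi/v3d_drm.h"   /* as included in Mesa; path may vary */

    static int
    v3d_bo_create(int fd, uint32_t size, uint32_t *handle, uint32_t *gpu_offset)
    {
            struct drm_v3d_create_bo create = { .size = size };

            if (drmIoctl(fd, DRM_IOCTL_V3D_CREATE_BO, &create))
                    return -1;

            *handle = create.handle;
            /* "Returned offset for the BO in the V3D address space" -- valid
             * for the lifetime of the GEM handle, so it can be emitted into
             * the command stream directly, with no relocations. */
            *gpu_offset = create.offset;
            return 0;
    }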
<tomeu>
cwabbott: do you know what's used to provide isolation between processes?
<cwabbott>
tomeu: like midgard and above, each process has its own GPU page table
<cwabbott>
this is something pretty much all modern GPUs support
<tomeu>
awesome, lots of work just saved, thanks!
<cwabbott>
tomeu: btw, one kinda kernel-uapi-related thing I've come across: apparently, the program counter is only 24 bits (or something like that)
<cwabbott>
this means that every location for every instruction must be in the same 2^24 bits
<cwabbott>
*2^24 bytes
<cwabbott>
since otherwise it'll just wrap around
<cwabbott>
I'm not sure what it takes to run two programs with different upper 64-24 bits, but it might involve a cache flush or something like that
<cwabbott>
but the point is, the blob basically deals with this by allocating a whole aligned 2^24 bytes at once for shaders, and the kernel automagically aligns the allocation to 2^24 when the GPU execute permission is set
<cwabbott>
then anything allocated within that 2^24 byte pool has the same upper bits
<cwabbott>
I'm not sure how you want to deal with it, but panfrost userspace and/or kernel has to deal with it somehow
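A minimal sketch of the "allocate one aligned 2^24-byte pool for all shaders" scheme cwabbott describes; the pool and allocator names are invented for illustration, only the alignment math is the point:

    #include <stdint.h>

    #define SHADER_POOL_SIZE  (1u << 24)   /* 16 MiB: the 24-bit PC wraps past this */

    struct shader_pool {
            uint64_t gpu_base;   /* must itself be 2^24-byte aligned */
            uint32_t offset;     /* trivial bump allocator */
    };

    /* Returns the GPU address for the shader, or 0 if the pool is full.
     * align must be a nonzero power of two. Because gpu_base is
     * 2^24-aligned and offset stays below 2^24, every shader in the pool
     * shares the same upper address bits, so the 24-bit program counter
     * never wraps into a different region. */
    static uint64_t
    shader_pool_alloc(struct shader_pool *pool, uint32_t size, uint32_t align)
    {
            uint32_t offset = (pool->offset + align - 1) & ~(align - 1);

            if (offset + size > SHADER_POOL_SIZE)
                    return 0;

            pool->offset = offset + size;
            return pool->gpu_base + offset;
    }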
pH5 has joined #panfrost
<tomeu>
ok, I see
<tomeu>
wonder if that's different for all the other GPUs supported in mesa
<cwabbott>
tomeu: I've never seen that on another GPU
<cwabbott>
what I have seen, is something similar on Intel where there are different base addresses, and everything has to be within a fixed small distance of the base address
<cwabbott>
but that's better than what ARM does because there's no alignment restriction
<cwabbott>
this just seems like a suckier version of the base address concept
<cwabbott>
the motivation is similar, they want to make the instruction cache and program counters smaller, but the execution is much worse
<tomeu>
yeah
<cwabbott>
I can't think of a better way to handle it than allocating an aligned 2^24 bytes for shaders
<cwabbott>
although, maybe that alignment should be explicit in the uapi instead of added automatically for executable allocations
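Purely as an illustration of "making the alignment explicit in the uapi" rather than implied by an executable flag, a hypothetical BO-creation struct; the actual panfrost uapi was not settled at this point and may look nothing like this:

    /* Hypothetical, for illustration only. */
    struct drm_panfrost_create_bo_sketch {
            uint32_t size;
            uint32_t flags;      /* e.g. an EXECUTABLE flag */
            uint32_t handle;     /* returned GEM handle */
            uint32_t pad;
            uint64_t align;      /* explicit alignment requested by userspace
                                    (2^24 for shader pools) instead of being
                                    implied by the EXECUTABLE flag */
            uint64_t offset;     /* returned GPU VA chosen by the kernel */
    };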
<alyssa>
Which means the prebaking is taking full advantage of Gallium capabilities and is the Right way to handle it for our hardware; OUT_RING etc would be a regression in cleanliness and performance
<alyssa>
tomeu: And yeah, as cwabbott says we have a full MMU so it's fine :)
<alyssa>
narmstrong: Nice nice
BenG83 has quit [Ping timeout: 250 seconds]
pH5 has quit [Quit: bye]
TheKit has quit [Remote host closed the connection]