#panfrost on 2019-08-12 — irc logs at freenode.irclog.whitequark.org

2019-02-15 17:52 alyssa changed the topic of #panfrost to: Panfrost - FLOSS Mali Midgard & Bifrost - https://gitlab.freedesktop.org/panfrost - Logs https://freenode.irclog.whitequark.org/panfrost - <daniels> avoiding X is a huge feature

00:28 megi has quit [Ping timeout: 245 seconds]

01:05 vstehle has quit [Ping timeout: 244 seconds]

01:17 tgall_foo has joined #panfrost

01:17 tgall_foo has quit [Client Quit]

01:37 tgall_foo has joined #panfrost

04:12 davidlt has joined #panfrost

04:21 davidlt has quit [Remote host closed the connection]

05:00 vstehle has joined #panfrost

05:01 marcodiego has quit [Quit: Leaving]

06:07 NeuroScr_ has quit [Quit: NeuroScr_]

06:23 jernej has joined #panfrost

06:24 somy has joined #panfrost

07:09 megi has joined #panfrost

08:02 pH5 has joined #panfrost

08:42 vstehle has quit [Remote host closed the connection]

08:42 TheCycoTWO has quit [*.net *.split]

08:50 _whitelogger has joined #panfrost

09:39 raster has joined #panfrost

09:48 adjtm_ has quit [Ping timeout: 245 seconds]

10:22 chewitt has quit [Quit: Adios!]

11:29 _whitelogger has joined #panfrost

11:35 adjtm_ has joined #panfrost

11:42 Kwiboo has quit [Quit: .]

11:44 Kwiboo has joined #panfrost

12:14 narmstrong has quit [Remote host closed the connection]

12:14 narmstrong has joined #panfrost

12:17 <bbrezillon> robher, alyssa: tomeu suggested that I start looking at the SAME_VA feature (CPU/GPU address space sharing), is this something you already looked at?

12:18 <bbrezillon> also, do you have examples of use cases for that? (I see it's useful for OpenCL SVM stuff but was wondering if there was any other potential users)

12:26 Elpaulo1 has joined #panfrost

12:26 Elpaulo has quit [Ping timeout: 245 seconds]

12:26 Elpaulo1 is now known as Elpaulo

12:32 <tomeu> bbrezillon: not sure if SAME_VA is related, but what we need for vulkan is that userspace is able to allocate virtual addresses

12:33 <tomeu> regarding what is needed from the kernel, I guess you can look at how v3d and fdo do it

12:45 <pH5> bbrezillon: not sure if this helps at all, but lynxeye_ implemented this for etnaviv recently: https://patchwork.freedesktop.org/series/64970/

12:59 <bbrezillon> tomeu: hm, I'm not sure yet, but IIUC mali_kbase is more restrictive => the GPU and CPU VA are the same

13:00 <bbrezillon> pH5: thanks for the link

13:00 <tomeu> yeah, I think that Arm's OpenCL supports SVM on some GPUs

13:01 <bbrezillon> this being said, it doesn't seem to be a HW restriction

13:02 afaerber has quit [Quit: Leaving]

13:15 <robher> bbrezillon: there's been some discussion about it on the list between Steven and me.

13:16 <robher> bbrezillon: the h/w restriction is how to support it on GPUs with less VA size than the CPU.

13:20 <robher> Neither fdo nor v3d support this.

13:22 <tomeu> robher: don't support SAME_VA on armhf?

13:23 <robher> tomeu: armhf is an easy case. It's 64-bit userspace with a 32-bit GPU VA that's the problem. T720 IIRC is one.

13:24 <tomeu> ah, of course

13:24 <bbrezillon> robher: I guess there's no way to restrict the mmap() address space

13:25 <robher> Implementation wise, I tend to think we should copy kbase and set the GPU VA when a BO is mmapped. If we set the GPU VA up front, we should use the kernel's address range as that won't overlap with userspace addresses.

13:26 <robher> bbrezillon: I don't know. That was something to investigate. I would think you can in mmap calls.

13:27 <bbrezillon> robher: ok, the other question is, is there a strong reason to keep this GPU_VA = CPU_VA limitation in our uAPI ?

13:28 <bbrezillon> I mean, userspace can pass the CPU VA if it needs those VA to match, and we can reject the operation if the passed VA is outside the supported VA range

13:30 <bbrezillon> that still leaves the problem of restricting the CPU VA address space when mmap() is called on BO that's expected to have matching GPU and CPU VA

13:31 * bbrezillon needs to look at the fdo and v3d implems

13:36 chewitt has joined #panfrost

13:43 <robher> bbrezillon: those don't have an implementation...

13:45 <bbrezillon> robher: I wonder what tomeu was referring to when he suggested to look at how fdo and v3d do it

13:45 <bbrezillon> ?

13:46 <bbrezillon> and indeed, I see nothing like that in v3d

13:46 <tomeu> I thought they did

13:46 <robher> must have been assuming they do already...

13:47 <bbrezillon> so it looks like the only driver that's (soon) implementing something close to what we want is etnaviv

13:47 <tomeu> I remember seeing some code in mesa related to this

13:47 <tomeu> maybe only iris?

13:47 <robher> bbrezillon: lima does I think.

13:47 <robher> and amd

13:49 <robher> both of those just have an ioctl to set the GPU VA and do all the address space management in userspace (i.e. implement drm_mm there).

13:49 megi has quit [Quit: WeeChat 2.5]

13:56 <bbrezillon> robher: I see

13:59 <bbrezillon> robher: actually, I can't find the lima ioctl doing that in drm-misc-next

14:00 <bbrezillon> there's DRM_IOCTL_LIMA_GEM_INFO which returns the GPU VA

14:01 <bbrezillon> but nothing to set it AFAICT

14:02 <robher> bbrezillon: must have gotten removed in review. I remember discussing that...

14:34 warpme_ has joined #panfrost

15:13 warpme_ has quit [Quit: warpme_]

15:28 warpme_ has joined #panfrost

15:37 <alyssa> Okay what actually am I missing

15:37 * alyssa has been trying to debug MRT for an hour now

15:37 <alyssa> cyrozap: Nice :)

15:38 warpme_ has quit [Quit: warpme_]

15:38 <alyssa> robher: Wait a minute. There's such a thing as a 32-bit GPU VA?

15:38 <alyssa> That.... changes things....

15:39 <robher> alyssa: I believe the address size is still 64-bits, but only 31 or 32 address bits are present.

15:40 <alyssa> Ook.

15:43 <tomeu> hehe

15:43 <alyssa> tomeu: What was the result of your OpenCL experiment? :P

15:44 <tomeu> alyssa: my main finding is that it will be all great when ML people agree to stop reinventing build systems and distros

15:44 <alyssa> :p

15:45 <tomeu> I was able to run a simple sanity check test from plaidml on clover after I hacked up support for lowering shared variables with explicit io

15:46 <alyssa> Ah?

15:46 <alyssa> tomeu: (Was this with the SSBO branch?)

15:46 <EmilKarlson> Master Lizard people?

15:46 <alyssa> EmilKarlson: sssss

15:46 <tomeu> I wanted to try to do some inference next, but realized that half of the python files are out of sync with the other half, because of finding #1

15:46 <tomeu> alyssa: ah no, this is on tegra with novueau

15:47 <alyssa> Oh :-(

15:47 <tomeu> so I'm now waiting for pip to rebuild all the python modules that debian had already installed...

15:47 <tomeu> alyssa: should I try with panfrost? should be quite easy

15:47 <alyssa> tomeu: Probably won't work

15:48 <alyssa> But once I land SSBOs, might be fun

15:48 <alyssa> dEQP-GLES31 compute shader tests are starting to pass on that branch

15:51 <alyssa> tomeu: Oh this is interesting

15:52 <alyssa> I accidentally stumbled upon a mode where the hw does wallpapering itself.

15:52 <alyssa> Which is dramatically faster than us doing it

15:52 <alyssa> Except it's slightly broken

15:54 <tomeu> hehe

15:54 <alyssa> tomeu: Chicken bit?

15:55 <alyssa> tomeu: Thing is, it's just -so- close to being functional

15:57 <alyssa> Makes me wonder if it could work

15:59 <alyssa> tomeu: https://people.collabora.com/~alyssa/0001-Chickenbit.patch

16:00 warpme_ has joined #panfrost

16:01 <alyssa> What I don't know is where it's loading from --- the framebuffer itself (in memory) or off the tilebuffer

16:01 <alyssa> But I think memory

16:01 <alyssa> Just... buggy around the borders..

16:03 <tomeu> like always :)

16:03 <alyssa> tomeu: Yeah.... any idea?

16:03 * alyssa wondering if you saw the same bug when bringing up the u-blitter path

16:04 <tomeu> no, didn't see such a problem there

16:04 <alyssa> Hm

16:04 <tomeu> hrm

16:04 <tomeu> is the final result correct when the blob enables it?

16:04 <alyssa> tomeu: Symptom in weston is little black lines in places where the tile is being reloaded

16:04 <alyssa> The blob doesn't enable it.

16:04 <alyssa> (The blob does the wallpaper method like us.)

16:04 <tomeu> ah, thought you managed to find a case in which it did

16:04 <alyssa> Which makes me think it's a chicken bit for something they thought they could get working in hw and they couldn't.

16:05 <tomeu> right

16:05 <tomeu> a proper chicken bit :)

16:13 sravn has quit [Read error: Connection reset by peer]

16:35 pH5 has quit [Quit: bye]

16:37 sravn has joined #panfrost

16:45 warpme_ has quit [Quit: warpme_]

17:08 pH5 has joined #panfrost

17:17 megi has joined #panfrost

17:31 adjtm_ has quit [Ping timeout: 246 seconds]

17:37 raster has quit [Remote host closed the connection]

18:35 <shadeslayer> mmhhh ... got a panfrost crash https://paste.ubuntu.com/p/tyhnxP6jMz/

18:40 <alyssa> Memory corruption....

18:40 <alyssa> try valgrind

18:42 stikonas has joined #panfrost

18:45 <shadeslayer> *twiddles thumbs waiting for plasma to finish*

18:46 warpme_ has joined #panfrost

18:54 warpme_ has quit [Quit: warpme_]

19:09 warpme_ has joined #panfrost

19:11 <shadeslayer> still waiting for it to crash

19:11 <shadeslayer> or show something on the screen

19:20 <alyssa> ......?

19:26 <shadeslayer> alyssa: nothing :S

19:27 <shadeslayer> alyssa: https://paste.ubuntu.com/p/4PN4kT87Vz/

19:33 <shadeslayer> let me start it again

19:35 <shadeslayer> aaaanndd no crash now

19:38 warpme_ has quit [Read error: Connection timed out]

19:44 <shadeslayer> alyssa: idk what happened, but no longer crashes

19:46 <shadeslayer> ooh, it just crashed plasmashell: ../src/gallium/auxiliary/util/u_inlines.h:87: pipe_reference_described: Assertion `count != -1' failed.

19:52 <shadeslayer> alyssa: https://paste.ubuntu.com/p/4yP8kZpZsK/ different crash I suppose

19:58 <alyssa> shadeslayer: Could you figure out which BO is being unreferenced?

19:58 <alyssa> That way we can figure out where we messed up the refcnt.

19:58 <alyssa> reference count

20:01 <shadeslayer> alyssa: I think I'd have to rebuild without optimization to get that

20:07 <alyssa> shadeslayer: Hm, maybe try sprinkling in some prints to bo_reference/unreference and create_resource?

20:07 <alyssa> Try to match the addresses of the BO with what they do, etc

20:09 <shadeslayer> I'm re compiling without optimization :)

20:13 afaerber has joined #panfrost

20:14 <shadeslayer> oh lovely, now I get asserts for safe_iterator ../src/panfrost/midgard/midgard_schedule.c:578:schedule_block: Assertion `ins == __next && "use _safe iterator"' failed.

20:16 pH5 has quit [Ping timeout: 268 seconds]

20:22 pH5 has joined #panfrost

20:29 afaerber_ has joined #panfrost

20:30 afaerber has quit [Ping timeout: 264 seconds]

20:33 <shadeslayer> alyssa:

20:33 <shadeslayer> (gdb) print *bo

20:33 <shadeslayer> $4 = {link = {prev = 0xffff9c63f100, next = 0xaaaad42b7690}, reference = {count = -1}, cpu = 0x0, gpu = 287248384, size = 33280, gem_handle = 106, flags = 4}

20:42 JuJu has quit [Ping timeout: 245 seconds]

20:52 <shadeslayer> I'll take a look tomorrow, night!

21:03 JaceAlvejetti has quit [Ping timeout: 250 seconds]

21:03 JaceAlvejetti_ has joined #panfrost

21:16 NeuroScr has joined #panfrost

21:22 davidlt has joined #panfrost

21:31 adjtm_ has joined #panfrost

22:01 davidlt has quit [Ping timeout: 258 seconds]

22:01 raster has joined #panfrost

22:09 stikonas has quit [Read error: Connection reset by peer]

22:10 stikonas has joined #panfrost

23:00 afaerber_ has quit [Ping timeout: 264 seconds]

23:15 afaerber_ has joined #panfrost

23:45 raster has quit [Remote host closed the connection]

23:56 stikonas has quit [Remote host closed the connection]