#panfrost on 2019-03-14 — irc logs at freenode.irclog.whitequark.org

2019-02-15 17:52 alyssa changed the topic of #panfrost to: Panfrost - FLOSS Mali Midgard & Bifrost - https://gitlab.freedesktop.org/panfrost - Logs https://freenode.irclog.whitequark.org/panfrost - <daniels> avoiding X is a huge feature

00:18 <alyssa_> Why does the entire kernel need to recompile gah

00:18 <alyssa_> Do I have ccache on this machine

00:21 <alyssa_> Guess I should try one of the faster linkers while I'm at it (does lld work with the kernel these days?)

01:24 shenghaoyang has joined #panfrost

01:25 stikonas has quit [Remote host closed the connection]

01:32 BenG83 has quit [Quit: Leaving]

01:59 <shenghaoyang> got the kms driver running on a C101PA (t860) with lots of DATA_INVALID_FAULT s :(

01:59 <alyssa_> shenghaoyang: Any chance you can give advice for the magic touch for getting mainline kernels to boot? :V

02:01 <shenghaoyang> alyssa_: I ripped the PKGBUILD from ArchLinuxARM for mainline linux and pointed it to the panfrost tree *major hack*

02:01 <alyssa_> shenghaoyang: Hey, me too! Still not booting

03:06 <alyssa_> Wow, this patch ought to work.. if only I had a way to boot it...

03:29 shenghaoyang has quit [Remote host closed the connection]

04:01 <alyssa_> In the mean time, I'll be working on AFBC userspace stuff so I'll be able to use the patch when it's ready

04:01 <HdkR> :D

04:01 <alyssa_> Should lead to a perf boost across the board

04:02 <HdkR> Which board?

04:02 <alyssa_> :P

04:02 <HdkR> :D

04:02 <HdkR> across the boards

04:04 <alyssa_> HdkR: Defining a static function in a header is fine right

04:05 <HdkR> sure

04:42 <alyssa_> robher: Playing with the DRM driver for the first time... I'm very impressed. Nice work! D:

04:42 <alyssa_> Er :D

04:46 <alyssa_> Trying to figure out why -brefract wouldn't work

04:46 <alyssa_> When -bshadow is ok

04:50 <alyssa_> Okay, huh, I have a patch that fixes -brefract with DRM driver but we still get flooded with MMU faults for some reason?

05:00 <alyssa_> (set surf to NULL if is_scanout)

05:18 tomeu has joined #panfrost

05:26 <tomeu> o/

07:06 shenghaoyang_ has joined #panfrost

07:11 <tomeu> robher: have come up with a simple clear job for igt: https://gitlab.freedesktop.org/tomeu/igt-gpu-tools/tree/panfrost

07:12 <tomeu> but it faults right away when trying to read the first byte of the job descriptor

07:12 <tomeu> I'm out of time this week to look at it, but thought I would push in case it could be useful to you

07:30 MoeIcenowy has quit [Quit: ZNC 1.6.5+deb1+deb9u1 - http://znc.in]

07:30 MoeIcenowy has joined #panfrost

08:06 shenghaoyang_ has quit [Remote host closed the connection]

08:39 shenghaoyang_ has joined #panfrost

08:39 shenghaoyang_ has quit [Remote host closed the connection]

09:18 shenghaoyang_ has joined #panfrost

09:47 shenghaoyang_ has quit [Remote host closed the connection]

10:36 memeka has left #panfrost [#panfrost]

13:18 rhyskidd has joined #panfrost

13:56 griffinp has quit [Quit: ZNC - http://znc.in]

14:25 rhyskidd has quit [Quit: rhyskidd]

14:31 afaerber has quit [Quit: Leaving]

14:45 afaerber has joined #panfrost

15:33 jernej has joined #panfrost

15:34 <robher> alyssa_, tomeu: what have I missed for todo list? https://www.irccloud.com/pastebin/bEY34VU5/

15:35 <robher> 9. Testing on other Midgard variants. Only T860 is tested.

16:01 <anarsoul|2> frequency scaling? (I'm not sure if it's done in software on midgard/bifrost)

16:04 <robher> anarsoul|2: yes. devfreq and thermal support.

16:04 <cwabbott> robher: you missed some equivalent for GROW_ON_GPF

16:04 <robher> cwabbott: #6

16:06 <cwabbott> that's different from just resizing the heap from userspace, since userspace has no idea how much memory will be required, the tiler might need more memory on-the-fly

16:06 <cwabbott> so for the purposes of the tiler heap, resizing from userspace won't suffice

16:08 BenG83 has joined #panfrost

17:24 pH5 has joined #panfrost

17:46 <robher> cwabbott: so tiler heap and userspace heap are 2 different things, but both are needed?

17:58 shenghaoyang_ has joined #panfrost

18:04 stikonas has joined #panfrost

18:28 shenghaoyang_ has quit [Remote host closed the connection]

19:39 afaerber has quit [Quit: Leaving]

20:17 <alyssa_> robher: Yup

20:34 <cwabbott> robher: if by userspace heap you just mean the part of the driver that allocates/reuses buffers, then yeah

20:37 <cwabbott> the tiler is responsible for taking in a list of triangles from the vertex pipeline and then returning for each bin/tile a list of triangles that may intersect it (and thus have to be rasterized for that tile)

20:37 <cwabbott> in order to do that, it builds up a datastructure in a chunk of memory handed to it by the driver called the tiler heap

20:38 afaerber has joined #panfrost

20:40 <cwabbott> the actual size of the datastructure is going to depend on the number of triangles (known beforehand, unless geometry/tess shaders are in the mix) and which tiles each triangle overlaps (unknown beforehand)

20:42 <cwabbott> so, the way things currently work, the userspace driver allocates the tiler heap as GROW_ON_GPF, reserving some initial space based on its guesstimate of how much it'll take, and then when the tiler tries to write to something that's not mapped the kernel will allocate a new page on demand

20:43 <robher> cwabbott: If you say driver, I hear kernel.

20:44 <cwabbott> robher: ah, I guess in the first sentence I meant the userspace driver

20:44 <robher> I think from the start we've been talking about the same thing.

20:45 <cwabbott> maybe?

20:48 <cwabbott> I hadn't heard about madvise() before, but I guess you could use it for MADV_WILLNEED to let userspace give its guesstimate for the size of the tiler heap

20:50 <cwabbott> I don't think the shrinker is really relevant here, since only the device itself knows when some memory isn't going to be used

20:50 <robher> cwabbott: Backing up. So currently, you can allocate a BO. You give it a size. It is an internal (to the kernel) implementation detail that we pin all of the memory at the start. Item 6 is to stop doing that and pin pages on faults. Then you can allocate 50G and the memory usage is whatever you touch.

20:50 <cwabbott> robher: right... although for most things you won't want/need that

20:51 <cwabbott> I don't even know if they support restarting after page faults for anything but the tiler

20:51 <robher> Now, maybe we'll want to hint to the kernel whether to pin all the pages or not.

20:52 <robher> Most drivers don't pin pages up front.

20:54 <cwabbott> do they? I was under the impression that restarting after a page fault is a relatively new HW feature that most don't support yet

20:54 <robher> They probably pin them on submit.

20:55 <cwabbott> yeah, right

20:55 <cwabbott> I guess that's kind of a separate question

20:55 <robher> If you don't have swap, then it doesn't help to not pin pages.

20:56 <cwabbott> right...

20:56 <cwabbott> and the cpu overhead for keeping track of everything kinda sucks

20:59 <cwabbott> I guess all I wanted to say is that the "only allocate on page fault" behavior is definitely needed for the tiler heap

21:15 <narmstrong> robher: sorry by your edid looks severely broken... and it seems the dw-hdmi i2c code doesn’t handle nacks nor timeouts :-/

21:16 stikonas has quit [Remote host closed the connection]

21:17 <robher> narmstrong: it wasn't broken in 5.0...

21:17 <robher> but is a no name panel...

21:17 <narmstrong> robher: yep because we didn’t handle scdc, broken edid is a complex issue

21:18 <robher> there aren't exactly any small (11") brand name panels.

21:18 <narmstrong> Adding a quirk is the only acceptable solution

21:19 <narmstrong> I’ll propose a solution on the list to trigger a discution on the issue

21:21 <narmstrong> robher: can you run an i2cdetect on the i2c used by the hdmi link ? To confirm the scdc slave address is really not present

21:23 <robher> narmstrong: https://www.irccloud.com/pastebin/sEJvidWS/

21:24 <narmstrong> Damn the scdc slave address is present, this is really weird

21:25 <narmstrong> 0x54

21:26 <narmstrong> Thanks for checking, I’ll try to propose a fix or quirk tomorrow

21:26 <robher> narmstrong: thanks for digging into it.

21:27 <alyssa_> robher: List + the stuff mentioned in here is pretty accurate, I think

21:28 <alyssa_> To recap the GROW_ON_GPF stuff:

21:28 <alyssa_> Most buffers we allocate as normal BOs. It's an implementation detail when that gets pinned. Same as any other driver.

21:29 <alyssa_> A few "special" buffers are explicitly allocated as unpinned (except for the first N pages). That will expand _while the job is running_, in the middle, in response to a page fault. This can cause a stall, so it's used infrequently, but for GPu internal structures, it's needed.

21:33 Elpaulo has joined #panfrost

21:34 <robher> alyssa_: when do we unpin pages?

21:35 <alyssa_> robher: For normal buffers, when userspace calls free or whatever

21:35 <alyssa_> For special buffers, we don't.

22:26 <robher> alyssa_: so when we OOM, we do what? There's little point in pinning on demand if we never unpin.

22:27 * robher afk

22:27 <alyssa_> robher: No, there is a point, since most of the time, only a small fraction of the buffer will actually be accessed by the GPU (=pinned)

22:27 <alyssa_> We do a worst case allocation of 128MB, for example...

22:27 <alyssa_> If we're just running es2gears, maybe 1MB will ever get pinned (and then freed when the process dies)

22:28 <alyssa_> The other 127MB is reserved virtual memory but not backed by any physical pages

22:29 <alyssa_> If we're running STK, maybe 64MB will end up in use due to a really geometry heavy scene (and freed when we quit the game). Well, okay, the other 64MB is still free physically

22:29 <alyssa_> Memory usage is lowered from (worst_case * number of apps) to (sum{apps} (worst_case_for_given_app))

22:38 stikonas has joined #panfrost

22:40 stikonas has quit [Remote host closed the connection]

23:36 <alyssa_> Fun and games with AFBC!

23:38 <alyssa_> The goal is to have AFBC surfaces all the way down

23:38 <alyssa_> Quite a bit of work for that, but I'm already neck deep in it so

23:39 <alyssa_> First step is importing/exporting AFBC BOs

23:41 <alyssa_> I don't think this should be terribly hard..?