#lima on 2019-09-17 — irc logs at freenode.irclog.whitequark.org

2019-07-03 10:24 ChanServ changed the topic of #lima to: Development channel for open source lima driver for ARM Mali4** GPUs - Kernel has landed in mainline, userspace driver is part of mesa - Logs at https://people.freedesktop.org/~cbrill/dri-log/index.php?channel=lima and https://freenode.irclog.whitequark.org/lima - Contact ARM for binary driver support!

00:11 jrmuizel has joined #lima

00:56 nerdboy has joined #lima

00:57 * nerdboy checks off another chromebook soc for panfrost/drm functionality

00:59 <anarsoul> wrong channel? :)

00:59 <nerdboy> depends on model/gpu?

01:00 <nerdboy> even snow has up-scaled gpu compared to pine64

01:01 * nerdboy still figuring out which hardware has what kind of "frost"

01:02 <nerdboy> also break my habit of saying "lima" for mali hardware

01:14 <libv> if only had not tried to stop things by mentioning the word "trademark" after they had seen my first proposal.

01:14 <libv> if only ARM, even

01:31 jrmuizel has quit [Remote host closed the connection]

01:34 camus has joined #lima

01:38 kaspter has quit [Ping timeout: 276 seconds]

01:38 camus is now known as kaspter

01:43 jrmuizel has joined #lima

01:52 kaspter has quit [Ping timeout: 265 seconds]

01:58 yuq825 has joined #lima

02:02 kaspter has joined #lima

02:19 romainmahoux[m] has quit [Read error: Connection reset by peer]

02:19 Danct12 has quit [Write error: Connection reset by peer]

02:19 z3ntu has quit [Read error: Connection reset by peer]

02:19 bshah|matrix has quit [Read error: Connection reset by peer]

02:21 dllud has quit [Ping timeout: 245 seconds]

02:27 romainmahoux[m] has joined #lima

02:29 yuq825 has quit [Remote host closed the connection]

02:29 yuq825 has joined #lima

02:30 dllud has joined #lima

02:41 <MoeIcenowy> anarsoul: maybe there's still rendering issues

02:41 <MoeIcenowy> on X11 some parts of gtk3-demo cannot be rendered correctly

02:42 <MoeIcenowy> (or maybe it's because X11 is spamming us?

02:42 <anarsoul> likely there's still some issues :)

02:42 <anarsoul> we're not passing neither piglit nor deqp completely

02:43 <MoeIcenowy> I am considering to deploy a Pine A64-LTS as piglit playground

02:43 <anarsoul> MoeIcenowy: you made another typo

02:44 <anarsoul> "reset scissor state *if* scissor test is disabled"

02:44 <anarsoul> also add my and Qiang's r-b tags

02:45 <MoeIcenowy> oooooooops

02:50 gtucker has quit [*.net *.split]

03:00 bshah|matrix has joined #lima

03:00 z3ntu has joined #lima

03:00 Danct12 has joined #lima

03:01 <anarsoul> MoeIcenowy: I think it's probably better to set up deqp

03:01 * anarsoul is going to try it this weekend

03:04 nerdboy has quit [Ping timeout: 240 seconds]

03:05 nerdboy has joined #lima

03:12 kaspter has quit [Ping timeout: 240 seconds]

03:23 kaspter has joined #lima

03:24 gtucker has joined #lima

03:28 dllud has quit [Ping timeout: 245 seconds]

03:34 dllud has joined #lima

03:45 nerdboy has quit [Ping timeout: 264 seconds]

04:16 <anarsoul> yuq825: enunes: rellla: any comments on https://gitlab.freedesktop.org/mesa/mesa/merge_requests/1903 ?

04:16 <anarsoul> yuq825: also what's left for https://gitlab.freedesktop.org/mesa/mesa/merge_requests/1763 ? Why it's still in WIP?

04:52 <yuq825> for 1763, still need some test on weaker CPU like H3

04:53 <yuq825> that's for find a balanced threshold to re-generate PP stream

04:57 <anarsoul> isn't it faster than A64?

05:05 Barada has joined #lima

05:19 megi has quit [Ping timeout: 276 seconds]

05:36 Barada has quit [Quit: Barada]

05:40 Barada has joined #lima

05:41 <yuq825> I tested 1763 on amlogic s905x, haven't done on H3

05:43 <anarsoul> I see

05:44 <yuq825> I did test an early version of 1763 on H3, and see some flicker, I guess due to the regenerate pp stream threshold, but recently there's some fix into mesa, so need to test on H3/A64 with 1763 again

05:53 <anarsoul|c> Flickering can be due missing fence

05:55 <anarsoul|c> So if we do the same for scissors it should greatly improve performance of X11

05:56 <anarsoul|c> Currently it suffers due to FB reload, I tried to disable it just to check if it fixes cursor lag - and it does. But unfortunately it makes everything but cursor black.

05:57 <anarsoul|c> Well, everything but updated parts of screen

06:06 Barada has quit [Quit: Barada]

06:07 <yuq825> yes, reload with small scissor like cursor could be optimized like this

06:09 Barada has joined #lima

06:10 dddddd has quit [Remote host closed the connection]

06:10 Barada has quit [Client Quit]

06:12 Barada has joined #lima

06:31 Barada has quit [Quit: Barada]

06:57 tlwoerner has quit [Ping timeout: 268 seconds]

07:01 tlwoerner has joined #lima

07:25 Barada has joined #lima

07:43 <narmstrong> anarsoul: tomeu is adding CI for panfrost, I'll add the necessary to run lima aswell

07:55 mardestan has joined #lima

08:32 <mardestan> does someone know how many vector registers mali 400mp gpu has?

08:42 <mardestan> it seems 6 for normal and 6 for pipelined so 12 alltogether.

08:58 Barada has quit [Quit: Barada]

08:58 <mardestan> some amount of sm3.0 hardware temporaries

09:02 <mardestan> libv: sorry I will leave soon, as everything has been said allready, however i am thinking how can utgard deal with dependencies/hazards?

09:25 <mardestan> seems like hardware handles them, but very confused about the number of registers 6*(128/4) does not really equal sm3.0 spec

09:32 <mardestan> oops 6*(128/4/4) i meant, it comes up as 48 but should be 32

09:46 <MoeIcenowy> yuq825: I think you have some A64?

09:47 <MoeIcenowy> just set A64 to the lowest frequency

09:51 <mardestan> I am fed up anyways about talking to myself, i think i do not want to help you retarded guys anymore.

09:51 <mardestan> i think it is bye by me. You are fucking stupid idiots imo.

09:51 mardestan has quit [Quit: Leaving]

10:22 Barada has joined #lima

10:32 yuq825 has quit [Quit: Leaving.]

11:05 mrueg has quit [Quit: http://quassel-irc.org - Chat comfortably. Anywhere.]

11:09 mrueg has joined #lima

11:25 tlwoerner has quit [Ping timeout: 265 seconds]

11:46 megi has joined #lima

12:15 dddddd has joined #lima

12:59 jrmuizel has joined #lima

13:05 jrmuizel has quit [Remote host closed the connection]

13:08 adjtm has quit [Ping timeout: 276 seconds]

13:11 jrmuizel has joined #lima

13:16 jrmuizel has quit [Ping timeout: 268 seconds]

13:32 tlwoerner has joined #lima

13:52 <MoeIcenowy> anarsoul: the next blocker to use lima on desktop seems to be `gp error irq state=400000` ?

13:54 jrmuizel has joined #lima

14:02 Barada has quit [Quit: Barada]

14:07 adjtm has joined #lima

14:57 <anarsoul> MoeIcenowy: yes

15:14 <MoeIcenowy> BTW, does ARM driver has interface for userspace to know kernel-space error info?

15:16 <anarsoul> no idea

15:19 <Tofe> I wonder, for the scissors for example, the driver always talks in fb coordinates ? it doesn't depend on the application's window buffer or something like that?

15:22 <anarsoul> Tofe: fb is actually a surface that it renders into

15:22 <anarsoul> it's not /dev/fb*

15:22 <Tofe> ah ok, I misunderstood the code, good to know

15:28 * bshah closes #101 issue finally \o/

15:41 <rellla> anarsoul: is https://gitlab.freedesktop.org/lima/mali-syscall-tracker still the tool to get a dump from the blob?

15:41 <anarsoul> yeah

15:42 <anarsoul> and it's not very convenient, you'll have to parse dumps manually

15:42 <anarsoul> i.e. good enough for dumping texture descriptor

15:42 <anarsoul> but not so good for analyzing command stream

15:43 <rellla> i just want to take a quick look at the outstanding texture things...

15:43 <anarsoul> ah, OK

15:43 <anarsoul> that would be cubemap

15:44 <anarsoul> I have the dump if you need it :)

15:44 <rellla> is there any info to rect anywhere already?

15:44 <anarsoul> rect is not supported in GLES

15:44 <anarsoul> you'll have to emulate it

15:44 <rellla> ah

15:45 <rellla> that means treat it as 2d with no mipmap?

15:46 <anarsoul> it also has coordinates in texels and not in [0,1]

15:46 <rellla> https://elixir.bootlin.com/mesa/mesa-19.2.0-rc1/source/src/gallium/docs/source/resources.rst#L100

15:47 <rellla> yes

15:48 <anarsoul> also we can't support rect textures larger than 1024x1024

15:48 <anarsoul> we can't sample them accurately enough

15:49 <anarsoul> btw, we don't support GL 3.1, can we just disable GL_ARB_texture_rectangle extension?

15:49 <rellla> first i have to find out, where the necessary bits have to be added ...

15:50 <anarsoul> check git log for panfrost

15:50 <anarsoul> they had it few months back till they figured out how to do that in hw

15:50 <rellla> ok, emulating rect you mean?

16:02 <MoeIcenowy> anarsoul: we cant

16:02 <MoeIcenowy> cogl/clutter seems to expect it

16:45 <anarsoul> MoeIcenowy: fix cogl/clutter?

16:45 <anarsoul> we can't do it in hardware

16:45 <anarsoul> emulating it requires manipulation with texture coordinates

16:46 <anarsoul> and it automatically brings down precision to fp16

16:46 armessia has joined #lima

16:46 <armessia> Hi guys

16:46 <armessia> I've been looking into adding cubemap support the past few days

16:47 <armessia> Have some basic example working

16:47 <armessia> Will push my branch later today so you can have a look

16:51 <anarsoul> armessia: nice!

16:52 <MoeIcenowy> anarsoul: I still think we should emulate it

16:52 <anarsoul> MoeIcenowy: it's useless

16:53 <anarsoul> if it tries to sample from texture that is larger than 1024 in any dimension rendering will be incorrect

16:53 <MoeIcenowy> To be honest, my reason to use mainline driver is to prevent application changing

16:53 <anarsoul> MoeIcenowy: it's hardware limitation

16:54 <armessia> anarsoul: tnx :-)

16:54 <MoeIcenowy> but if the application do not go beyond 1024

16:54 <MoeIcenowy> the emulation is still useful

16:54 <anarsoul> yeah, and we'll get reports that lima misrenders something

16:54 <armessia> The case where the coordinates are coming from register won't work yet, but if the come straight from varying it does/should

16:55 <anarsoul> rect textures aren't supported in either gles2 nor gl2

16:55 <anarsoul> it's extension for gl2

16:55 <anarsoul> hardware doesn't support rect textures

16:55 <anarsoul> => driver shouldn't advertise support for this extension

16:58 <anarsoul> armessia: it shouldn't be difficult to fix

16:59 <armessia> anarsoul: I think so too, will have to have a further look though

17:07 nerdboy has joined #lima

17:08 nerdboy has quit [Changing host]

17:08 nerdboy has joined #lima

17:22 <Tofe> So, the reason of my crash is that the driver has a "const" op to execute, but that kind of op doesn't have any slot

17:22 <anarsoul> that's surprising

17:22 <anarsoul> can yo share the shader?

17:22 <Tofe> I'm... not even sure which one is it

17:22 <anarsoul> :D

17:23 <anarsoul> then record apitrace please

17:23 <Tofe> I guess QML has some shaders it uses for some predefined blending or so

17:23 <bshah|matrix> Tofe: dump all the shaders? And then grep? :D

17:23 <anarsoul> bshah|matrix: grep for what?

17:23 <bshah|matrix> There's some env var for it

17:24 <anarsoul> you don't know which one results in too many constants in instruction

17:24 <MoeIcenowy> Tofe: how do you know this reason?

17:24 <Tofe> MoeIcenowy: I have the full crash stack

17:24 <MoeIcenowy> Tofe: just use LIMA_DEBUG=gp, and put all the spells to a pastebin

17:24 <bshah|matrix> Ah wait, const is not coming from shader... Sorry for noise

17:25 <bshah|matrix> Tofe: do you know what In qml/qt triggers that?

17:25 <anarsoul> MoeIcenowy: I suspect it's gonna be fragment shader

17:25 <Tofe> bshah: if only... it's our phone qml app, but it's a big qml app, so I wouldn't be able to say what exactly. It crashes when we show the window initally

17:26 <anarsoul> Tofe: apitrace please

17:26 <Tofe> yes yes, one moment, remember I never used that one so far :p

17:31 <Tofe> damn, there's no recipe for OpenEmbedded yet it seems

17:33 <anarsoul> :(

17:35 <Tofe> well, I'll create one, I just hope it'll go smoothly

17:57 <Tofe> oh, great, it bundles its own khronos headers, and they make the assumption that unix==X11

17:59 <anarsoul> MoeIcenowy: I think lima won't be able to run apps that use cogl/clutter unless it switches to using GL_TEXTURE_2D

18:00 <MoeIcenowy> anarsoul: why

18:00 <anarsoul> it's not supported by hardware and fp16 precision is not enough to emulate it properly

18:01 <anarsoul> MoeIcenowy: Utgard PP support only fp16, that's only 10 bits precision for mantissa

18:01 adjtm has quit [Ping timeout: 276 seconds]

18:02 <anarsoul> it has special path to use varyings directly as coordinates for sampler instruction, but coordinates have to be normalized, i.e. in 0..1 range

18:02 <anarsoul> this special path has fp32 precision

18:02 <anarsoul> but once you load varying into register and use register as coordinates for sampler instruction your precision drops to fp16

18:03 <anarsoul> and thus only 10 bits

18:03 <anarsoul> it results in unreadable text if you use textures for UI

18:33 <MoeIcenowy> anarsoul: oops deqp doesn't do well when surfacelsss

18:33 <MoeIcenowy> less *

18:33 <MoeIcenowy> so it's needed to setup some display and use x11/wayland

18:33 <anarsoul> MoeIcenowy: use wayland

18:33 <anarsoul> weston should work fine

18:33 <MoeIcenowy> the problem is to set up a display ;-)

18:34 <anarsoul> I believe you can run headless weston

18:34 <MoeIcenowy> currently I hooked an unpowered LCD screen to Pine A64-LTS

18:34 <MoeIcenowy> anarsoul: but I doubt whether lima will work on headless weston

18:34 <anarsoul> why not?

18:35 <MoeIcenowy> here on my PC es2gears_wayland totally fail on headless weston

18:37 <anarsoul> hm

18:37 <MoeIcenowy> and I think it reasonable to doubt whether acceleration will work on headless "display"

18:37 <anarsoul> well, I'm not sure how they do it in gitlab CI

18:38 <MoeIcenowy> because it needs the underneath display to allocate buffers

18:39 <MoeIcenowy> oh on my PC GLES programs inside headless weston totally doesn't touch GPU

18:39 <MoeIcenowy> it uses llvmpipe

18:39 <anarsoul> well, looks like CI uses surfaceless

18:39 <anarsoul> what's the issue with surfaceless?

18:39 <MoeIcenowy> I totally cannot get it to run here

18:40 <MoeIcenowy> maybe some special parameters are needed?

18:40 <anarsoul> you need to enabled it in mesa as well

18:40 <anarsoul> "-D platforms=surfaceless,x11,wayland"

18:40 <anarsoul> s/enabled/enable

18:41 <MoeIcenowy> I didn't add -D platforms

18:41 <MoeIcenowy> auto should infer surfaceless

18:41 <anarsoul> and I don't think that it builds surfaceless by default

18:42 <anarsoul> yeah

18:42 <anarsoul> just checked it

18:42 <MoeIcenowy> I remember seeing it doing surfaceless...

18:42 <anarsoul> anyway, I'm at work now

18:42 <anarsoul> so I can't debug it

18:42 <anarsoul> panfrost uses it for CI so it should work

18:43 <MoeIcenowy> `EGL/Vulkan/VL platforms: x11 wayland drm surfaceless`

18:43 <anarsoul> see src/gallium/drivers/panfrost/ci/create-rootfs.sh

18:43 <MoeIcenowy> surfaceless is inferred

18:43 <anarsoul> do you have it enabled in deqp?

18:43 <MoeIcenowy> yes

18:43 <MoeIcenowy> deqp can only select one target once

18:43 <MoeIcenowy> but maybe my deqp is too old

18:44 <MoeIcenowy> I'm still fetching from googlesources

18:45 <anarsoul> maybe

19:14 <Tofe> At last ! I got an apitrace !

19:16 <Tofe> How do you usually share that kind of file?

19:18 <Tofe> well, I got it on my dropbox: https://www.dropbox.com/s/95ib0042mvbmeed/luna-qml-launcher.trace?dl=0

19:20 <Tofe> So now I see the various shaders involved, but I'm not sure what to look for, that could have ended with this "const" op

19:25 <Tofe> Damn, this kind of trace is very interesting

19:31 adjtm has joined #lima

19:34 <Tofe> The last used fragment shader is https://paste.ubuntu.com/p/44ZrqSnjFk/ ... it looks quite straightforward...

19:35 kaspter has quit [Ping timeout: 240 seconds]

19:37 <Tofe> There is also a vertex shader, which does some math: https://paste.ubuntu.com/p/vPX379MDM5/

20:04 nerdboy has quit [Ping timeout: 258 seconds]

20:18 <anarsoul> that's not the one that fails

20:19 <anarsoul> Tofe: open an issue and attach the file to it

20:19 <anarsoul> https://gitlab.freedesktop.org/lima/mesa/issues

20:21 <Tofe> ok, will do

20:33 <Tofe> voilà https://gitlab.freedesktop.org/lima/mesa/issues/121

20:33 <anarsoul> Tofe: it looks like it's gpir compiler issue

20:33 <anarsoul> could you compile mesa with debug enabled so you keep all the asserts, just to see where it fails?

20:35 <Tofe> yes, I'll prepare that

20:35 <anarsoul> thanks!

20:37 <anarsoul> rellla: you probably want to mark https://gitlab.freedesktop.org/mesa/mesa/merge_requests/1985 as 'Tested-by' by you :)

20:46 <rellla> yeah, right.

20:48 <anarsoul> I'd also appreciate if you review https://gitlab.freedesktop.org/mesa/mesa/merge_requests/1903

20:52 <rellla> i built that one, but didn't look deeper in the code yet. just let piglit run, but thats probably not the place where we should see the benefits?!

20:54 <rellla> anarsoul: i will try to have a look at it tomorrow

20:54 <anarsoul> thanks

20:55 <anarsoul> rellla: it should get rid of OOMs and page alloc failures that we see intermittently

20:56 <rellla> http://imkreisrum.de/piglit/mali450/c5010e7..12fe44a-lima-bo-cache/

20:58 <rellla> hm, maybe i should check that a second time... not that sth went wrong during built. i did many tests yesterday.

21:08 <anarsoul> I just pushed rebased version with fixes for few issues I noticed

21:09 <anarsoul> rellla: note that you need this kernel patch to use BO cache: https://patchwork.kernel.org/patch/11136781/

21:09 <anarsoul> otherwise it'll OOM badly for you

21:19 <rellla> should be applied...

21:43 <armessia> Just pushed my cubemap branch: https://gitlab.freedesktop.org/arnomessiaen/mesa/tree/lima-cubemaps

21:44 <armessia> It's also working now when getting the coordinates from a register, at least in my test shader

21:45 <anarsoul> armessia: I'd suggest running it through piglit

21:45 <armessia> Had to create a new ppir_op for handling the register case, don't know if it's the good way of doing it.

21:46 <armessia> anarsoul: will try to get piglit up and running, haven't succeeded yet

21:46 <armessia> Trying to build it from buildroot might not have been the best idea :-)

21:47 <anarsoul> probably it's not

21:47 <armessia> What backend do you guys run piglit with? GBM?

21:47 <anarsoul> yep

21:47 <armessia> alright

21:47 <armessia> You run armbian?

21:49 <anarsoul> no, I run archlinux

21:49 <anarsoul> armessia: I think you shouldn't introduce new op for loading cube coords

21:50 <anarsoul> just check number of components, it's gonna be 2 for 2d textures, 3 for cubemap

21:50 <anarsoul> (since we don't support 3d textures)

21:51 <armessia> Had the same feeling

21:51 <armessia> But in the register case num_components is 0 in ppir_codegen_encode_varying?

21:51 <armessia> Or have I missed something?

21:54 <anarsoul> I believe for register case register itself has num_components set

21:54 <anarsoul> you can check it

21:57 <armessia> Tnx for the hint, will look into it tomorrow

21:58 <armessia> Will also try to get piglit up and running on arch

22:10 <anarsoul> what board are you using?

22:11 <armessia> Orangepi one (H3)

22:11 <armessia> Also have an orangepi PC2 (H5) lying around, but haven't tested lima on it yet

22:13 <anarsoul> if you're going to use native compilation I'd suggest to use faster board

22:19 <armessia> yeah native compiling will be an issue I guess, slow to say the least

22:20 <armessia> I'll look into setting something up to cross-compile, but haven't done that yet outside of buildroot or kernel compilations.

22:21 <anarsoul> it's OKish with ccache on A64

22:23 jrmuizel has quit [Remote host closed the connection]

22:27 armessia has quit [Quit: Leaving]