#lima on 2019-08-19 — irc logs at freenode.irclog.whitequark.org

2019-07-03 10:24 ChanServ changed the topic of #lima to: Development channel for open source lima driver for ARM Mali4** GPUs - Kernel has landed in mainline, userspace driver is part of mesa - Logs at https://people.freedesktop.org/~cbrill/dri-log/index.php?channel=lima and https://freenode.irclog.whitequark.org/lima - Contact ARM for binary driver support!

00:09 <anarsoul|c> Yeah, but t let's leave it for later

00:15 jrmuizel has quit [Remote host closed the connection]

01:17 yuq825 has joined #lima

02:46 dddddd has quit [Remote host closed the connection]

03:59 forkbomb has quit [Remote host closed the connection]

03:59 forkbomb has joined #lima

04:56 Barada has joined #lima

05:36 <anarsoul> enunes: we can do something similar to what I did to constants, i.e. clone uniforms on every usage

05:36 <anarsoul> but we can leave it for later

05:37 <anarsoul> let's land control flow support first, I'd really like to avoid introducing any fancy lowerings since they likely will break control flow if it's not here yet

05:44 Elpaulo has quit [Read error: Connection reset by peer]

05:45 Elpaulo has joined #lima

06:32 Barada has quit [Quit: Barada]

06:42 Barada has joined #lima

07:24 sphalerite_ has joined #lima

08:10 sphalerite has quit [Quit: WeeChat 2.4]

08:10 sphalerite_ is now known as sphalerite

09:08 _whitelogger has joined #lima

09:45 <rellla> anarsoul: enunes: i finally got my H5 and H3 set up. so i have now setups for all different Mali4** running: A10 -Mali400, H3 -Mali400-MP2, H5 -Mali450 ...

09:46 <rellla> so if you like me to do tests on different platforms, please ping me

09:47 <rellla> ... now starting a piglit run on current master on all 3 ...

10:11 yuq825 has quit [Remote host closed the connection]

10:22 cwabbott has quit [Quit: cwabbott]

10:25 cwabbott has joined #lima

10:34 dddddd has joined #lima

10:37 <rellla> can i run piglit headless?

10:55 adjtm_ has quit [Ping timeout: 248 seconds]

11:08 <enunes> rellla: hmm yeah with gbm backend

11:08 <enunes> I run it on a board that doesn't have any display output connector

11:10 <rellla> enunes: right, now it runs. we have some right issues with default /dev/dri/* devices.

11:11 <enunes> rellla: hmm yeah I have a local modprobe.conf file with dependencies to load lima after sun4i-drm

11:13 <rellla> do i need to set some other rights somewhere, as sudo piglit.sh runs fine, whereas running it as normal users fails all tests?

11:15 <enunes> I run it as root on the test systems, but maybe you need to add your user to groups 'video' or 'render' or something like that, based on groups in /dev/dri

11:15 <rellla> card0 is root:video, renderD128 is root:render, user is member of video and render...

11:15 <rellla> strange ...

11:17 megi has joined #lima

11:25 <rellla> enunes: doing a re-login after adding the user to the groups does the trick :p

11:31 ecloud has quit [Ping timeout: 245 seconds]

11:33 ecloud has joined #lima

12:56 yuq825 has joined #lima

13:16 jrmuizel has joined #lima

13:18 jrmuizel has quit [Remote host closed the connection]

13:24 yuq825 has quit [Remote host closed the connection]

13:25 ninolein has joined #lima

13:26 yuq825 has joined #lima

13:48 Barada has quit [Quit: Barada]

13:49 jrmuizel has joined #lima

13:54 jrmuizel has quit [Ping timeout: 272 seconds]

14:09 forkbomb has quit [Quit: In the beginning the Universe was created. This has made a lot of people very angry and been widely regarded as a bad move.]

14:10 forkbomb has joined #lima

14:30 jrmuizel has joined #lima

14:37 jrmuizel has quit [Ping timeout: 244 seconds]

14:40 <rellla> heyo, i've done some new piglit runs on master and with anarsoul's cf branch, see here: http://imkreisrum.de/piglit/

14:42 <rellla> i gave them a go on 400 and 450, only regression on 450 from dmesg-fail to fail, which shouldn't be related to cf

14:43 <rellla> s/only/only one/

14:49 adjtm has joined #lima

14:50 <rellla> seems there isn't much left to do on the ppir side except what was mentioned above. nir_ssa_undef_instr is also missing - i'll have a look at that

14:51 <enunes> rellla: dmesg-fail might happen without https://gitlab.freedesktop.org/mesa/mesa/merge_requests/1638

14:54 <enunes> there might be unstable tests without it due to https://gitlab.freedesktop.org/lima/mesa/issues/95

14:54 <enunes> but the patchset needs another respin

14:55 <rellla> sure. i also encounter issues, where the default piglit accurancy of 0.01 makes some tests fail

14:57 <enunes> well at least they should be consistently failing or passing

14:58 <enunes> comparing 400 and 450 on that should be interesting result

14:58 <cwabbott> rellla: the accuracy thing is expected, it's because pp uses half-floats but desktop GL assumes that you have full 32-bit floats, exposing legacy GL support is kinda a hack in the first place

14:59 <cwabbott> *desktop GL support

14:59 <cwabbott> there's a way to only advertise support for mediump (half-precision) in GLES which the blob uses, but it's probably not hooked up in mesa

15:00 jrmuizel has joined #lima

15:00 <rellla> cwabbott, would it be an option, to temporarily hack the accurancy to 0.02 or even 0.03 in piglit to pass the tests?

15:01 <cwabbott> that's not something that could be upstreamed in piglit

15:01 gcl has joined #lima

15:01 <rellla> not for upstream, just local. just for now.

15:02 jrmuizel has quit [Remote host closed the connection]

15:06 <cwabbott> I suppose, although maybe the better thing long-term would be to advertise only mediump support, and then only look at gles2 results

15:06 <cwabbott> there are always going to be a bunch of known failures with desktop gl

15:07 <rellla> ok.

15:10 <rellla> cwabbott: where is the best place to put a nir pass, that is shared between panfrost and lime? compiler/nir ?

15:10 <rellla> s/lime/lima/

15:11 <rellla> that one: https://gitlab.freedesktop.org/mesa/mesa/blob/master/src/gallium/drivers/panfrost/nir/nir_undef_to_zero.c

15:14 <cwabbott> no idea, although you shouldn't need that pass at all

15:14 <cwabbott> when going out of SSA, undef is just a register that is read and never written

15:15 <rellla> http://imkreisrum.de/piglit/mali450/fdd6151_control_flow/fdd6151_control_flow/shaders@glsl-array-bounds-01.html

15:15 jrmuizel has joined #lima

15:16 <cwabbott> again, just turn each undef into a register that is never written

15:16 <anarsoul> cwabbott: it's cheaper to turn it into const

15:17 <rellla> ok

15:17 <rellla> iirc, using the nir pass fixed it.

15:17 <cwabbott> anarsoul: I don't think so, that will take up an extra const

15:17 <anarsoul> consts can be inserted to any instruction and they require no regs

15:18 <cwabbott> but too many consts will split up the bundle

15:18 <cwabbott> I guess you'll have to mark the register as undef so that you won't make it interfere with anything

15:19 <anarsoul> cwabbott: that'll be more code for no benefit

15:22 <cwabbott> anarsoul: it is a slight benefit

15:24 jrmuizel has quit [Remote host closed the connection]

15:25 <cwabbott> if you do it after out-of-ssa, it should only matter for "bad" code like this test though

15:26 <cwabbott> but in i965 there was a significant benefit from making ssa_undef handling even more relaxed, so this "bad" code is unfortunately quite common

15:40 <anarsoul> cwabbott: fair enough

15:48 yuq825 has quit [Remote host closed the connection]

16:02 jrmuizel has joined #lima

16:10 jrmuizel has quit [Ping timeout: 272 seconds]

18:40 jrmuizel has joined #lima

18:45 jrmuizel has quit [Ping timeout: 268 seconds]

18:55 <anarsoul> enunes: I split pp cf commit into several smaller commits, so it's easier to review now

19:16 jrmuizel has joined #lima

19:18 jrmuizel has quit [Remote host closed the connection]

19:27 <anarsoul> rellla: we also need support for different sampler types

19:35 <rellla> anarsoul: yes, as i meant - except what you and enunes already discussed yesterday...

19:35 <anarsoul> oh, OK

19:35 <anarsoul> (no one is working on sampler types btw)

19:35 <anarsoul> it'd be really nice to get support for samplerCube

19:36 <anarsoul> but it requires some RE of command stream since we don't know descriptor format for cube textures

19:37 <anarsoul> (probably it's similar to 2D with few flags set here and there)

19:38 <rellla> i have done that quick hack https://gitlab.freedesktop.org/rellla/mesa/commits/nir_ssa_undef_instr to support undef with a lowering to const like others do it...

19:38 <anarsoul> rellla: I think cwabbott is right and we just need to make it a register that doesn't conflict with anything else

19:38 <rellla> needs refactoring to share the code, but i'm not sure anymore if i should think about the reg solution

19:39 <anarsoul> it shouldn't be difficult, just introduce new reg flag (undef) and if it's set never mark this reg as interfering with other regs in regalloc

19:40 <anarsoul> then just create a dummy op for undef with ppir_dest that contains a reg that has this flag set

19:40 <anarsoul> I mean ppir_node with ppir_op_dummy

19:41 <rellla> hm, ok. and lowering to that reg happens during ppir lowering?

19:41 <anarsoul> don't add this node to node_list

19:41 <anarsoul> rellla: no lowering is necessary

19:43 <anarsoul> well, you may want to introduce a lowering pass that removes dummy nodes from nodes list (but don't free it - we need its ppir_dest)

19:44 <anarsoul> rellla: and please work on top of my branch :)

19:45 <rellla> no, not lower. just create the reg node within ppir_emit_ssa_undef?

19:46 <rellla> i will look into it. and yes, i'll take your branch :)

19:46 <anarsoul> rellla: yes, ppir_node_create_reg() with op = ppir_op_dummy

19:47 <rellla> i think, i'm able to do this...

19:47 <anarsoul> rellla: then add lowering pass that calls list_del(&node->list); for nodes with op = ppir_op_dummy

19:48 <anarsoul> rellla: also add undef flag to ppir_reg and set it in emit_ssa_undef

19:48 <anarsoul> rellla: and then set interference = false if this flag is set in ppir_regalloc_prog_try()

19:49 <rellla> ok, thanks. good mini-howto :p

19:49 <anarsoul> as result regalloc will pick any reg for it

19:49 <anarsoul> and it won't increase reg pressure nor use const nodes

19:50 <anarsoul> and result will be undef

19:50 <anarsoul> (however it doesn't mean that it won't be equal to 5.0, so test may fail)

19:50 <anarsoul> :D

19:54 <rellla> in practice, this shader shouldn't appear out there anyway, except we meet "bad" code!?

19:55 <anarsoul> rellla: yeah, undef shouldn't appear unless there's a bug in shader

19:56 <anarsoul> I can be wrong here though but I don't know any scenarios when it can appear otherwise

20:05 <rellla> what is this else for? is't the whole if-then-else obsolete atm? https://gitlab.freedesktop.org/anarsoul/mesa/blob/lima-zsbuf/src/gallium/drivers/lima/ir/pp/regalloc.c#L687

20:05 <rellla> sry, wrong branch.

20:06 <anarsoul> basically two regs interfere if their live ranges intersect

20:07 <anarsoul> IIRC I verified this code, feel free to double check it

20:08 <rellla> sry for the noise. my misreading.

20:09 <anarsoul> this 'else' hits if reg1->live_in == reg2->live_in

20:09 <rellla> got it

23:27 <anarsoul> enunes: I briefly looked through ideas-lamp-lit shader and looks like fusing condition into branch won't help it

23:28 <anarsoul> it uses something like "vec1 1 ssa_181 = feq ssa_178, ssa_180; if ssa_181 { ...}"

23:28 <anarsoul> well, maybe not the best example

23:29 <anarsoul> "vec1 1 ssa_46 = iand ssa_39, ssa_45; if ssa_46 { ... }"

23:29 <anarsoul> branch condition can be only a combination of less, equal, more