#lima on 2019-08-06 — irc logs at freenode.irclog.whitequark.org

2019-07-03 10:24 ChanServ changed the topic of #lima to: Development channel for open source lima driver for ARM Mali4** GPUs - Kernel has landed in mainline, userspace driver is part of mesa - Logs at https://people.freedesktop.org/~cbrill/dri-log/index.php?channel=lima and https://freenode.irclog.whitequark.org/lima - Contact ARM for binary driver support!

00:23 _whitelogger has joined #lima

00:25 Da_Coynul has quit [Quit: My MacBook Air has gone to sleep. ZZZzzz…]

00:31 Da_Coynul has joined #lima

01:02 yuq825 has joined #lima

01:10 embed-3d has quit [Ping timeout: 258 seconds]

01:40 Da_Coynul has quit [Quit: My MacBook Air has gone to sleep. ZZZzzz…]

02:36 jrmuizel has quit [Remote host closed the connection]

02:38 jrmuizel has joined #lima

02:49 jrmuizel has quit [Remote host closed the connection]

02:57 <wens> I wonder how different utgard is from midgard in terms of control flow stuff

02:58 <anarsoul> similar, but compilers are totally different :)

03:18 <anarsoul> perf doesn't work for me on pine64 :\

03:18 <anarsoul> it reports no events

03:30 <anarsoul> enunes: looks like pmu is broken in A64, so I have no profiler :(

03:40 dddddd has quit [Remote host closed the connection]

05:25 <bshah|matrix> anarsoul: got a chance to look at apitrace I had linked?

05:26 <anarsoul> bshah|matrix: I'm not sure where you linked it

05:26 <anarsoul> :)

05:26 <anarsoul> definitely not here

05:26 <bshah|matrix> Here? :P

05:26 <bshah|matrix> https://bshah.in/virt-keyboard-qmlscene.trace

05:27 <anarsoul> does replay work for you?

05:28 <bshah|matrix> Yes it does.. should have roughly 7 frames

05:29 <bshah|matrix> (it doesn't replay on Ubuntu machine for me due to apitrace version there being old)

05:29 <bshah|matrix> But works on arch

05:31 <anarsoul> 4 @0 eglGetPlatformDisplayEXT(platform = EGL_PLATFORM_WAYLAND_KHR, native_display = 0xffff8a3eaaa0, attrib_list = {}) = 0xaaaad28ba280

05:31 <anarsoul> 4: warning: unsupported eglGetPlatformDisplayEXT call

05:32 <bshah|matrix> Um weirdness.

05:32 <anarsoul> doesn't work on lima either

05:33 <anarsoul> error: unable to open display

05:33 <bshah|matrix> Is this Wayland session?

05:33 <bshah|matrix> (because I have used it on Wayland)

05:34 <anarsoul> it's not

05:34 <anarsoul> well, it doesn't work if I start it from sway either

05:34 <bshah|matrix> Hm quite weird

05:35 <bshah|matrix> I'll try capturing another one later today I guess.

05:35 Barada has joined #lima

07:00 <MoeIcenowy> enunes: got BootAnimation running on Lima after hacking the buffer allocation problem

07:05 <bshah> anarsoul: so I just tried on my end, again, and it seems I also get unsuported call bit, but in the end it replays fine..?

07:05 <bshah> "Rendered 24 frames in 0.817661 secs, average of 29.352 fps"

07:07 <bshah> in either case I uploaded fresh trace at : https://bshah.in/virt-keyboard-qmlscene.trace

07:07 <bshah> (same URL)

07:25 smaeul has quit [Ping timeout: 276 seconds]

07:44 megi has quit [Ping timeout: 246 seconds]

07:57 <MoeIcenowy> oh strange

07:58 <MoeIcenowy> the gbm buffer allocated w/ Lima cannot get correct modifier

07:58 <MoeIcenowy> thus it fails to re-import

08:20 megi has joined #lima

08:30 adjtm has quit [Ping timeout: 272 seconds]

09:09 adjtm has joined #lima

09:55 yuq825 has quit [Remote host closed the connection]

10:07 Da_Coynul has joined #lima

10:12 Da_Coynul has quit [Quit: My MacBook Air has gone to sleep. ZZZzzz…]

10:13 <MoeIcenowy> oh strange... the Gallium screen linked to the GBM instance seems to be softpipe

10:13 <MoeIcenowy> not lima

10:22 Da_Coynul has joined #lima

10:32 _whitelogger has joined #lima

10:38 Da_Coynul has quit [Quit: My MacBook Air has gone to sleep. ZZZzzz…]

10:39 monstr has joined #lima

10:53 dddddd has joined #lima

10:56 chewitt has quit [Quit: Adios!]

11:32 <rellla> what is the best way to share https://gitlab.freedesktop.org/mesa/mesa/blob/master/src/gallium/drivers/panfrost/nir/nir_undef_to_zero.c between lima and panfrost?

11:46 <rellla> and select lowering seems to be broken btw ...

11:47 <rellla> mov.s0 $0.x, sel.v1 $0 ^const0.xyxy ^const0.yxxy, const0 0.000000 1.000000 0.000000 0.000000

11:51 <mmind00> rellla: I'd guess sharing in https://gitlab.freedesktop.org/mesa/mesa/tree/master/src/panfrost/shared , similar to that tiling code that is already there

11:55 <rellla> mmind00: maybe we should move it to compiler/nir instead ...

11:57 afaerber has quit [Quit: Leaving]

11:57 monstr has quit [Quit: Leaving]

11:57 monstr has joined #lima

12:12 <enunes> rellla: yeah it's so generic that I would just propose it to be moved to compiler/nir instead

12:23 <rellla> enunes: ok, will prepare a patch. it solves the missing undef handling and fixes some more piglit tests http://imkreisrum.de/piglit/bc61253_sincos_fddxy_undef/

12:23 <rellla> ... the glsl-array-bounds-* ones..

12:23 <enunes> rellla: I've seen those, my vectorize patchset also indirectly fixed at least one of them

12:24 <enunes> but that pass seems like a good thing to have

12:24 <rellla> enunes: i'm not sure, if it's entirely right to make a zero const out of the undef ssa, but all other driver seem to do the same...

12:26 <enunes> yeah it would be nice to just not do anything rather than probably creating a mov to a field that is probably useless

12:26 <enunes> but it's not even that common and not sure how to solve it otherwise, so I am ok with assigning zero

12:27 <rellla> ok.

12:27 jrmuizel has joined #lima

12:27 <rellla> enunes: should i look into the write_mask/swizzle issue in lower_select (and probably others) ot is this expected to be moved out out ppir lowering?

12:29 <enunes> rellla: one thing I'm about to do in the same vectorize patchset is turn selects into scalars because apparently lima can't support the vec4 selects the way nir intends

12:30 <enunes> are there write_mask/swizzle issues in other op lowerings?

12:30 <enunes> or... potential issues

12:32 <rellla> sin/cos is eliminated with the nir MR

12:32 jrmuizel has quit [Remote host closed the connection]

12:33 jrmuizel has joined #lima

12:35 <rellla> and for the other potential candidates ... i haven't had a look :)

12:36 <rellla> "alu->dest.write_mask = u_bit_consecutive(0, num_components);" or "alu->dest.write_mask = 1;" always is suspicious :p

12:36 <enunes> always confuses me as wel

12:38 <rellla> i just stumbled across it in lower_sin and lower_select. the write_mask was effectively hardcoded to 0001, even if it should write to .y

12:40 jrmuizel has quit [Remote host closed the connection]

12:43 <rellla> this is a problem in piglit, where one of two consts (1.0, 0.0, 0.0, 1.0) and (0.0, 1.0, 0.0, 1.0) is probed.

12:43 yuq825 has joined #lima

12:43 <rellla> when sth is wacky with .xyzw we get wrong results. especially when we deal with a select...

12:47 Unit193 has quit [Read error: Connection reset by peer]

12:48 Unit193 has joined #lima

12:49 <enunes> rellla: this is what I'm currently testing before sending the MR for, it creates many more assignments to .y and .z but those still seem to work

12:49 <enunes> https://gitlab.freedesktop.org/enunes/mesa/commit/7741e81bd639fb80f64e503a658d5eaa986fbb4c

12:50 buzzmarshall has joined #lima

12:51 <rellla> enunes: for the lazy rellla... what does BITSET_SET macro do?

12:52 <enunes> we have a bitset called alu_lower which lists the ops that nir_lower_alu_to_scalar will convert to scalar, I'm telling it to also do that for select ops

12:52 <rellla> ah ok

12:53 <rellla> as it probably fixes some dmesg errors according to your comment, i should give it a quick piglit run :)

12:57 <enunes> vectorize affects many tests and optimizes a lot of them, this is the shaderdb report https://paste.fedoraproject.org/paste/LPNjTA9Osvrtn2P9dW62tQ/raw

12:58 <enunes> couldnt finish testing yesterday, hopefully today

12:59 <rellla> is there anywhere some doc how to create this shaderdb reports?

13:00 <enunes> this one with the test names is a bit of a hack, but if you run piglit one time with MESA_SHADER_CAPTURE_PATH to collect all shaders, you can just use them later with shaderdb itself

13:00 <enunes> after my MR on it gets merged

13:01 <enunes> just the shader names will be like 3-1212.shader_test 3-2203.shader_test etc

13:02 <enunes> https://gitlab.freedesktop.org/mesa/shader-db/blob/master/README.md

13:34 jrmuizel has joined #lima

13:43 Barada has quit [Quit: Barada]

13:52 jonkerj has quit [Quit: brb]

13:54 jonkerj has joined #lima

14:03 monstr has quit [Remote host closed the connection]

14:03 yuq825 has quit [Remote host closed the connection]

14:05 <anarsoul> enunes: I profiled mpv and looks like it spends 10% of time in _mesa_format_convert

14:05 <anarsoul> I think we need to add support for single-component textures

14:08 <enunes> anarsoul: how much work is that?

14:11 <enunes> anarsoul: I tried perf top on my pinebook with Fedora as well and I got nothing, but my perf version doesn't match my kernel so I wonder if that's a problem

14:14 <anarsoul> enunes: not a lot, I believe it's just a matter of adding corresponding enums

14:17 <rellla> enunes: http://imkreisrum.de/piglit/bc61253_sincos_fddxy_undef_fixsel/fixes.html with your vectorize opt ...

14:18 <rellla> fixes/regressions do still look weird within the last ppir instructions ...

14:18 <rellla> lower_select still is untouched

14:19 <enunes> rellla: I figured that when tests result in 0, 0, 0, 0 and that's an impossible result, it's just some random instability that happens for some reason

14:19 <enunes> manually running the test again passes

14:22 <anarsoul> rellla: you probably have to get used with select weirdness in assembler :)

14:23 <rellla> enunes: manual run passes ...

14:23 <rellla> anarsoul: yeah, probably :)

14:23 <anarsoul> however http://imkreisrum.de/piglit/bc61253_sincos_fddxy_undef_fixsel/bc61253_sincos_fddxy_undef_fixsel/spec@glsl-1.20@execution@uniform-initializer@fs-float-array.html doesn't look correct

14:24 <anarsoul> e.g. mov.s0 $1.x, sel.s1 $0.w ^const0.y ^const0.y, const0 0.000000 1.000000 0.000000 0.000000

14:24 <anarsoul> two "const.y" look fishy

14:26 <enunes> I waste so much time with those random failures, and even running the same task in a loop never reproduces it if I want

14:27 <anarsoul> enunes: running the same task in the loop won't help if you have a read from uninitialized register somewhere

14:27 <anarsoul> looks like Mali4x0 preserves reg values

14:27 <anarsoul> at least PP

14:27 <anarsoul> so if you read from $0 but never write into it you'll be reading a value that's left over of some old shader

14:28 <anarsoul> so if you want to reproduce failure run previous shader and shader that failed

14:28 <enunes> I tried that too, running full piglit many times, different tests failed at random runs

14:29 <anarsoul> btw, I found that https://gitlab.freedesktop.org/anarsoul/mesa/commit/af459a8fa905621a8da5cdc2ded863cc11c385c7 helps a lot to catch issues like uninitialized reg

14:29 <rellla> however, have to go now. this branch https://gitlab.freedesktop.org/rellla/mesa/commits/sincos_fddxy_undef_fixsel corresponds to the last piglit results. it contains the sin/cos nir commit, my fddxy commit, handling of undef ssa and the vectorize pass - based on bc61253

14:31 <enunes> anarsoul: can we use that and turn it into an assert or something so we can fix the unitialized reg to see if that resolves the random failures?

14:32 <enunes> or just print some greppable debug

14:32 <anarsoul> enunes: it's hard

14:33 <anarsoul> and I'm not sure if it's worth the efforts

14:33 <MoeIcenowy> strange...

14:33 <anarsoul> basically it happens when dest doesn't match src

14:33 <anarsoul> in ppir

14:33 <MoeIcenowy> EGL_ANDROID_native_fence_sync doesn't work

14:34 <MoeIcenowy> even EGL_KHR_fence_sync doesn't work

14:34 <enunes> anarsoul: yeah and then, it might also be something in gpir... gpir tests fail at random as well with trivial fragment shader

14:37 <anarsoul> :(

14:38 <MoeIcenowy> does anyone know how the out_sync of submit work?

14:46 <anarsoul> yes, it uses fences

14:46 <anarsoul> enunes: you should probably ask Connor to look into it

14:47 <anarsoul> I can read GP disassembly, but I barely understand the compiler

14:50 <enunes> anarsoul: well since examples with trivial vertex programs also fail, I fear it might be something different like kernel or command stream related

14:50 <enunes> I tried valgrind some time ago, enabled it for all shader_runner examples and run many times, some tests still randomly failed, but no valgrind diffs between the randomly failed and passed runs

14:54 <MoeIcenowy> anarsoul: how does fences work on Lima?

14:55 <MoeIcenowy> I think I got a total mess on fence on Lima when trying on Android

15:00 <anarsoul> MoeIcenowy: see kernel driver, it's not really lima-specific

15:00 <anarsoul> enunes: I'd check shader first though

15:01 <anarsoul> MoeIcenowy: btw, looks like I got rid of RCU stalls

15:02 <enunes> maybe disabling no-concurrency and running tests in parallel can reproduce it more easily

15:02 <enunes> I should try that

15:03 <enunes> with a limited set, and then hopefully be able to bisect it

15:05 <anarsoul> enunes: so maybe uninitialized reg read in GP?

15:05 <enunes> anarsoul: maybe, but I think it will also happen with the trivial passthrough vertex shader

16:05 jrmuizel has quit [Remote host closed the connection]

16:29 piggz_ has joined #lima

16:34 jrmuizel has joined #lima

16:35 Elpaulo has quit [Quit: Elpaulo]

16:41 <anarsoul> enunes: then we need a reproducer

16:44 piggz_ has quit [Quit: Konversation terminated!]

16:44 piggz_ has joined #lima

16:55 piggz_ has quit [Ping timeout: 258 seconds]

16:59 piggz_ has joined #lima

17:12 <anarsoul> enunes: you may want to fix this warning: https://gist.github.com/anarsoul/7d01ef090b666aa680cf3ed1e64e58fa

17:29 <enunes> anarsoul: I wonder why I didnt notice that, will do

17:45 Elpaulo has joined #lima

18:47 buzzmarshall has quit [Remote host closed the connection]

18:47 adjtm has quit [Ping timeout: 268 seconds]

20:42 <anarsoul> ouch, mipmapping code is definitely broken

20:43 <anarsoul> LIMA_MAX_MIP_LEVELS is 13 and it'll indeed try to attach all the levels in lima_texture_desc_set_res() if they're present

20:44 <anarsoul> the problem is that texture descriptor is 64 bytes and last 2 levels won't fit

21:10 jrmuizel has quit [Remote host closed the connection]

22:21 adjtm has joined #lima

22:41 Da_Coynul has joined #lima

22:56 Da_Coynul has quit [Quit: My MacBook Air has gone to sleep. ZZZzzz…]

22:56 jrmuizel has joined #lima

23:18 Da_Coynul has joined #lima

23:58 Da_Coynul has quit [Quit: My MacBook Air has gone to sleep. ZZZzzz…]