#panfrost on 2019-04-07 — irc logs at freenode.irclog.whitequark.org

2019-02-15 17:52 alyssa changed the topic of #panfrost to: Panfrost - FLOSS Mali Midgard & Bifrost - https://gitlab.freedesktop.org/panfrost - Logs https://freenode.irclog.whitequark.org/panfrost - <daniels> avoiding X is a huge feature

01:38 _whitelogger has joined #panfrost

01:54 tgall_foo has joined #panfrost

02:56 _whitelogger has joined #panfrost

03:20 _whitelogger has joined #panfrost

03:44 _whitelogger has joined #panfrost

04:15 mifritscher has quit [Ping timeout: 258 seconds]

04:17 mifritscher has joined #panfrost

04:26 _whitelogger has joined #panfrost

04:32 _whitelogger has joined #panfrost

04:57 <anarsoul> alyssa: hey

04:58 <anarsoul> alyssa: have you looked into introducing a CAP to indicate that driver doesn't need in-memory zsbuf when rendering for scanout?

05:36 <alyssa> anarsoul: Memory usage hasn't been high prio tbh

05:36 <alyssa> So it'd be nice, but no, no plans to do so

05:38 <anarsoul> alyssa: it's also a waste of memory bandwidth

07:56 _whitelogger has joined #panfrost

09:30 stikonas has joined #panfrost

09:40 stikonas has quit [Remote host closed the connection]

11:14 _whitelogger has joined #panfrost

11:15 raster has joined #panfrost

12:56 TheCycoTWO has quit [Ping timeout: 244 seconds]

13:07 TheCycoONE has joined #panfrost

13:31 stikonas has joined #panfrost

13:59 _whitelogger has joined #panfrost

14:27 <alyssa> anarsoul: No? You're not actually writing to it (you disable that CAP or no CAP), the issue is just the unused block.

14:27 <alyssa> So yes, it's a waste, but not for bw (at least not in 'frost)

14:49 raster has quit [Remote host closed the connection]

16:01 <HdkR> ack! Cat jumped on computer desk and spooked me

16:02 <HdkR> She's exploring the shelves in my storage room :D

16:02 <alyssa> Meow.

16:12 <anarsoul> alyssa: oh, so you're not setting it for scanout?

16:21 <alyssa> anarsoul: Correct

16:21 <alyssa> Well, I'm setting it but the hw doesn't touch it so it's 0 bw impact

16:29 <anarsoul> alyssa: got it. Now I'm doing the same and it gives me 2fps in 'shadow' scene (14 vs 16)

16:31 <alyssa> anarsoul: Nice :)

16:32 <alyssa> Probably the extra allocation isn't affecting perf much so

16:32 <anarsoul> well, it would be nice to fix it as well to save 8mb

16:33 <alyssa> Sure

16:57 Lyude has quit [Quit: WeeChat 2.2]

17:00 Lyude has joined #panfrost

17:05 <anarsoul> alyssa: do you know if there's a common nir lowering pass to lower fsin/fcos? Looks like GP on utgard can't do it

17:09 <alyssa> anarsoul: Lower to..?

17:10 <anarsoul> polynomial?

17:12 <alyssa> anarsoul: ....You need to lower it to a polynomial? Ouch.

17:12 <alyssa> I'm not aware of a pass for that, no

17:12 <alyssa> I guess it's not too hard to emulate yourself, go back to high school math ;P

17:13 <anarsoul> well, maybe not

17:14 <anarsoul> let me see what blob does

17:16 <alyssa> anarsoul: If you do need to lower to a polynomial, I mean, the Maclaurin series will be easy enough to implement via nir_builder

17:16 <alyssa> x - x^3/6 + x^5/120 - x^7/... or something

17:17 <alyssa> Although, even better, there's fancy games you can play to keep the multiplications down, I don't remember the name of the technique offhand

17:19 <alyssa> anarsoul: Wikipedia diving says the word I was looking for was "Horner's method"

17:19 <alyssa> Bear in mind I don't have any numerical analysis background so I'm probably talking uack

17:20 <anarsoul> alyssa: thanks, I'll try to poke output of offline compiler first

17:21 <alyssa> anarsoul: Probably fair -- there's a good chance you may have ops you don't know about yet

17:23 jolan has quit [Quit: leaving]

17:24 <cwabbott> anarsoul: iirc, that was just handled by a huge polynomial

17:24 <alyssa> cwabbott: Oh, hi!

17:24 jolan has joined #panfrost

17:25 <cwabbott> alyssa: hi!

17:26 <HdkR> HI!

17:26 * HdkR needs more tea

17:26 <anarsoul> cwabbott: thanks

17:29 <anarsoul> cwabbott: and there's no nir pass for that, is there?

17:29 <cwabbott> anarsoul: sadly, no

17:30 <alyssa> anarsoul: Have fun :P

17:30 <cwabbott> GP was the only thing crazy enough not to have dedicated sin/cos acceleration

17:30 <alyssa> Hey, I kinda think implementing that would be fun!

17:30 <anarsoul> alyssa: probably means that mesa doesn't support hardware with this level of sanity yet :)

17:30 <alyssa> But I'll let anarsoul have that pleasure :P

17:31 <anarsoul> cwabbott: but they have log and exp! :)

17:32 <cwabbott> yeah, crazy right :)

17:34 <anarsoul> looks like vc4 does something like that, but not with nir pass

17:34 <anarsoul> probably anholt had this reason to implement it like this

17:34 <cwabbott> well, that reason could've just been "no one else will need to do this" for all we know

17:36 <anarsoul> or it just was there before he converted vc4 to nir

17:48 <anarsoul> cwabbott: alyssa: do you know if there's input range for sin/cos in glsl? I.e. what will happen if I pass 4*PI to sin? Is it expected to return the same as sin(0)?

17:50 <cwabbott> anarsoul: yeah, there are some precision limitations but it should be around 0

17:51 <cwabbott> if you dump the blob's output, you'll see they do some range reduction before the polynomial

17:52 <anarsoul> cwabbott: I'm not used yet to mbs_dump output for vertex shader :)

17:53 <cwabbott> anarsoul: iirc there's a decompile option that will give you a much saner output

17:53 <cwabbott> trying to read raw GP assembly is... not fun

17:54 <anarsoul> ouch

17:56 <anarsoul> https://gist.github.com/anarsoul/23766501b354e8049d8195fcc1cbfeb9

17:58 <anarsoul> that's a bit longer than I expected

18:01 <cwabbott> it duplicates common subexpressions, so it can get a bit long

18:02 <anarsoul> you mean decompiler?

18:02 <cwabbott> yeah

18:03 <anarsoul> well, I'm pretty sure I can use anholt's vc4 code as a reference

18:42 <alyssa> cwabbott: *Resisting urge to write Midgard decompiler intensifies*

18:44 <anarsoul> decompiler? I thought that midgard assembly is sane enough

18:45 <alyssa> anarsoul: It definitely is, but that doesn't make decompiler authoring super enticing regardless ;P

20:16 <anarsoul> what's the difference between nir_op_flt and nir_op_slt?

20:17 <alyssa> Ask in dri-devel?

20:19 <anarsoul> OK, it compiles. Will see if it works in ~30mins (need to compile it on device now)

20:19 <alyssa> Ook

20:20 <alyssa> anarsoul: I love how #panfrost became #lima? :P

20:20 <anarsoul> :)

20:20 <anarsoul> there's no one else to ask on weekend

20:20 <alyssa> Ah, right, but I have no life so you can ask here, got it :)

20:21 <anarsoul> haha, do you imply I have no life either? :)

20:21 <alyssa> anarsoul: You're working on lima on a Sunday too ^_^

20:22 <anarsoul> it's raining here

20:22 <alyssa> Uh-huh ;)

20:22 <anarsoul> so I have an excuse :P

20:23 <alyssa> So what's if it's us, it's us and only us? And what came before, what count anymore or matter, can we thaaaat?

20:41 <anarsoul> darn, [jellyfish] <default>:gpir: unsupported nir_op: flog2

20:42 <alyssa> Ack!

21:18 stikonas has quit [Remote host closed the connection]

21:21 mifritscher has quit [Ping timeout: 252 seconds]

21:48 <anarsoul> OK, I'm a bit puzzled why nir_alu_type_get_type_size() returns 1 for fmul

21:49 <anarsoul> and as result for the very first fmul I get glmark2-es2-drm: ../src/compiler/nir/nir_builder.h:413: nir_build_alu: Assertion `src_bit_size == nir_alu_type_get_type_size(op_info->input_types[i])' failed.

21:56 <alyssa> anarsoul: More informatino please

21:57 <anarsoul> alyssa: https://gist.github.com/anarsoul/b03c1a1547a5e514a86cafd9a15e7efd

21:58 <anarsoul> first nir_fmul_imm() throws this assertion

21:58 <alyssa> anarsoul: Sample source NIR shader

21:58 <alyssa> ?

21:59 <anarsoul> how do I print it?

21:59 <alyssa> nir_print_shader or something

21:59 <alyssa> Probably lima has an env flag for it

21:59 <alyssa> (Panfrost has MESA_MIDGARD_DEBUG=shaders)

21:59 <alyssa> anarsoul: General nitpick... nir_builder can make a lot of this code easier I think

22:00 <anarsoul> but it already uses nir_builder

22:00 <alyssa> Wait, you use that stuff there

22:00 <alyssa> Nvm

22:04 <anarsoul> https://gist.github.com/anarsoul/c5fd109d1e2333fcec68c3237bf0b0ad

22:04 * alyssa eyes

22:05 <alyssa> This doesn't make sense :(

22:06 <anarsoul> interesting, it gets into lower_sin() twice

22:07 <anarsoul> well, there're 2 sins, so it's expected

22:09 <alyssa> Yeah

22:32 fysa has joined #panfrost

22:45 mifritscher has joined #panfrost