jrmuizel has quit [Remote host closed the connection]
_whitelogger has joined #lima
dddddd has quit [Remote host closed the connection]
Barada has joined #lima
MoeIcenowy has quit [Ping timeout: 245 seconds]
chewitt has joined #lima
MoeIcenowy has joined #lima
guillaume_g has joined #lima
Wizzup has quit [Ping timeout: 246 seconds]
Wizzup has joined #lima
cwabbott has quit [Read error: Connection reset by peer]
cwabbott_ has joined #lima
MoeIcenowy has quit [Ping timeout: 252 seconds]
MoeIcenowy has joined #lima
Barada has quit [Quit: Barada]
Net147 has quit [Read error: Connection reset by peer]
Net147 has joined #lima
cwabbott_ has quit [Read error: Connection reset by peer]
afaerber has quit [Quit: Leaving]
cwabbott has joined #lima
embed-3d_ has joined #lima
embed-3d has quit [Ping timeout: 248 seconds]
afaerber has joined #lima
dddddd has joined #lima
jrmuizel has joined #lima
buzzmarshall has joined #lima
Barada has joined #lima
Barada has quit [Client Quit]
Elpaulo has quit [Quit: Elpaulo]
<rellla>
heyo lima guys, i need your help ;)
<rellla>
i tried to implement ppir fddx/fddy and got a reasonable codegen imho - piglit tests (for example shaders@glsl-derivs) still fail, see http://imkreisrum.de/piglit/glsl-derivs/
<rellla>
for comparison i fed the offline compiler with the shader and got the following code:
<rellla>
my guess is that it has something to do with the sync. i don't actually understand what it does, but maybe it's a problem that dFdx and dFdy are split into 2 instructions?
<rellla>
(in case you wonder, i hardcoded dFdx to scalar slot and dFdy to vector slot for the testing)
<rellla>
that's the first issue i have. the second one, which i'm personally more interested in: what are the general conditions i have to follow if i want to combine some instructions after the scheduler?
<rellla>
for example that would be instr 001 and 002 in my test case, which contain dFdy and dFdx, both reading from $0 and writing to $1 with different components used.
<enunes>
rellla: right now there is no general automatic combining, if you want to (or must) combine them, you have to combine them yourself
<cwabbott>
and yeah, sync is required for all instructions that do derivatives and texturing, since threads have to share their results
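[editor's note: as background for the sync discussion above — hardware derivatives like dFdx/dFdy are typically computed as finite differences across a 2x2 pixel quad, which is why the threads have to share their results. A rough conceptual sketch in Python (not the actual ppir/Mali implementation; the coarse-derivative model shown here is an assumption for illustration):]

```python
# Conceptual model of dFdx/dFdy over a 2x2 pixel quad.
# Pixels are laid out as:
#   (x0,y0) (x1,y0)
#   (x0,y1) (x1,y1)
# Each thread evaluates f for its own pixel; the quad then
# exchanges values (the "sync") so a difference can be formed.

def quad_derivatives(f00, f10, f01, f11):
    """Return (dFdx, dFdy) for the quad, coarse-derivative style:
    one value shared by all four pixels."""
    dfdx = f10 - f00  # difference along x within the top row
    dfdy = f01 - f00  # difference along y within the left column
    return dfdx, dfdy

print(quad_derivatives(1.0, 3.0, 2.0, 4.0))  # -> (2.0, 1.0)
```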
<cwabbott>
you seem like you're doing it correctly
<rellla>
i disabled nir_lower_wpos_ytransform temporarily to avoid the load_uniform op, but this should not be the problem
<rellla>
maybe there is some other thing missing still... texture based maybe?
<rellla>
enunes: regarding the combination, i guess i have to think about the conditions under which the current instruction can be combined with the last one.
<enunes>
rellla: I think it's not necessary to care about this now unless it's required
<enunes>
which looks like it isn't
<rellla>
isn't it an advantage, when instr count goes down?
<rellla>
or do you just want to say, "there are much more important tasks now" :)
<enunes>
yes, but right now it would only be some sort of premature optimization; at some point I think there should be an attempt to do an overall pass that combines things when possible
<anarsoul>
rellla: if disassembly looks correct but it still doesn't work you should check actual binary
<anarsoul>
maybe some "unknown" fields differ
<anarsoul>
(that's how I discovered that the branch instruction has another "next instruction length" field, for the case when the branch is taken)
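[editor's note: anarsoul's suggestion — diffing the actual binaries rather than the disassembly — can be done with a small script. This is a generic sketch, not a lima-specific tool; the file contents below are made-up example data:]

```python
# Compare two shader binaries word by word to spot differing
# "unknown" bit-fields that the disassembler may not print.
import struct

def diff_words(a: bytes, b: bytes):
    """Yield (word_index, word_a, word_b) for each differing
    little-endian 32-bit word in the common prefix of a and b."""
    n = min(len(a), len(b)) // 4
    for i in range(n):
        wa, = struct.unpack_from("<I", a, i * 4)
        wb, = struct.unpack_from("<I", b, i * 4)
        if wa != wb:
            yield i, wa, wb

# Example with in-memory buffers; in practice you would read the
# two compiled shaders from disk, e.g. open("ours.bin", "rb").read().
ours   = struct.pack("<4I", 0x10, 0x20, 0x30, 0x40)
theirs = struct.pack("<4I", 0x10, 0x21, 0x30, 0x40)
for i, wa, wb in diff_words(ours, theirs):
    # wa ^ wb shows exactly which bits differ
    print(f"word {i}: {wa:#010x} != {wb:#010x} (xor {wa ^ wb:#x})")
```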
drod has joined #lima
guillaume_g has quit [Quit: Konversation terminated!]
jonkerj has quit [Read error: Connection reset by peer]
jonkerj has joined #lima
buzzmarshall has quit [Remote host closed the connection]
jrmuizel has quit [Remote host closed the connection]