#lima on 2019-11-25 — irc logs at freenode.irclog.whitequark.org

2019-07-03 10:24 ChanServ changed the topic of #lima to: Development channel for open source lima driver for ARM Mali4** GPUs - Kernel has landed in mainline, userspace driver is part of mesa - Logs at https://people.freedesktop.org/~cbrill/dri-log/index.php?channel=lima and https://freenode.irclog.whitequark.org/lima - Contact ARM for binary driver support!

00:34 camus has joined #lima

00:35 kaspter has quit [Ping timeout: 265 seconds]

00:35 camus is now known as kaspter

01:12 camus has joined #lima

01:13 kaspter has quit [Ping timeout: 246 seconds]

01:13 camus is now known as kaspter

01:31 camus has joined #lima

01:33 kaspter has quit [Ping timeout: 265 seconds]

01:33 camus is now known as kaspter

03:18 kaspter has quit [Ping timeout: 265 seconds]

03:18 kaspter has joined #lima

03:35 cp has quit [Ping timeout: 245 seconds]

03:44 piggz has quit [Read error: Connection reset by peer]

03:45 piggz has joined #lima

04:18 cp has joined #lima

04:21 kaspter has quit [Remote host closed the connection]

04:22 kaspter has joined #lima

05:18 Barada has joined #lima

05:19 kaspter has quit [Ping timeout: 265 seconds]

05:20 kaspter has joined #lima

06:21 cp has quit [Ping timeout: 265 seconds]

06:23 Barada has quit [Quit: Barada]

06:32 cp has joined #lima

06:42 cp has quit [Quit: Disappeared in a puff of smoke]

06:49 cp- has joined #lima

07:12 maccraft123 has joined #lima

07:45 maccraft123 has quit [Quit: WeeChat 2.6]

07:46 kaspter has quit [Ping timeout: 240 seconds]

07:46 kaspter has joined #lima

07:48 maccraft123 has joined #lima

07:52 maccraft123 has quit [Client Quit]

07:53 maccraft123 has joined #lima

08:02 dddddd has quit [Remote host closed the connection]

08:04 maccraft123 has quit [Read error: Connection reset by peer]

08:04 maccraft123 has joined #lima

08:07 yann|work has quit [Ping timeout: 240 seconds]

08:15 maccraft123 has quit [Quit: WeeChat 2.6]

08:20 <rellla> anarsoul: with your branch i get 4|4 passes but still a kernel error. i'm starting a complete run now.

08:20 warpme_ has joined #lima

08:36 maccraft123 has joined #lima

08:41 maccraft123 has quit [Client Quit]

09:26 _whitelogger has joined #lima

09:26 yann|work has joined #lima

10:03 <rellla> anarsoul: use.index_array.array still screws sth up, so i will do a run without the test

10:04 <rellla> i wonder if that could be some issue with 32/64bit?

10:04 <rellla> http://imkreisrum.de/deqp/vf_1/result_single.xml

10:05 <rellla> glGenBuffers(1, 0x0000ffffd83e1174); seems to deal with a 64bit pointer, which i don't know if we handle that correct!?

10:06 <rellla> the resulting pictures is kind of correct in the lower right area...

10:06 <rellla> can be bogus finding, too :p

10:33 <rellla> maybe we should ignore that issue for now and solve it later?

10:36 ecloud_wfh is now known as ecloud

10:39 maccraft123 has joined #lima

10:48 maccraft123 has quit [Quit: WeeChat 2.6]

11:35 camus has joined #lima

11:39 kaspter has quit [Remote host closed the connection]

11:39 camus is now known as kaspter

11:40 maccraft123 has joined #lima

11:41 maccraft123 has quit [Client Quit]

11:45 maccraft123 has joined #lima

11:48 smaeul has quit [Ping timeout: 240 seconds]

11:53 maccraft123 has quit [Read error: Connection reset by peer]

12:06 megi has joined #lima

12:12 smaeul has joined #lima

12:30 monstr has joined #lima

12:47 adjtm_ has joined #lima

12:50 adjtm has quit [Ping timeout: 276 seconds]

13:02 kaspter has quit [Quit: kaspter]

13:04 kaspter has joined #lima

13:17 maccraft123 has joined #lima

13:21 maccraft123 has quit [Client Quit]

13:21 maccraft123 has joined #lima

13:41 maccraft123 has quit [Quit: WeeChat 2.6]

13:48 <rellla> hm, no. we should solve that now :)

13:49 <rellla> my guess is, that the buffer size in glBufferData() is also relevant. http://imkreisrum.de/deqp/vf_1/result_single.xml uses 763.

13:51 <rellla> if i change that to 80, i get less lines and the test passes. 128 results in a blue square for both, result and reference picture, and 763 displays the right lines at the beginning of the draw but then sth goes wrong.

13:52 <rellla> so imho it's some buffer/memory related issue...

13:53 maccraft123 has joined #lima

14:22 maccraft123 has quit [Quit: WeeChat 2.6]

14:49 monstr has quit [Remote host closed the connection]

14:50 <rellla> heyo, i think i found sth related.

14:56 <rellla> info->count always comes with a MAX of 129 to lima_draw. so the above test uses a count of 763 and this might be the reason, why it breaks.

15:09 <rellla> the last working number of vertices is 126: http://imkreisrum.de/deqp/vf_3/result_single.xml

15:09 <rellla> whereas 127 shows wrong output: http://imkreisrum.de/deqp/vf_4/result_single.xml

16:03 <anarsoul> rellla: kernel error is definitely not good

16:07 <rellla> anarsoul: the first thing i want to solve is, why i get that maximum of 129 number of vertices !?

16:13 dddddd has joined #lima

16:22 <rellla> ok, so i stop for now. somwhere between deqp and mesa the nr is shrinked to 129 :(

16:25 <anarsoul> what number are you talking about?

16:26 <rellla> hooray, they pass. found it.

16:26 <rellla> http://imkreisrum.de/deqp/vf_5/result_single.xml

16:28 <anarsoul> what was the issue?

16:29 <rellla> analyize the issue is up to you :p

16:29 <rellla> let me explain, what i did:

16:29 <rellla> deqp limits the lines per draw to 128: https://github.com/KhronosGroup/VK-GL-CTS/blob/master/modules/gles2/functional/es2fBufferTestUtil.cpp#L56

16:30 <rellla> setting this value high enough to do all in one draw solves the tests (but not the issue probably :)

16:32 <rellla> mesa seems to correctly split the draws (for testing purpose i set max_verts to 50)

16:35 <anarsoul> is it a regression?

16:35 <rellla> i have to find out, if this https://github.com/KhronosGroup/VK-GL-CTS/blob/master/modules/gles2/functional/es2fBufferTestUtil.cpp#L608 is really executed as a loop

16:42 <rellla> it seems, the loop is executed only once. in mesa debug, i get one glDrawElements and one glDrawArrays

16:42 <rellla> https://pastebin.com/raw/Q3vVN35R

16:57 yann|work has quit [Ping timeout: 252 seconds]

16:59 <rellla> so, i can either fix this by setting https://github.com/KhronosGroup/VK-GL-CTS/blob/master/modules/gles2/functional/es2fBufferTestUtil.cpp#L56 to 125

17:00 <rellla> for one draw or to do all in once.

17:00 <anarsoul> rellla: issue should be fixed in lima, not in deqp

17:00 <rellla> yeah

17:02 <rellla> i'll look into that later. at least i know, that the issue is somehow related to nr of vertices

17:04 <rellla> it seems lima is not able to draw more than 125 vertices correctly in one draw.

17:05 <anarsoul> that's probably for line strips

17:06 <anarsoul> not sure if it's correct though since enunes' branch actually fixes glmark2 -b refract which has more than 65k vertices

17:07 <anarsoul> maybe we under-allocate some buffer?

17:56 megi has quit [Ping timeout: 250 seconds]

18:33 yann|work has joined #lima

21:06 rembrandt83 has joined #lima

21:07 <rembrandt83> Finally got my laptop from repair, power button was replaced.

21:08 <anarsoul> rembrandt83: great

21:08 <rembrandt83> So I looked at proper lookup table indexing from bitfields , this is one of the most sophisticated problems to me so far.

21:08 <rembrandt83> how to remap roots and logarithms to make them work with low latency

21:10 <rembrandt83> the tables seem easy to me, and probably one can index into them with filtering units or texture mapping units, but i haven't got much experience and knowhow on vertex shaders.

21:12 <rembrandt83> so you have some bitfield combination with filtering some power of two out from the modulus operation, now you have a base and remainder of that operation, this should be the index, how to clamp it into correct register is the problem.

21:13 <rembrandt83> the mirror odd even filtering mode seems good, but is there something similar in hw for vertex programs too, if texture lookup is missing on the vertex shader?

21:14 <rembrandt83> glpointsize has some hw clamping stuff, but i yet have not identified how that works

21:16 <rembrandt83> so the issue is, the coord normalization methods are quite oftenly used as divisions and logarithms and roots probably

21:16 <rembrandt83> you may have several twos complement cache buffers to get the info from, i am not sure how i redirect it cheaply to the correct one

21:27 megi has joined #lima

21:33 <rembrandt83> transform_feedback isn't kinda there for mali 400mp i assume or is there this chanche, some describe something like vertex texture fetch VTF. without PBOs this is going to put the passthrough fragment program texture info back to CPU and then into VBOs

22:02 <rembrandt83> what i think either some intelligent rounding instruction is needed on vertex shaders or every divisor of power of two needs to have a priority encoder to the constant cache for instance, to fetch the magic number from there, since there is 32powers then 32 buffers

22:26 <rembrandt83> anyhow all this is very complex. i am not entirely in woods with this one, having read literature in stacks too

22:27 <rembrandt83> but i do not like that fast memory as data cache is wasted on page memory while they can be iteratively pinned to store precomputed values of long latency ops

22:28 <rembrandt83> page tables are not very intelligent to waste cache on

22:42 <rembrandt83> most those theorems are bit overkill to me too though, highly hard to follow especially if i can not rely on very good math skills, which i never had.

22:45 <rembrandt83> so after having cracked a single one like lagrange quadtratic interpolation with resillient reading and following, and it feels like a lot of unoptimized one , all the time is wasted since there literally gazillions of theorems on the net

22:57 <rembrandt83> the data path is always going to be coarse grain sort of, but hw makes a lot of 32bit 4byte cache entries and regs available to be used, so it isn't much of a problem to waste them on priority encoder

22:58 <rembrandt83> since accessing those coarse grain regs will be always very fast on short pipeline mode

23:30 maccraft123 has joined #lima

23:31 <rembrandt83> effective address calculator has enough components but it all is pretty perverse to think about those solutions every day with no possibility to negotiate with anyone to cooperate

23:34 <rembrandt83> alone doing it is pretty mad, but this is all i got at the moment injuries took my out of other events long time allready, i think i will branch out doing all my own anyways

23:34 <rembrandt83> my/me

23:35 rembrandt83 has quit [Quit: CGI:IRC (EOF)]

23:38 maccraft123 has quit [Quit: sleep]

23:39 cwabbott has quit [Remote host closed the connection]

23:39 cwabbott has joined #lima