#lima on 2019-11-07 — irc logs at freenode.irclog.whitequark.org

2019-07-03 10:24 ChanServ changed the topic of #lima to: Development channel for open source lima driver for ARM Mali4** GPUs - Kernel has landed in mainline, userspace driver is part of mesa - Logs at https://people.freedesktop.org/~cbrill/dri-log/index.php?channel=lima and https://freenode.irclog.whitequark.org/lima - Contact ARM for binary driver support!

00:02 <mastermart> it does not matter on utgard based gpus cause 128pipeline stages is somewhere between 8-16 maximum instruction words

00:03 <mastermart> and there are enough texture units 16 for them

00:04 <mastermart> but on amd chips for instance there can be 40waves per CU, and only 16texture units, the algorithm became very complex and burned by brain edventually without knowing the queueing

00:04 <mastermart> cause the tests in simulator is not one to one, cause miaow lacks texture unit logic

00:04 <anarsoul> there's likely only one texturing unit on utgard

00:04 <anarsoul> since you can fetch only one texel per clock

00:06 <mastermart> no , this is not possible, i think opengl 2.1 requires to have at least 8 to 16

00:07 <mastermart> but i dunno either, will look at this tomorrow

00:07 <anarsoul> samplers

00:07 <mastermart> but one texel per clock sort of like makes sense

00:07 <anarsoul> not texturing units

00:16 <mastermart> Hmm, i am not sure if i was the one who was wrong after all on #opengl

00:16 <mastermart> i think sampler is the same as texture unit overall

00:16 <mastermart> actually maybe not

00:17 <mastermart> it is the ROP output probably which is fed to texture unit

00:19 <mastermart> I am very tired today, and i drank a bit too, i have to look but if my memory is not fuzzed on this matter, i think

00:19 <mastermart> still every bundle had it's dedicated unit for utgard for texturing too

00:25 <mastermart> anarsoul: but yeah actually i know your fears, my docs say sm3.0 needs to have 4texunits available at least..

00:26 <anarsoul> mastermart: utgard doesn't support sm3

00:26 <anarsoul> it was designed for sm2

00:26 <mastermart> which is yeah short of simd aribte

00:28 <mastermart> anarsoul: so what happens if you try to access the texture unit1 which is the only available

00:28 <mastermart> from another

00:28 <mastermart> thread/warp

00:28 <mastermart> does it stall cause the unit is allready busy?

00:29 <anarsoul> no idea, we don't have performance counters yet, so I can't even take a look

00:29 <mastermart> for my algorithm which i designed, this is indeed reasonably bad and limiting factor for parallelism

00:33 <mastermart> then I should sniff, from connors referenced/citied patents probably if they mention it there, or search some arm/falanx patents from google.

00:45 mastermart has quit [Ping timeout: 268 seconds]

02:19 mastermart has joined #lima

02:19 <mastermart> https://patents.justia.com/patent/8719553

02:21 <mastermart> this clarifies quite a lot but, not everything, first sentence is that functional units 2 3 4 5 6 have access to the cache

02:21 <mastermart> there are two pipelines also like two cores maybe?

02:21 dddddd has quit [Remote host closed the connection]

02:25 <mastermart> the design does re-enter stuff, so it's all good

02:26 <mastermart> for an example if that load misses the cache, the instruction is thrown out of the pipeline and

02:26 <mastermart> and inserted again later

02:30 <mastermart> anarsoul: my primary interest was stall events like deferred blocking assignments in the address calculation stalling because of no evaluation events, then the subsequent stage resets the unit

02:32 <mastermart> and basically always fail on the register later on until the address of the backing reg is changed with WAR hazard

02:32 <mastermart> which is of course the needed behavior

02:36 <mastermart> they yeah kinda have different terminology or meaning for texture units, and they say they have one per pipeline, whatever it means , but barrelled lock-step functional units they have more than those 5

02:37 <mastermart> so definitely each word has functional unit which has access to cache, and it is done in lock-step maybe?

02:47 <mastermart> i head off to sleep, and i can not always access the channel, i am using VPN since obviously my IPs are all blocked/banned :)

02:54 <mastermart> well it appears that utgard and midgard gpus have very flawless design, bu in fact it takes some time to get used to the weird terminolgy forks :) however as computer scientists once again this arm norway is seeming to be good group of them.

03:46 cp has quit [Quit: Disappeared in a puff of smoke]

03:46 cp has joined #lima

04:06 megi has quit [Ping timeout: 268 seconds]

04:13 mastermart has quit [Ping timeout: 265 seconds]

04:58 chewitt has quit [Read error: Connection reset by peer]

05:48 mastermart has joined #lima

07:14 _whitelogger has joined #lima

07:15 maccraft123 has joined #lima

07:33 monstr has joined #lima

07:41 maccraft123 has quit [Ping timeout: 268 seconds]

07:49 yann has quit [Ping timeout: 245 seconds]

08:00 maccraft123 has joined #lima

08:12 maccraft123 has quit [Ping timeout: 264 seconds]

08:14 maccraft123 has joined #lima

08:29 maccraft123 has quit [Quit: WeeChat 2.6]

08:39 <rellla> anarsoul: http://imkreisrum.de/deqp/1/ these are the 2 tests mentioned yesterday

08:42 <rellla> both fail with master, too.

09:13 yann has joined #lima

11:13 <mastermart> http://openasip.org/move/doc/Experiments_of_TTA_on_ASIC_technology.pdf if you can look at page 24 of that.

11:13 <mastermart> Fins make strong statements which are consistent with swedish and yankee benchmarks, and consistent as to how i see and understand and read from miaow.

11:14 <mastermart> namely that with pipelining enabled, the full schedule of warps is asynchronous

11:16 <mastermart> and what is more important since fetch and decode is no longer done, everything works according to other stages critical path or worst case latency to start another pipeline

11:17 <mastermart> in other words, dispatcher is always in charge of feeding rst and clk signals asynchronously. Which is fantastic on it's own. You need not to touch the powering and frequency scaling part of the circuit at all.

11:23 <mastermart> What i say is the cowork of api and hw designers which maybe coincidental/fluky has left the backdoor for performance in both vertex and fragment shaders, hardware designers meant all good, but maybe it was inevitable to patch up/block performance backdoors

11:26 <mastermart> So it is only the driver in software land which is broken, old api and hw seems to provide the possibility at least in some coincidental way.

11:28 <mastermart> It required some positiveness to be seen in life, and minor work added to identify/rediscover those ways, i was all determined and knew this is the case for some years ago allready, however i have been polishing the details ever since.

11:29 <mastermart> and making vast set of backup plans for any hardware ever manufactured, and i start to make the code later in real hw, as job ordered from my company.

11:31 megi has joined #lima

11:31 <mastermart> The idea is as much paranoid as one party maybe, the hdl is defined by another members of community, the hw is designed my some, the api is designed by some, if only the vendor is paranoid about selling their gradual enhancements and the api is just so gigantic

11:32 <mastermart> they probably do not get away all clean for several reasons with blocking the way entirely

11:38 <mastermart> Imo similarly if there is not evidence present and supporting information to bunch of persons lies, there is no results to show their superioty either, it is a matter of time when they expose their lies and fallback

11:39 <mastermart> or fall down

11:41 <mastermart> one commented very strikingly or genuinly (i added it to my facebook timeline) like so: They will ignore you, then they will laugh at you, then they will fight you, and then you win! My supporters have given all one sided advise to me, that this does not end simpler, once the rebeller scammer started, they will continue until stopped.

11:43 <mastermart> cause those three symptoms sort of citied, they took of from initial stage, where bunch of people made a run up based of entire lies of course, then they ignored, then they laughed at me, then they were beaten and then i won.

11:45 <mastermart> To the question when i quit playing due to terror, who from your superior minds to send to play and present estonian sportsmen or women i assume blank silence

11:52 chewitt has joined #lima

11:53 <mastermart> and i am not interested in standing behind and representing intruguents to the world, you either show legit skills, back off or there is going to be a fight due to past criminal activity and terror towards me.

12:07 <mastermart> better straighten up before i die due to medication long term abuse, where people responsible will be treated anyways later, and this is not funny , I chose the best med for me due to being forced to use any of them in list, but some of those meds probably allready canceled on of my good friend, put off her life candle for good.

12:07 <mastermart> roughly 3-4 years ago if i remember that shock time properly.

12:19 mastermart has quit [Quit: Leaving]

13:45 kaspter has quit [Remote host closed the connection]

13:46 kaspter has joined #lima

13:47 gcl has joined #lima

14:09 maccraft123 has joined #lima

14:35 dddddd has joined #lima

15:16 maccraft123 has quit [Quit: WeeChat 2.6]

15:16 megi has quit [Quit: WeeChat 2.6]

15:51 <anarsoul> GL_INVALID_OPERATION instead of GL_INVALID_VALUE?

15:51 <anarsoul> ask #dri-devel?

15:52 <anarsoul> rellla: ^^

16:05 jernej has quit [Quit: Free ZNC ~ Powered by LunarBNC: https://LunarBNC.net]

16:06 jernej has joined #lima

16:15 jernej has quit [Quit: Free ZNC ~ Powered by LunarBNC: https://LunarBNC.net]

16:16 jernej has joined #lima

17:11 yann has quit [Ping timeout: 276 seconds]

17:19 megi has joined #lima

17:59 monstr has quit [Remote host closed the connection]

18:21 yann has joined #lima

19:07 chewitt has quit [Quit: Zzz..]

19:08 chewitt has joined #lima

19:40 chewitt has quit [Quit: Zzz..]

21:02 maccraft123 has joined #lima

21:17 <anarsoul> rellla: what's results.xml?

21:17 <anarsoul> is it overall results?

21:31 <rellla> anarsoul: i enabled the shaders.* tests and sorted out the crashing ones.

21:32 <anarsoul> I see

21:32 <anarsoul> gpir compiler hangs a lot on these

21:32 <anarsoul> however there're few ppir compiler crashes as well

21:32 <anarsoul> and *lots* of failures

21:32 <anarsoul> :)

21:32 <rellla> i'm not sure if thats one the results with the crash or the “final“ result

21:33 <anarsoul> btw, clipping also fails a lot

21:33 <anarsoul> dEQP-GLES2.functional.clipping.*

21:34 <anarsoul> looks like we don't clip when we should

21:34 <anarsoul> (look at result image and reference image)

21:34 <rellla> tomorrow i can push a result.xml with corresponding fails and skips list if you want

21:35 <anarsoul> I have my own :)

21:35 <rellla> ok :)

21:35 <anarsoul> are you planning to look into some failures?

21:35 <rellla> btw, how do i get the images?

21:36 <anarsoul> --deqp-log-images=enable

21:36 <rellla> maybe, i don't know how much time i can bring up for that - and how good my skills are of course :p

21:37 <rellla> mainly i did that to move from piglit to deqp and of course, to see what is solveable by myself...

21:38 <rellla> i still have to find out, where i have to search for the test's code...

21:39 <rellla> GL_INVALID_OPERATION in the above example can have many reasons for example

21:40 <rellla> though i haven't really looked at it...

21:40 <anarsoul> rellla: that's unlikely to be lima issue

21:40 <anarsoul> :)

21:41 <rellla> i just wondered, because they don't appear in the expected fails list

21:44 <rellla> ... and yes, results.xml is the overall result with shaders enabled and re-disabled some subtests due to crashes.

22:41 maccraft123 has quit [Quit: WeeChat 2.6]