<Lyude|PTOish>
that sounds like it was painful to debug
<tomeu>
I think I was lucky :)
<davidlt>
wow, finally
<davidlt>
Intel released the Thunderbolt 3 spec, which becomes USB 4
<HdkR>
So all of our USB 4 controllers are going to be huge and overpriced? :p
<raster>
they will contain an i7 just in the controller :)
<HdkR>
Hopefully once someone other than Intel releases a controller then the firmware will be able to be updated
<raster>
(though what's new, just about every hdd has an arm cpu in it... and tonnes of raid controllers are little arm machines, and bmc's etc.) :)
<HdkR>
aye
<HdkR>
Samsung's SSDs have like...five CPU cores in them now
<raster>
not surprising...
<raster>
moar cores!
<raster>
:)
<HdkR>
Cortex-R4s or something last I knew
<raster>
cortex r's?
<raster>
they are really being paranoid
<HdkR>
Back in their three-core designs, I knew they separated the logic so that one core handled reads, one handled writes, and one handled misc things. Not sure what happens with the new additional cores
<raster>
i wonder if gpus will ever just become extra cpu cores with different scheduling (and some texel fetch/write/blend instructions etc.)
<HdkR>
You mean Larrabee? :P
<raster>
well larrabee was a discrete gpu right?
<HdkR>
Which was effectively a ton of Pentium 3 CPUs with some additional GPU-esque instructions
<raster>
just happened to be x86
<raster>
well i was more thinking regular cpu cores just with insane SMT
<HdkR>
Or pentium? I forget now
<raster>
i think they were ye olde pentium
<HdkR>
But you can see how that project failed
<raster>
but imagine a cpu that, instead of running until a stall then switching to another vcpu (diff reg bank etc.),
<raster>
it would just run 1 instruction then hw bank switch to another context
<raster>
and it could hold like 32, 64, 128+ of these per core
<raster>
so you'd hide stalls in ctx switches
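What raster is describing is essentially a barrel processor. Here is a minimal C sketch of the idea; every name and the toy "pipeline" are invented for illustration, not real hardware:

```c
/* Minimal sketch of the "barrel processor" scheduling described above:
 * run one instruction, then hardware-switch to the next register bank.
 * All names and the toy pipeline are hypothetical. */
#include <stdint.h>
#include <stdio.h>

#define NUM_CONTEXTS 64          /* the 32x/64x/128x+ banks per core */

struct hw_context {
    uint64_t regs[32];           /* per-context register bank */
    uint64_t pc;                 /* per-context program counter */
    int      stalled;            /* e.g. waiting on a memory access */
};

static struct hw_context ctx[NUM_CONTEXTS];

/* One core cycle: issue a single instruction from the current context,
 * then unconditionally rotate to the next bank.  A stalled context
 * simply gives up its slot, so memory latency is hidden as long as
 * enough other contexts are runnable. */
static void core_cycle(unsigned *current)
{
    struct hw_context *c = &ctx[*current];

    if (!c->stalled)
        c->pc += 4;              /* stand-in for executing one insn */

    *current = (*current + 1) % NUM_CONTEXTS;
}

int main(void)
{
    unsigned current = 0;

    for (int cycle = 0; cycle < 256; cycle++)
        core_cycle(&current);

    /* 256 cycles / 64 contexts = 4 instructions each */
    printf("ctx0 pc: %llu\n", (unsigned long long)ctx[0].pc);
    return 0;
}
```

With 64 resident contexts, a stall in one context costs nothing as long as the other 63 have an instruction ready, which is exactly how GPUs hide memory latency.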
<HdkR>
You're just describing a GPU at that point
<raster>
yup
<HdkR>
:P
<raster>
it's not a very big leap between gpus and cpus these days
<raster>
so why really have them be so different?
<HdkR>
and if you were mad enough, you could use ARM's SVE instruction set as a GPU. Make a GPU using AArch64 + 2048bit SVE
<raster>
it's missing texel fetch/interpolation stuff in hw
<raster>
no concept of tiled mem layout etc.
<HdkR>
Glue some additional texture fetch pipelines on
<HdkR>
:D
<raster>
yup
<raster>
that's all it really needs... :)
<raster>
with just a lot of hw "SMT" switching (instead of 2 or 4 as you see today - 32x, 64x etc.)
<raster>
your gpu is really just some daemon you ipc to :)
<raster>
(that daemon is scheduled exclusively on these wide smt cores)
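To make the SVE tangent concrete, here is the kind of vector-length-agnostic loop such a core would run, sketched with the ACLE SVE intrinsics; the blend_span function and its role as "shader code" are invented for illustration:

```c
/* Illustrative only: a vector-length-agnostic inner loop a
 * hypothetical SVE-based "GPU" core could run, here a simple blend
 * between two spans of float pixels.  Compile with e.g.
 * -march=armv8-a+sve.  Not real GPU or Panfrost code. */
#include <arm_sve.h>
#include <stddef.h>

void blend_span(float *dst, const float *a, const float *b,
                float alpha, size_t n)
{
    for (size_t i = 0; i < n; i += svcntw()) {
        /* predicate covers the (possibly partial) tail automatically */
        svbool_t    pg = svwhilelt_b32_u64(i, n);
        svfloat32_t va = svld1_f32(pg, &a[i]);
        svfloat32_t vb = svld1_f32(pg, &b[i]);
        /* dst = b + (a - b) * alpha */
        svfloat32_t vm = svmla_f32_x(pg, vb,
                                     svsub_f32_x(pg, va, vb),
                                     svdup_n_f32(alpha));
        svst1_f32(pg, &dst[i], vm);
    }
}
```

The same loop works unchanged whether the hardware implements 128-bit vectors or the full 2048 bits SVE allows, which is what makes the idea plausible for the ALU side; the texel fetch and tiled-layout hardware raster mentions would still have to be bolted on.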
<raster>
i don't know why larrabee failed but rumors were that intel underestimated the sw work needed to actually write the "gpu"
<HdkR>
I figured perf/w wasn't anywhere near where it needed to be to be competitive
<raster>
i wonder if that was a sw or hw problem tho...
<raster>
was it they just were taking too long on the sw to maximize the hw utilisation...
<HdkR>
Could be both :P
<raster>
or the hw just wasn't capable enough...
<davidlt>
Xeon Phi (KNC, KNL, etc) also didn't fly
<raster>
so i hear - xeon phi was an offshoot of larrabee
<davidlt>
yes
<davidlt>
KNL was the 1st true product, KNC was kinda beta-testing
<davidlt>
but it was complicated, two types of RAM on the board
<davidlt>
SMT4, AVX-512 (but not all extensions), SSE4, AVX were considered legacy IIRC
<davidlt>
we always struggled to efficiently use it
<raster>
tho to go against gpu's you'd need to seriously up the core *AND* smt count
<raster>
i thought they only did 4 way smt on phi
<davidlt>
Yes, 4 SMT
<raster>
you'd need more like 32x
<davidlt>
and really complicated memory setup
<raster>
or 64x
<raster>
as you'd probably just make a very dumb scheduler that does 1 instruction then switches, so to hide stalls you'd need a lot of contexts to fill in the cycles
<davidlt>
soon we will know what Intel is doing with GPUs, Gen 11 stuff will be discussed at GDC IIRC
<raster>
let's see
<davidlt>
Gen 11 is like a base for their Xe
<raster>
i wonder if this ended up coming from their gpu group or the larrabee guys (where they finally managed it and it just took longer)
<davidlt>
come on, they hired loads of people incl. some legendary folks from AMD
<raster>
what i have on my desk is a bit xeon-phi like
<davidlt>
I think, it's at least 3-4 years for them on this project
<raster>
tho i suspect its still beefier per core
<raster>
so i can imagine how you'd build a gpu out of this many cores :)
<alyssa>
mifritscher: You are the 0.1%! :P
<alyssa>
tomeu: I don't have a script to check the code style, no
<alyssa>
Not sure what G-S is missing, probably just a ton of debugging
<alyssa>
tomeu: Oh, fwoosh, yes, you're right. Good catch :) Send a patch? :P
<raster>
davidlt: well i wonder what they based it on - their existing gpu designs just dialed up to 11, or a larrabee-like design... :)
<davidlt>
raster, Gen 11 is a new design
<raster>
like from scratch?
<davidlt>
from my understanding
<davidlt>
the execution units are significantly smaller IIRC
<raster>
hmmm then i wonder what it looks like
<davidlt>
and they are doubling the count, from 24 to 48 on the iGPU side
<raster>
smaller than their gpu designs or their larrabee design?
<davidlt>
there was a leaked benchmark and it outperforms any integrated AMD solutions based on Vega
<raster>
hmmm
<davidlt>
smaller compared to Gen 9.5
<raster>
but this is discrete
<davidlt>
this is iGPU
<raster>
hmmm ok so that means they can get more cores on
<raster>
oh i thought they were doing a discrete gpu?
<davidlt>
there was a leaked benchmark for Ice Lake
<davidlt>
They are starting with Gen 11 as major step, that's iGPU
<raster>
hmm then what's the relation to the discrete gpu rumors?
<davidlt>
but it's base for them for their 2020 Xe project (which scales from iGPU all the way to datacenter)
<davidlt>
it's a starting point for them
<raster>
hmmm
<davidlt>
one step before the big thing
<raster>
will be interesting to see
<davidlt>
and Gen 11 is like 100-130% faster (or more in some cases) according to the leaked benchmark
<davidlt>
(compared to previous generation iGPU)
<alyssa>
Y'all are noisy :p
<raster>
yup
<raster>
:)
<tomeu>
alyssa: I'm unsure how this should be fixed, as I'm missing some knowledge of what the design is like
<raster>
davidlt: well double+ is a good leap for team blue.
<tomeu>
alyssa: is the overwritten pointer supposed to be stored somewhere else?
<alyssa>
tomeu: Let me see
<alyssa>
It's been a while since I touched that code
<tomeu>
guess my first doubt is why we are allocating the levels with malloc, then allocating a BO for the AFBC
<alyssa>
tomeu: The overwritten pointer should be freed, I guess, and then we should check for afbc to decide whether to free() or not
<alyssa>
tomeu: Essentially though this is a bigger design issue, I guess -- the way it's set up, anything texture-like defaults to a tiled texture, but if you try to render into it, it turns itself into an AFBC resource
<alyssa>
That's.... probably wrong :p
<tomeu>
ok, I can do that, though I wonder if we shouldn't be delaying the allocation so we don't allocate something that doesn't end up being used
<tomeu>
ah, thought everything would be AFBC except possibly some render targets
<alyssa>
We probably should be, but how late do we delay?
<alyssa>
tomeu: The problem with making everything AFBC is we have no way to compress into AFBC from software
<tomeu>
probably until the same point in time where we allocate the AFBC buffer now?
<alyssa>
AFBC buffer is allocated from set_framebuffer_state
<alyssa>
You're uncovering a massive kludge of hacks right now :p
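For readers following along, the fix alyssa sketches would look roughly like the following; every name here is guessed for illustration and the real panfrost_resource layout differs:

```c
/* Rough sketch of the fix being discussed; every name is guessed and
 * the real panfrost_resource layout differs.  The point: release the
 * malloc'd storage when the resource is converted to AFBC, and make
 * the destroy path check which case it is in. */
#include <stdbool.h>
#include <stdlib.h>

struct fake_resource {
    bool  has_afbc;    /* set once it is converted to a render target */
    void *cpu_levels;  /* malloc'd tiled-texture storage */
    void *afbc_bo;     /* GPU buffer object, owned by the BO API */
};

static void resource_convert_to_afbc(struct fake_resource *rsrc,
                                     void *new_bo)
{
    /* free the overwritten pointer instead of leaking it */
    if (!rsrc->has_afbc) {
        free(rsrc->cpu_levels);
        rsrc->cpu_levels = NULL;
    }
    rsrc->afbc_bo = new_bo;
    rsrc->has_afbc = true;
}

static void resource_destroy(struct fake_resource *rsrc)
{
    /* check for AFBC to decide whether to free() or release the BO */
    if (rsrc->has_afbc) {
        /* release rsrc->afbc_bo through the real BO API here */
    } else {
        free(rsrc->cpu_levels);
    }
}
```

tomeu's alternative, deferring the allocation until the resource's role is known, would sidestep this dual ownership entirely.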
<HdkR>
I forget, is AFBC lossless or lossy?
<raster>
A
<HdkR>
and don't tell me "visually lossless" is lossless :P
<HdkR>
"The format preserves original image exactly (bit exact), and compression ratios are comparable to other lossless compression standards."
<alyssa>
HdkR: lossless :p
<HdkR>
Totally fair if you're converting resources to AFBC when they become RTs, as long as you can later sample from AFBC formats :D
<HdkR>
More of an issue if you couldn't sample from it later...
<tomeu>
alyssa: ok, maybe we should do some refactoring before fixing this
<tomeu>
alyssa: sorry if it looks like I'm patch-bombing too much today :)
<HdkR>
tomeu: You have a typo in your latest blog post. You called Bifrost Bitfrost :P
<raster>
frosty bits...
<raster>
:)
<raster>
alyssa: any reason you abort a clear if color is NULL?
<raster>
or well, complain about it and try to clear if it wasn't cleared during a partial render?
<raster>
it may be that it never clears at all for any reason :)
<raster>
so "!ctx->frame_cleared" in panfrost_flush() doesn't mean it'
<raster>
it's partial rendering
<raster>
if it never cleared, then panfrost_clear() will be a nop and so won't be doing what the comment above says :)
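A toy model of the ambiguity raster is pointing at, with invented names (in the real driver the flag is presumably reset when a partial render consumes the clear):

```c
/* Toy model of the failure mode described above; all names invented,
 * not the real panfrost code. */
#include <stdbool.h>
#include <stddef.h>

struct fake_ctx {
    bool         frame_cleared;    /* set by the clear path */
    const float *last_clear_color; /* NULL if no clear was ever issued */
};

static void fake_clear(struct fake_ctx *ctx, const float *color)
{
    if (!color)
        return;                    /* the "abort" raster asks about */
    ctx->last_clear_color = color;
    ctx->frame_cleared = true;
}

static void fake_flush(struct fake_ctx *ctx)
{
    /* !frame_cleared is ambiguous: it may mean "a partial render
     * already consumed the clear", or it may mean "this frame was
     * never cleared at all".  In the second case the replay below is
     * a silent nop, contradicting a comment that claims otherwise. */
    if (!ctx->frame_cleared)
        fake_clear(ctx, ctx->last_clear_color);

    /* ...build and submit the frame... */
}
```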
<alyssa>
tomeu: Flooding my inbox with patches is more than welcome! :)
<alyssa>
raster: Old bug, the code I pushed last night should fix that