#apicula on 2020-10-01 — irc logs at freenode.irclog.whitequark.org

03:57 _whitelogger has joined #apicula

04:20 FabM has joined #apicula

08:48 <trabucayre> pepijndevos: I'm fighting with gowin to see why erase is not working.

08:49 <trabucayre> my feeling with these devices is jtag is less robust compared to lattice, xilinx...

08:59 <pepijndevos> Oh so flash programming is not working because it's not getting erased properly?

09:01 <trabucayre> yes

09:02 <pepijndevos> :(

09:02 <trabucayre> this code worked... why not now?

09:56 <pepijndevos> Maybe the bits are having a slow day...

10:03 <trabucayre> I think I need to rewrite/rethink this class entirely...

10:04 <trabucayre> it's clearly the stable device

10:10 <trabucayre> the least stable

11:13 <pepijndevos> trabucayre, have you seen this? https://github.com/YosysHQ/apicula/blob/master/doc/commandstructure.md Mostly daveshah's work. "Number of frames" sounds like it could maybe solve the bram problem?

11:16 <pepijndevos> I think I only updated it to my understanding of the CRC, the rest is copied from dave IIRC. He looked at Gowin before I started my internship. The amount of "like ECP5" I've seen during this project is almost comical.

11:17 <pepijndevos> Usually when I'm completely puzzled by something I ask him how ECP5 does it... and that's generally a pretty good initial guess for how Gowin works.

11:19 <pepijndevos> (if you have anything to add to that file from your experience, highly appreciated of course. you know much more about the jtag and crc bits than me)

11:46 <trabucayre> Yes I've seen this page... But I need to reread :)

11:47 <trabucayre> unfortunately, number of frames is the real number of frames in the bitstream, not for the FPGA

11:47 <trabucayre> This is why I used idcode to hardcode the correct size

11:48 <trabucayre> Maybe I have some notes about the header somewhere... The challenge is to find where :)

12:04 <pepijndevos> Lofty, I'm writing some documentation about the timing db... and I'm trying to reconstruct our reasoning about the order of timing info... and failing

12:04 <trabucayre> not a good day for everyone :)

12:05 <Lofty> input rising, output rising; input falling, output rising; input rising, output falling; input falling, output falling

12:05 <Lofty> Permute to taste, I suppose

12:13 <pepijndevos> Lofty, yea, the permutations are what I'm wondering about... It makes sense NMOS is faster so falling times would be lower.

12:13 <pepijndevos> IIRC we had some theory based on instances where 2 of them were identical

12:13 <pepijndevos> 'a_f': [1.2081600427627563, 1.2542400360107422, 1.2081600427627563, 1.2542400360107422]

12:13 <Lofty> I think they were for flops, yeah

12:13 <pepijndevos> 'di_clksetpos': [0.36000001430511475, 0.36000001430511475, 0.5759999752044678, 0.5759999752044678]

12:14 <Lofty> Okay, then the order would be RR,RF,FR,FF

12:14 <pepijndevos> so for LUT A to F 1 and 3 vs 2 and 4 are identical, but in the dff 1 and 2 vs 3 and 4 are identical

12:15 <pepijndevos> ... how did you arrive at that order?

12:15 <pepijndevos> that's the thing I want to document

12:18 <pepijndevos> ah... I guess for LUT only the output matters, while for dff only the input matters? hmmmm, bit murky in my brain

12:31 <pepijndevos> Lofty, ?

12:32 <Lofty> Pretty much, yeah, pepijndevos

12:32 <Lofty> To put it another way: what's the combinational output of a flop?

12:33 <Lofty> It doesn't have one, by definition

12:33 <pepijndevos> right

12:33 <Lofty> So the variation *must* be the input

12:34 <Lofty> Which leaves the LUT timing as dependent on the output by definition

12:34 <Lofty> ...

12:34 <Lofty> by elimination

12:34 <Lofty> I did not get much sleep last night

12:35 <pepijndevos> yea... by elimination I'd agree, but combinational it definitely could have a different Rx Lx time

12:36 <pepijndevos> I guess because it depends on the actual lookup table they just took the worst case there, because you cannot at compile time reason about that probably?

12:36 <pepijndevos> Anyway, elimination is good enough for me

12:36 <Lofty> Yeah, I'd imagine so
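
A minimal sketch of the elimination argument above, assuming the order RR, RF, FR, FF is read as (input edge, output edge). The helper `edge_dependence` is hypothetical and not part of apicula; the two value lists are the ones pasted in the log. It only checks which entries repeat, so it says nothing about whether the smaller number belongs to the rising or the falling edge.

```python
# Hypothetical helper, not part of apicula: infers which edge a delay
# depends on from which entries of a 4-value timing list repeat.
# Assumed order (taken from the discussion): RR, RF, FR, FF,
# read as (input edge, output edge).
ORDER = ["RR", "RF", "FR", "FF"]


def edge_dependence(values):
    """Return 'input', 'output', 'both' or 'neither' for a 4-value list."""
    by_label = dict(zip(ORDER, values))
    # Values that do not change with the output edge depend only on the input.
    same_for_both_outputs = (by_label["RR"] == by_label["RF"]
                             and by_label["FR"] == by_label["FF"])
    # Values that do not change with the input edge depend only on the output.
    same_for_both_inputs = (by_label["RR"] == by_label["FR"]
                            and by_label["RF"] == by_label["FF"])
    if same_for_both_outputs and same_for_both_inputs:
        return "neither"
    if same_for_both_outputs:
        return "input"
    if same_for_both_inputs:
        return "output"
    return "both"


# Values as pasted in the log.
a_f = [1.2081600427627563, 1.2542400360107422,
       1.2081600427627563, 1.2542400360107422]
di_clksetpos = [0.36000001430511475, 0.36000001430511475,
                0.5759999752044678, 0.5759999752044678]

print(edge_dependence(a_f))           # LUT A to F delay
print(edge_dependence(di_clksetpos))  # DFF setup time
```

Under that assumed order, the LUT entry varies only with the output edge and the DFF setup entry only with the input edge, which matches the reasoning in the conversation.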

12:37 <pepijndevos> I do think you got R/F backwards because 0.3 is the lower number and I'd expect it to be falling edge

12:39 <Lofty> ...No?

12:39 <Lofty> But, eh, you're the one with a master's degree

12:43 <pepijndevos> eh... a degree does not guarantee correctness. Why do you think the lower number is rising edge?

12:51 <Lofty> I'd think the capacitance in the transistors having to discharge would make it slower

13:04 <pepijndevos> So... take a look at https://upload.wikimedia.org/wikipedia/commons/thumb/2/2f/CMOS_inverter.svg/1200px-CMOS_inverter.svg.png capacitance between rising edge and falling edge is the same. The input is always connected to both the PMOS and NMOS transistor. Except maybe the depletion layer changes a bit?

13:07 <pepijndevos> So... okay let's imagine the depletion layer significantly changes the capacitance, but... hmm the integral of the capacitance during a transition is the same in both directions, so I don't think it matters

13:09 <pepijndevos> But well, a bigger VGS increases the depletion region, so decreases the capacitance, and the PMOS is bigger so has bigger capacitance. So... when the input is low, the PMOS has the smallest capacitance, so according to that theory... yea for some given current the dv/dt is higher at the low end?

13:15 <pepijndevos> Anyway, I think the dominant effect is that a PMOS is like 1/3 as effective as an NMOS, so you'd need to make it 3 times as big to have the same rise time, but yea this increases total capacitance and decreases speed, so I think what they do is compromise and do like a 2x PMOS to have it kinda in the same ballpark and decrease the total capacitance a bit.

13:18 <Lofty> Well, LUTs are generally SRAM connected to pass transistors, rather than like multiplexers

13:18 <Lofty> But yeah

14:11 <Claude> Now throw cross conduction and the miller effect into it :)

14:15 <pepijndevos> Be my guest, I'd be entertained :))

14:17 <Claude> Depending on what the goal of the process is (speed or low power), this is something which definitely needs to be considered for CMOS

14:17 <pepijndevos> Miller effect is just that the capacitor "appears bigger" due to feedback, right? Would that have an effect on rising vs falling edge? Hmmm

14:19 <Claude> Yes, because when the high side PMOS is still conducting (due to the artificially increased gate capacitance) and the low side NMOS is already in the analog mode region, a large amount of current flows

14:20 <pepijndevos> Not sure I follow

14:20 <Claude> But that applies more to output stages...

14:22 <Claude> Switching from 1 to 0 usually takes longer on outputs

14:23 <Claude> So the falling edge timing on inputs needs to be a bit more relaxed

14:24 <Claude> But I'm not that sure about CMOS actually... we had that on a BCD process on a high voltage ASIC, which probably looked way different. So ignore my babbling

14:25 <pepijndevos> I'd like to understand though. The thing you were saying where a large amount of current flows

14:27 <pepijndevos> Are you talking about an inverter during a falling edge, or some other scenario?

14:28 <kbeckmann> pepijndevos: saw your tweet about fuzzing and that it takes more time etc. do you need some computing power? i have a threadripper (32core) that mostly idles right now. happy to run some stuff for you.

14:28 <Claude> Ah ok, there is a certain amount of current needed to charge/discharge the totem pole gate capacitance. The Miller effect makes the high side switch even slower than it already is (PMOS). So while the NMOS is already in its gate threshold region, the PMOS is still conducting

14:28 <pepijndevos> kbeckmann, nah it's fine. Thanks though :)

14:29 <kbeckmann> cool

14:32 <pepijndevos> Am I wrong in assuming that given reasonable layout, the gate voltages on both devices are the same?

14:33 <Claude> This varied a lot over temperature and process

14:34 <Claude> So there is a certain band in which both conduct or start to conduct

14:34 <pepijndevos> band... in time or input voltage?

14:34 <Claude> Voltage

14:34 <pepijndevos> Like... statically. okay yea

14:35 <pepijndevos> Yea so you get some shoot through current during the switching where both devices are on

14:35 <pepijndevos> or you called it cross conducting I think?

14:35 <Claude> The edge slope and gate capacitance give the time component

14:36 <Claude> Yes, we called it cross conduction

14:37 <pepijndevos> Okay, yea I get there will be cross conduction during the transition... but why will it make the falling edge slower?

14:39 <Claude> Because the high side output switch will conduct longer than its gate drive input signal

14:39 <Claude> So the slope of the output, falling, is longer too

14:41 <Claude> At least on BCD :) Because of that (it was a motor drive ASIC) we had to add a lot of dead time in software for the high side switch

14:43 <Claude> Otherwise the external output H-bridge was "cross conducting". But the more I think about it, it's probably really not relevant for a CMOS FPGA LUT input :) hence just ignore my babbling

14:47 <pepijndevos> yea, I think I get it but also... I kind of don't fundamentally get it. Like... I'm just trying to understand where the delay comes from... sure, there is capacitance, but say you instantaneously would impose a voltage at the gate, would there be a delay? No right? So is the secret in the parasitic inductance that connects the gates? Else I can't see how the PMOS would be "slower"

14:51 <pepijndevos> Ah well... I guess I should do some simulations to see what happens to really understand it.

14:52 <pepijndevos> Practically for now the main thing I care about is not getting the timing backwards haha

14:52 <pepijndevos> So yea if this stuff only applies to BJT power electronics and stuff, and my original idea is in the first order kinda correct, I'll just roll with it for now.

14:58 <Claude> Could be, and likely is, that I got it wrong back then :)

20:12 FabM has quit [Quit: Leaving]