##openfpga on 2019-12-04 — irc logs at freenode.irclog.whitequark.org

00:14 dh73 has quit [Quit: Leaving.]

00:27 Maylay has quit [Quit: No Ping reply in 300 seconds.]

00:28 Maylay has joined ##openfpga

00:34 rohitksingh has joined ##openfpga

01:19 X-Scale has quit [Quit: HydraIRC -> http://www.hydrairc.com <- \o/]

01:20 zino has quit [Ping timeout: 268 seconds]

01:20 X-Scale has joined ##openfpga

01:21 zino has joined ##openfpga

01:46 Maylay has quit [Ping timeout: 240 seconds]

01:49 Maylay has joined ##openfpga

01:52 freemint has quit [Remote host closed the connection]

01:53 Maylay has quit [Excess Flood]

01:53 freemint has joined ##openfpga

02:04 freemint has quit [Remote host closed the connection]

02:05 freemint has joined ##openfpga

02:07 Maylay has joined ##openfpga

02:10 Maylay has quit [Excess Flood]

02:13 Maylay has joined ##openfpga

02:16 nrossi has joined ##openfpga

02:17 Maylay has quit [Excess Flood]

02:40 Maylay has joined ##openfpga

02:42 Maylay has quit [Excess Flood]

02:46 Maylay has joined ##openfpga

02:48 Maylay has quit [Excess Flood]

02:48 Maylay has joined ##openfpga

02:59 Maylay has quit [Quit: No Ping reply in 300 seconds.]

03:15 Maylay has joined ##openfpga

03:21 Maylay has quit [Ping timeout: 265 seconds]

03:26 Maylay has joined ##openfpga

03:28 <mithro> The more and more I get involved with ASIC design, the more and more I'm surprised we have working devices at all. Not because of physics, not because of the complexity in dealing with 10 million transistors. More because we barely make working software with current best development practices, yet EDA and ASIC people are still using 1980 "best" practices....

03:28 Maylay has quit [Excess Flood]

03:29 Maylay has joined ##openfpga

03:31 <sorear> are current best development practices actually better, or do we just think they're better because they're current best development practices

03:31 <sorear> there was a lot of working software already in the 60s

03:32 Maylay has quit [Excess Flood]

03:32 Maylay has joined ##openfpga

03:38 freemint has quit [Remote host closed the connection]

03:38 freemint has joined ##openfpga

03:40 Maylay has quit [Remote host closed the connection]

03:41 whitequark has quit [Ping timeout: 252 seconds]

03:42 whitequark has joined ##openfpga

03:42 <pie_> sorear: probably using several of the current best development practices

03:42 <pie_> ...tbf stacks were simpler too?

03:42 <pie_> that we dont use

03:42 <pie_> (i mean, idk)

03:43 freeemint has joined ##openfpga

03:45 freemint has quit [Ping timeout: 250 seconds]

03:48 Maylay has joined ##openfpga

03:50 <TD-Linux> as a retro computing connoisseur, I can attest that software was just as bad then

03:58 Bike has quit [Quit: Lost terminal]

03:59 Maylay has quit [Quit: No Ping reply in 300 seconds.]

04:01 Maylay has joined ##openfpga

04:03 <pie_> oops

04:57 <OK_b00m3r> sorear: things like easily accessible high quality version control do make a difference, i believe. before 2000 that wasn't anywhere near as accessible. and I can name quite a few other things that have changed in 30 years

04:57 <OK_b00m3r> however, we are still in a deep crisis... :D

04:57 <OK_b00m3r> but not for want of good tools

04:57 <OK_b00m3r> for want of good culture and organisational maturity imho

04:58 rohitksingh has quit [Ping timeout: 240 seconds]

05:28 freeemint has quit [Ping timeout: 250 seconds]

05:37 freeemint has joined ##openfpga

05:42 freeemint has quit [Ping timeout: 250 seconds]

05:46 freeemint has joined ##openfpga

05:51 freeemint has quit [Ping timeout: 250 seconds]

06:10 rohitksingh has joined ##openfpga

06:21 rohitksingh has quit [Ping timeout: 245 seconds]

06:24 OmniMancer has joined ##openfpga

06:30 m4ssi has joined ##openfpga

06:35 rohitksingh has joined ##openfpga

06:57 rohitksingh has quit [Ping timeout: 250 seconds]

08:13 <whitequark> it's not like tools can't drive culture

08:14 <whitequark> accessible tools often enable cultural shifts. you've mentioned it yourself: version control

08:14 <whitequark> package managers enable code reuse

08:15 <whitequark> daveshah: so i'm thinking about maybe reverse engineering iceMACH 4A5

08:22 freeemint has joined ##openfpga

08:23 marcan has quit [*.net *.split]

08:23 awordnot has quit [*.net *.split]

08:23 simeonm has quit [*.net *.split]

08:23 fseidel has quit [*.net *.split]

08:23 pakesson_ has quit [*.net *.split]

08:23 Mimoja has quit [*.net *.split]

08:23 danilonc has quit [*.net *.split]

08:23 kbeckmann has quit [*.net *.split]

08:23 GityUpNow has quit [*.net *.split]

08:23 duck2 has quit [*.net *.split]

08:24 fseidel has joined ##openfpga

08:24 simeonm has joined ##openfpga

08:24 awordnot has joined ##openfpga

08:24 Mimoja has joined ##openfpga

08:24 pakesson_ has joined ##openfpga

08:24 danilonc has joined ##openfpga

08:24 kbeckmann has joined ##openfpga

08:24 duck2 has joined ##openfpga

08:24 GityUpNow has joined ##openfpga

08:24 marcan has joined ##openfpga

08:26 freeemint has quit [Ping timeout: 240 seconds]

08:33 <juri_> too many tools are preventing our culture from gaining power. with each of us on different dev boards with different tool, all we are able to do is consume, then buy a different one, and consume again.

08:37 <mwk> um what?

08:52 Bob_Dole has joined ##openfpga

09:03 ZombieChicken has quit [Ping timeout: 268 seconds]

10:00 freeemint has joined ##openfpga

10:05 freeemint has quit [Ping timeout: 265 seconds]

11:19 freeemint has joined ##openfpga

11:23 freeemint has quit [Ping timeout: 252 seconds]

11:25 <OK_b00m3r> whitequark: Yes, agreed

11:26 <OK_b00m3r> whitequark: but tool existence alone isn't enough, ime

11:27 <zignig> OK_b00m3r: those tools take time to mature , the important thing is to try them out and see what you can build.

11:28 <zignig> building more tools with those tools is where the momentum comes from.

11:34 freeemint has joined ##openfpga

11:38 freeemint has quit [Ping timeout: 245 seconds]

12:20 freeemint has joined ##openfpga

12:24 freeemint has quit [Ping timeout: 245 seconds]

12:27 freeemint has joined ##openfpga

12:31 freeemint has quit [Ping timeout: 245 seconds]

12:38 freeemint has joined ##openfpga

12:40 freeemint has quit [Remote host closed the connection]

12:40 freeemint has joined ##openfpga

12:55 freeemint has quit [Remote host closed the connection]

12:56 freeemint has joined ##openfpga

13:04 Asu has joined ##openfpga

13:08 Asu` has joined ##openfpga

13:10 Asu has quit [Ping timeout: 268 seconds]

13:16 <OmniMancer> daveshah: did you get the indexing suite headers from ms bond?

13:17 <daveshah> Yeah

13:17 <daveshah> I think so

13:21 <whitequark> oh oops i blinked

13:21 <whitequark> daveshah: about iceMACH 4A5: what do you think would be the best approach? I'm not sure how fuzzers would work for CPLDs

13:21 <whitequark> but that might be just because I don't fully understand how they work for FPGAs

13:22 <daveshah> I guess rqou/azonenberg have experience there

13:22 <daveshah> I guess a lot of the high level techniques are quite similar

13:23 <whitequark> right. there are some muxes in it that'd be easy to enumerate, and some I don't really know how to get at

13:23 <azonenberg> daveshah: i had comments in the coolrunner bitstream which helped a lot (it's ascii text based)

13:23 <whitequark> cross-PLA connections being one of them

13:24 <azonenberg> i also did a fair bit of silicon RE

13:24 <azonenberg> to figure out the crossbar config

13:24 <whitequark> ok that's cheating. I don't even have any of the silicon

13:24 freeemint has quit [Ping timeout: 250 seconds]

13:24 <daveshah> Stuff like FF and IO config would be the same as an FPGA fuzzing wise too

13:24 <azonenberg> http://siliconexposed.blogspot.com/2014/03/getting-my-feet-wet-with-invasive.html

13:24 <azonenberg> then the followup post from a few weeks later

13:24 <whitequark> the IO config is a bit weird there

13:24 <azonenberg> i actually landed probes on the die and sniffed signals off the global interconnect

13:25 <azonenberg> to verify that i knew what was what

13:25 <whitequark> since OE has a product term all on its own (?)

13:25 <daveshah> It would be worth looking if there are some low level ways to config the chip

13:25 <daveshah> See what options the tool has

13:25 <whitequark> I did look closely at ispLEVER

13:26 <whitequark> it's brutally primitive

13:27 <daveshah> Yeah, I think it is Windows only?

13:27 <whitequark> it more or less runs under wine

13:27 <whitequark> synplify gets stuck in an infinite loop at startup, but EDIF input works just fine

13:27 <whitequark> or schematic

13:28 <whitequark> internally it lowers everything to flat BLIF and works with that

13:28 <daveshah> Interesting, I wonder if there is abc inside there

13:28 <OmniMancer> daveshah: they reference a LICENSE file in the root that was not copied over

13:28 <daveshah> A lot of the commercial tools with blif inside also use abc...

13:28 <daveshah> It's not widely used commercially otherwise

13:29 <daveshah> OmniMancer: oops, I need to have a look at that

13:29 <whitequark> I don't think it uses abc

13:29 <whitequark> it has some really weird BLIF

13:30 <whitequark> AFAICT it progressively flattens it until the entire design is described with a single truth table (a .tt4) file

13:30 <whitequark> .... oh

13:30 <whitequark> I just realized why it's called .ttX. because it's a truth table.

13:31 <whitequark> so first it produces a .tt2 which has syntax I've never seen before https://paste.debian.net/1119433/

13:31 <whitequark> then .tt3 which looks identical, and .tt4 like this https://paste.debian.net/1119434/

13:31 <daveshah> Interesting

13:31 <whitequark> you can actually use .tt4 with the flow "officially"

13:31 <whitequark> it's documented as one of valid input formats to ispflow.exe

13:31 <daveshah> Not sure if that's really blif or just something that uses similar framing

13:32 <whitequark> it doesn't call it blif

13:32 <whitequark> it also uses .bli .bl0 .bl1 .bl2 .bl3 files

13:32 <whitequark> which are progressively flattened blif

13:32 <whitequark> normal blif with some vendor extensions

13:32 <whitequark> magic comments specifically

13:32 <whitequark> #$ MODULE count16a

13:32 <whitequark> #$ PINS 6 clock reset COUNT_3_ COUNT_2_ COUNT_1_ COUNT_0_

13:34 <whitequark> I think the placement constraints go into an accompanying ini-structure file that the various passes update as they work

13:34 <whitequark> since one of the features is that it doesn't just give you random placement every time you run it

13:34 <whitequark> I think they call it SpeedLock(tm) or something

13:36 <daveshah> That sounds quite useful for fuzzing

13:37 <whitequark> I can just lock anything at any location in the first place

13:37 <whitequark> it's prominently exposed in the GUI even

13:37 <OmniMancer> daveshah: notably this line: "// Licensed under the MIT license. See LICENSE file in the project root for full license"

13:39 <whitequark> more specifically, I can lock any node to any location in any specific macrocell

13:39 <daveshah> Right I'll fix that when I'm next at a computer

13:39 <daveshah> That's handy

13:39 <whitequark> which is interesting because it actually gives more control than you get on LUT arches

13:39 <whitequark> I think

13:39 <whitequark> AFAIU if you lock two gates to different macrocells it'll have to route them accordingly

13:39 <whitequark> so you can force it to use e.g. adjacent macrocell routing

13:40 <whitequark> even though if they were in the same macrocell, it'd combine them into one term if possible

13:40 <daveshah> I have a feeling that Xilinx ISE had something similar for LUTs

13:41 <daveshah> Some way of forcing LUT boundaries for logic statements

13:41 <whitequark> the JED file actually tells me which fuses are for which PLA and which are for the central switch

13:41 <whitequark> but nothing beyond that

13:42 <whitequark> well, it's also conveniently split into per-macrocell and (I think) per-term blocks with whitespace

13:42 <OmniMancer> so what is in a macrocell in a CPLD?

13:42 <whitequark> it's similar to a LUTFF block in an FPGA

13:42 <whitequark> but it has way more inputs

13:43 <mwk> and it's also simultanously associated with an IOB, usually

13:43 <whitequark> not in this FPGA

13:43 <whitequark> er, this CPLD

13:43 <whitequark> each macrocell can route to 8 IOBs

13:43 <mwk> oh, huh

13:44 <whitequark> it actually seems pretty advanced for a CPLD

13:44 <OmniMancer> so it has one LUTFF per macrocell but the LUT equivalent has more possible inputs?

13:44 <mwk> it's not a LUT

13:44 <mwk> it's a sum-of-products

13:45 <OmniMancer> indeed

13:45 <mwk> so you have many more possible inputs, but you don't have flexibility about the function

13:45 <OmniMancer> what kind of boolean functions cannot be described by SoP?

13:45 <mwk> all of them can

13:46 <mwk> but sometimes you need more layers / inputs / terms than the CPLD has available

13:47 <OmniMancer> ah

13:47 <mwk> same as LUTs, really

13:47 <whitequark> see https://cloud.whitequark.org/s/KQYemizR4SaBp8S

13:48 <whitequark> they have something weird going on with the cheapest device in series

13:48 <whitequark> and something else weird going on with the "IO:macrocell ratio"

13:49 <whitequark> there's also something really weird going on with "asynchronous macrocell mode", which lets you use latches but reduces the amount of product terms

13:49 <mwk> d-do they simulate a latch with combinatorial logic

13:50 <whitequark> you don't even need a storage element enabled, and in fact comb functions should use the "synchronous macrocell mode"

13:50 <whitequark> I have no idea?

13:50 <sorear> all I’m looking for is a cpld with the ability to represent all 128-input boolean functions, is that such a hard problem? /joke

13:50 <whitequark> Fig 5b suggests that... maybe?

13:51 <whitequark> I... think they reroute the product term that goes to the "asynchronous preset" signal to the FF clock in "asynchronous mode"

13:52 <whitequark> but I have no idea why that would cut two product terms off the preceding comb function

13:53 <OmniMancer> seems weird

13:53 <OmniMancer> latches count as async?

13:58 <OmniMancer> daveshah: does the DatabasePath actually get used for anything?

14:04 Marex has quit [*.net *.split]

14:04 moho1 has quit [*.net *.split]

14:04 forrestv has quit [*.net *.split]

14:04 dfgg has quit [*.net *.split]

14:05 <whitequark> I wonder if the M4A-32/32 device (the cheapest one) is more like two of their PALs strapped together

14:09 moho1 has joined ##openfpga

14:09 forrestv has joined ##openfpga

14:09 dfgg has joined ##openfpga

14:09 Marex has joined ##openfpga

14:13 moho1 has quit [*.net *.split]

14:13 Marex has quit [*.net *.split]

14:13 forrestv has quit [*.net *.split]

14:13 dfgg has quit [*.net *.split]

14:14 <whitequark> hm

14:18 <whitequark> the 66-bit blocks in the JED file are probably the product terms

14:19 moho1 has joined ##openfpga

14:19 Marex has joined ##openfpga

14:19 dfgg has joined ##openfpga

14:19 forrestv has joined ##openfpga

14:20 <whitequark> there are 4 comb inputs per macrocell and 1 OE per two macrocells, and I see 11 rows per macrocell pair

14:20 <OmniMancer> I am sometimes confused why some of the tools decide to use the SoP logic expression for some LUTs and just bits or numbers for others

14:20 <whitequark> wait is it literally laid out the same as the datasheet figure?

14:20 <whitequark> they even helpfully numbered the rows

14:21 <OmniMancer> what is an OE?

14:21 <whitequark> output enable

14:21 <OmniMancer> thanks

14:24 <OmniMancer> I thought it might be that but the usage felt weird

14:30 <azonenberg> whitequark: which cpld is this?

14:31 <whitequark> ispMACH 4A

14:31 <whitequark> the last true 5V CPLD still in production and not NRND

14:31 <whitequark> I was shocked to find out there are any left

14:33 <q3k> one of the products i worked on used an ispMACH4000Z, should've reverse engineered it instead of spending time on dealing with garbage windows software for it

14:34 <whitequark> i think 4000 and 4A *might* be similar internally?

14:34 <whitequark> i still can't figure out if 4A is a part of 4000 series or not

14:34 <whitequark> i'm not sure if lattice can figure it out either

14:34 <q3k> sounds about right

14:35 <whitequark> would be neat if they are similar

14:35 <q3k> fwiw i do have a working isplever-in-wine-in-docker setup if you want it for fuzzing

14:35 <whitequark> because 4000Z isn't even "mature"

14:36 <whitequark> oh i have isplever working in wine

14:36 <whitequark> just not synplify

14:36 <whitequark> but i don't really care about synplify

14:36 <whitequark> i can just generate EDIF, probably even with yosys

14:36 <q3k> i don't remember if this used synplify or LSE or what, but it does verilog-to-jedec

14:36 <whitequark> it has both synplify and lse but i think the mach series can only use synplify

14:37 <q3k> this certainly worked with the 4000Z

14:37 <whitequark> it's probably something fucked about my wine setup

14:38 * whitequark looks at the datasheet

14:38 <whitequark> I... can't tell if they have the same design and the docs are just gratuitously changed, or if it's a different device

14:38 <whitequark> they're definitely similar

14:40 <whitequark> ok no, the term sizes are very different

14:40 <whitequark> the basic architecture seems almost identical

14:43 <whitequark> oh, the new docs are way more coherent

14:44 <whitequark> ohhhhh

14:44 <whitequark> I figured out what the heck "asynchronous mode" is

14:44 <whitequark> so the register can be configured as a DFF or DLATCH

14:45 <OmniMancer> and asynch is a latch?

14:45 <whitequark> no

14:45 <whitequark> I think the async mode only gives you SR and JK latches

14:46 genii has joined ##openfpga

14:46 <OmniMancer> makes some sense

14:47 <mwk> JK isn't a latch though, it has to be a flip-flop

14:48 <OmniMancer> is a latch not composed of an SR flip-flop?

14:48 <mwk> no

14:49 <whitequark> mwk: I... think they actually mean a JK latch

14:49 <whitequark> like, they say it oscillates with J=K=1

14:49 <whitequark> "the use is inadvisable"

14:49 <mwk> whitequark: ... what

14:49 <mwk> "inadvisable" — that I can agree with

14:49 <whitequark> The flip-flop can be configured as a D-type or T-type latch. J-K or S-R registers can be synthesized. The

14:49 <whitequark> primary flip-flop configurations are shown in Figure 6, although others are possible. Flip-flop functionality

14:49 <whitequark> is defined in Table 8. Note that a J-K latch is inadvisable as it will cause oscillation if both J and K inputs

14:49 <whitequark> are HIGH.

14:50 <mwk> alright

14:50 <mwk> I take it back

14:50 <mwk> JK can be a latch, if you're stark raving insane

14:51 <mwk> also uh

14:51 <mwk> "T-type latch"

14:51 <mwk> it's... at least JK latch has some sense as long as you're not actually using both J and K at once, but a "T latch"?!?

14:52 <whitequark> I have no idea?

14:52 <OmniMancer> for some reason I associate flip-flop with SR-latch

14:52 <whitequark> I think they mean TFF when they say T-latch

14:53 <whitequark> the storage element is described as ... hm

14:53 <whitequark> ok you know what

14:53 <whitequark> it might actually be a T latch

14:53 <whitequark> what the fuck is a T-latch

14:53 <mwk> aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa

14:54 <mwk> whitequark: well umm by obvious analogy

14:54 <mwk> it's a thing that keeps its value when gate is 0

14:54 <mwk> and oscilates like mad when it's 1

14:54 <azonenberg> lolol

14:54 <whitequark> well yes but

14:54 <whitequark> why

14:54 <whitequark> I *have* to get some silicon now

14:54 <mwk> my thoughts exactly

14:55 <OmniMancer> mwk: for all your metastability needs?

14:58 <OK_b00m3r> :)

14:58 <OK_b00m3r> Precision Metastability Source

15:00 <OmniMancer> Need a programmable logic device that contains a pulse generator that generates pulses specifically just long enough to half set an RS-latch in the device

15:00 <sorear> isn’t the oscillaty part of a PLL just a T latch

15:04 <OmniMancer> sorear: I think its usually a ring oscillator

15:04 <OmniMancer> an odd length loop of inverters?

15:05 <azonenberg> that's a DPLL (digital PLL)

15:05 <azonenberg> a normal PLL is usually an L-C tank or similar based VCO

15:05 <OmniMancer> yes assuming a digital one not a radio one

15:05 <azonenberg> with an analog charge pump to bump the voltage up or down

15:05 <azonenberg> a lot of high end serdes at least use LC plls

15:05 <azonenberg> less jitter i think

15:05 <OmniMancer> ah, interesting

15:06 <whitequark> azonenberg: wait what

15:06 <whitequark> I assumed a DPLL is a fully synchronous circuit

15:06 <OmniMancer> not it uses a ring oscillator as its VCO

15:07 <azonenberg> There's multiple ways to do those too

15:07 <OmniMancer> indeed

15:07 <azonenberg> for example a spartan6 DCM is, afaik, a PLL using a ring oscillator but it adjusts the oscillator period by just muxing inverter stages in and out of the delay line

15:07 <OmniMancer> whitequark: how would you multiply frequencies with a fully synchronous circuit?

15:07 <azonenberg> so you get horrible jitter

15:08 <OmniMancer> erg

15:08 <azonenberg> i think you can get better results if you use some kind of bias voltage to adjust the inverter delay while having a fixed length chain

15:08 <OmniMancer> AFAIK ring oscillators do function as VCOs but no direct experience

15:08 <azonenberg> but my understanding is the most stable plls are lc based, or possibly a quartz resonator with a control voltage

15:09 <azonenberg> OmniMancer: what i meant is, you can make it be a VCO, or you can have an all digital varaible speed ring oscillator

15:09 <azonenberg> where you just change the number of taps in the loop

15:09 <azonenberg> the latter is compatible with a pure digital process using foundry cells only, but jitters way worse

15:09 <whitequark> OmniMancer: you oversample

15:09 <azonenberg> whitequark: that's basically a software PLL then

15:09 <whitequark> yes

15:09 <whitequark> I was misusing the term "DPLL" then

15:09 <azonenberg> using an NCO derived from the refclk

15:10 <azonenberg> but you can't make a pll running faster than your reference that way

15:10 <azonenberg> all you can do is phase lock to an external input +/- one cycle of your input clock

15:10 <cr1901_modern> azonenberg: You need a varactor for the "C" part of the LC in a VCO

15:10 <OmniMancer> whitequark: I mean you have a clock at N Hz, how do you generate a clock at 2*N Hz with no other inputs with a fully synchronous circuit?

15:11 <azonenberg> OmniMancer: actually i think you can frequency double using pos/negedge ffs and some combinatorial xors

15:11 <azonenberg> but i wouldnt recommend it :p

15:11 <OmniMancer> that sounds like a dubious prospect, but I will give you that

15:12 <whitequark> OmniMancer: you can't

15:12 <whitequark> it's not a frequency multiplier, it's a phase locked loop

15:13 <OmniMancer> Sorry, I see above the differing interpretation of DPLL

15:13 <azonenberg> yeah basically its just a question of what your resonant element is

15:13 <azonenberg> If it's an NCO, you can be synchronous

15:13 <OmniMancer> well phase locked loops are often used to do frequency multiplication

15:14 <cr1901_modern> My understanding is that a DPLL has an analog VCO- it's only the phase detector that differs (ADPLL has an NCO)

15:15 <OmniMancer> I have done a PLL like structure in synchronous logic with NCOs for doing MSK demodulation for my final year uni project

15:15 <whitequark> cr1901_modern: interesting

15:16 <OmniMancer> yea you can build different phase detectors if you only care about square waves

15:17 <cr1901_modern> An analog PLL uses a multiplier and relies on one of the trig identities to extract the phase difference. A DPLL uses an edge detector (XOR, or Phase-Freq Detector)

15:17 <OmniMancer> yes multiplier sin phase detector

15:17 <whitequark> azonenberg: any ideas for REing the switch matrix of M4A?

15:18 <OmniMancer> analog PLL used for carrier recovery usually also includes a non-linear function on the input to make the carrier obvious

15:19 <cr1901_modern> Yea I don't remember how Costas PLL works

15:19 <cr1901_modern> if that's what you mean

15:19 <OmniMancer> yea I think what I did was some kind of Costas loop

15:20 <OmniMancer> though that is also recovering clock as well I think

15:20 <azonenberg> whitequark: not without spending a bit of time reading about the architecture

15:21 <OmniMancer> oh no it just does carrier, but the variant I did was also producing the clock I think

15:24 <OmniMancer> And I think Costas PLLs can be used as demodulators themselves

15:24 <whitequark> azonenberg: there is nothing interesting in the docs besides "100% routability"

15:28 emeb has joined ##openfpga

15:28 <azonenberg> whitequark: Hmmm

15:28 <azonenberg> do they mean every subset of inputs can be routed to some outputs? or every input can be routed to every output

15:28 <azonenberg> my guess is its a sparse crossbar, full ones are huge

15:28 <whitequark> >— Central, input and output switch matrices for 100% routability and 100% pin-out retention

15:28 <whitequark> is what it says

15:29 <whitequark> the input and output switch matrices are fully described

15:29 <azonenberg> are there actually 3 crossbars?

15:29 <azonenberg> ah ok

15:29 <azonenberg> so there are, but you only have to RE one

15:29 <whitequark> output switch matrix is a 1:8 mux for every IOB

15:29 <azonenberg> Is this a PLA based cpld like coolrunner?

15:29 <whitequark> input switch matrix is 2 or 3 1:2 muxes depending on sub-family

15:30 <azonenberg> sources -> matrix -> and array -> or array?

15:30 <azonenberg> with and/or both programmable?

15:30 <whitequark> nope

15:30 <azonenberg> so fixed or array and programmable and?

15:30 <whitequark> multiple choices for fixed or array

15:30 <azonenberg> ok

15:31 <azonenberg> So i guess the first thing to do is, generate a bunch of random test bitstreams

15:31 <whitequark> there's also a cascade function

15:31 <azonenberg> and try to figure out which crossbar channel is used for each thing

15:31 <whitequark> oh it says that in the jed

15:32 <azonenberg> basically, you want to enumerate a subset of the possible paths through the crossbar

15:32 <whitequark> NOTE Interleaved Central Switch Matrices for BLOCKS 0 and 3 *

15:32 <azonenberg> So 0 and 3 share common inputs

15:32 <azonenberg> then you have left-going and right-going outputs

15:32 <azonenberg> that sounds exactly like coolrunner

15:32 <azonenberg> my conjecture is that the routing fabric will be quite similar

15:32 <azonenberg> you're going to have a big bus on the top layer

15:33 <whitequark> azonenberg: https://imgur.com/a/3WSnDDB

15:33 <azonenberg> via-programmed muxes selecting N of those K inputs

15:33 <azonenberg> then one-hot pass transistors to enable one of those N each to left and right

15:34 <azonenberg> and lol

15:34 <azonenberg> that drawing looks like it's the exact physical die floorplan

15:34 <whitequark> I am pretty sure it is

15:34 <OmniMancer> daveshah: thanks for the update, you might want to change the message in the .h files to reference the subset of COPYING instead of LICENSE?

15:35 <whitequark> azonenberg: here's the PLA https://imgur.com/a/xfPFBI2

15:35 <azonenberg> anyway so, what i'm seeing is... you have 2 clocks and 24*4 = 96 input signals for a total of 98 nets entering the switch matrix

15:35 <whitequark> and you can literally overlay the jed file on it

15:35 <azonenberg> then 33 outputs going to each function block

15:35 <whitequark> upside down

15:35 <azonenberg> the 66x90 is presumably 33 + their complements as inputs

15:35 <whitequark> yes

15:35 <azonenberg> then 90 product terms

15:36 <azonenberg> So i expect each block's switch matrix is going to be a 98-to-33 sparse crossbar

15:36 <whitequark> I think it's a bit weirder

15:36 <azonenberg> note i said sparse

15:36 <whitequark> see how the 2nd pic has A and B?

15:36 <azonenberg> i only have one pic

15:36 <whitequark> 15:35 < whitequark> azonenberg: here's the PLA https://imgur.com/a/xfPFBI2

15:36 <whitequark> this one

15:37 <azonenberg> whitequark: so some of the 33 come from the top and some from the bottom?

15:37 <azonenberg> That's something you worry about later on, when you are REing the logic array

15:37 <azonenberg> i dont think the crossbar cares about it

15:37 <azonenberg> Soooo

15:37 <whitequark> hmm ok

15:37 <azonenberg> what does the crossbar's bit aspect ratio look like?

15:38 <azonenberg> i'm guessing it's 33 rows per block

15:38 <azonenberg> and each row has N bits for the left and N for the right function block

15:40 <whitequark> hmm

15:40 <azonenberg> are you familiar with the coolrunner crossbar?

15:41 <azonenberg> http://siliconexposed.blogspot.com/2014/03/getting-my-feet-wet-with-invasive.html

15:41 <azonenberg> read that

15:41 <azonenberg> and the followup post

15:41 <whitequark> let's see

15:41 <azonenberg> while i'm not expecting an identical architecture i think it's going to be close enough to give you some good insights

15:49 <whitequark> azonenberg: neeat

15:49 <whitequark> re crossbar: yes, I figured that's approx how it worked

15:49 <whitequark> ice40 has this sort of architecture too

15:49 <azonenberg> my point is, this is bitstream mapped directly to silicon for a similar design

15:50 <azonenberg> really big parts might have a multilevel tree

15:50 <azonenberg> but your cpld is about the size of a coolrunner

15:50 <azonenberg> So basically what you need to do is, if you want to black box it

15:50 <whitequark> that's the second smallest part

15:50 <azonenberg> start generating random bitstreams and collecting data

15:50 <whitequark> their biggest parts don't fit on one page

15:50 <whitequark> they have to use "detail A" on block diagram

15:51 <whitequark> 512 macrocells, hm

15:51 <azonenberg> you want to make a big list, for each possible input to the crossbar

15:51 <azonenberg> what crossbar outputs has it been seen used in?

15:51 <azonenberg> or transpose the matrix... for each crossbar output, what inputs have you seen there?

15:52 <azonenberg> And, for each in-out combination, what was the bitstream coding?

15:52 <azonenberg> you can design targeted fuzzing cases later on, right now you just want to find a subset of the legal paths to figure out the architecture

15:52 dh73 has joined ##openfpga

15:54 <whitequark> wait

15:54 <whitequark> what the fuck

15:54 <azonenberg> ?

15:54 <whitequark> it writes a report where it describes the configuration of every crossbar mux

15:55 <azonenberg> Why reverse when the tools don't keep secrets? lol

15:55 <azonenberg> does not surprise me, the coolrunner reports were extremely verbose too

16:01 <whitequark> hm, so I can easily pin a signal to one specific CSM input

16:02 <azonenberg> Yep

16:03 <azonenberg> Pinning the outputs is where it gets hard

16:03 <azonenberg> i gave up on that on coolrunner after figuring out maybe 1/3 of the matrix by looking at bitstreams, i couldnt figure out how to craft pathological enough configurations to generate the last few

16:03 <whitequark> oic

16:04 <azonenberg> I knew the full configuration table was in an ISE data file but for EULA reasons i didnt want to use that

16:04 <azonenberg> so i needed a cleanly derived source for the data

16:04 <azonenberg> and i was teaching the hardware RE class at the time

16:04 <azonenberg> so i had an obvious path forward

16:04 <azonenberg> at the time i also didn't understand the sparse crossbar structure

16:04 <azonenberg> once i looked at the layout it made total sens

16:28 OmniMancer has quit [Quit: Leaving.]

16:35 <whitequark> hm

16:35 <whitequark> azonenberg: so I think I understand how half of it worsk

16:37 <whitequark> on the 64/32, there are 33 muxes per PLA, 66 interleaved muxes total, the muxes are 9:1, there are 9 rows per CSM block in JED file

16:37 m4ssi has quit [Remote host closed the connection]

16:37 <whitequark> so the JED file is structured as (66+14)x9

16:37 <whitequark> it looks like there are in fact 33 one-hot muxes per PLA in each CSM block

16:38 <azonenberg> Nine makes perfect sense

16:38 <azonenberg> because the switch matrix is 98 inputs

16:38 <whitequark> what I don't understand is

16:38 <azonenberg> So i'm gonna hazard a guess that you have ground, two clocks, and 96 signals for a total of 99 inputs to the fabric

16:38 <whitequark> what the heck is the 14x9 block doing?

16:39 <azonenberg> The 99 inputs are divided into 9 groups of 11 signals

16:40 <azonenberg> nine 11:1 mask programmed muxes as the first level of the tree, then a 9:1 one hot mux for the second

16:40 <azonenberg> as soon as i saw the 98 inputs i saw the symmetry there, 99 is a nice number for a 2 level mux tree

16:41 <azonenberg> The 14x9 is a big question, i agree

16:41 <whitequark> it appears to be organized as 7x2x9

16:50 <whitequark> hmmm

16:50 <whitequark> maybe that's the input switch matrix?

16:51 <whitequark> it has 3 inputs per macrocell pair

16:55 emeb has quit [Quit: Leaving.]

16:57 <whitequark> the input switch matrix needs at most 1 bit per input, as described

16:57 <whitequark> so probably not

17:12 <whitequark> interesting. here's the techlib used by that CPLD: http://noel.feld.cvut.cz/hw/amd/16507b.pdf

17:14 <whitequark> >As discussed above, it is impossible to over-charge or over-discharge the programming cell since

17:14 <whitequark> the mechanism is self-limiting.

17:21 <whitequark> oh

17:21 <whitequark> oh Lattice bought Vantis from AMD

17:22 <whitequark> ok that explains why the datasheets for M4 and MACH4000Z are totally different

17:23 <whitequark> >The same fitter technology included in MACHXL software is seamlessly incorporated into thirdparty tools from leading CAE vendors such as Synario, Viewlogic, Mentor Graphics, Cadence and

17:23 <whitequark> MINC. Interface kits and MACHXL configurations are also available to support design entry and

17:23 <whitequark> verification with other leading vendors such as Synopsys, Exemplar, OrCAD, Synplicity and Model

17:23 <whitequark> Technology. These MACHXL configurations and interfaces accept EDIF 2.0.0 netlists, generate

17:23 <whitequark> JEDEC files for MACH devices, and create industry-standard SDF, VITAL-compliant VHDL and

17:23 <whitequark> Verilog output files for design simulation.

17:23 <whitequark> ok this explains a lot

17:27 <whitequark> http://noel.feld.cvut.cz/hw/amd/mach_.htm

17:38 <whitequark> ok I see, to understand the sync/async macrocells one has to read the docs for the AMD PLDs http://noel.feld.cvut.cz/hw/amd/pld_.htm

17:38 <whitequark> which were spun off as Vantis and which Lattice then bought

17:42 <whitequark> cr1901_modern: hahaha, the fitter for M4A is apparently a direct descendant of the fitter made by AMD in the early 90s. it runs on DOS, of course

17:42 <whitequark> I think they just rebuilt it for win32

17:44 <whitequark> god bless this person, who saved a local copy http://noel.feld.cvut.cz/hw/amd/pld_design.htm

17:44 <cr1901_modern> bahaha wow...

17:45 <whitequark> I wonder if you could just give it the data files from ispLEVER

17:45 <cr1901_modern> At least you're having good luck w/ the lineage of the M4A. Both ECP5 and Mach use NeoCAD format, and AFAICT (daveshah feel free to correct me), it's been trial and error operating on those files since it's not documented at all.

17:46 <whitequark> MachXO?

17:46 <cr1901_modern> yes, MachXO{1,2,3}

17:47 <whitequark> MACH seems to have absolutely nothing in common with MachXO

17:47 <cr1901_modern> except the first 4 letters?

17:47 <cr1901_modern> case insensitive*

17:47 <whitequark> yes

17:53 dh73 has quit [Read error: No route to host]

17:57 <whitequark> https://www.eetimes.com/document.asp?doc_id=1214102 and *this* explains why the toolchain in ispLEVER can actually synthesize for XC2064

17:58 <whitequark> there's also DesignDirect involved somehow

17:59 <whitequark> oh and apparently CoolRunner was originally done by Philips?

18:00 <cr1901_modern> *googles* Huh, no kidding...

18:01 <whitequark> https://www.electronicproducts.com/Digital_ICs/Programmable_logic_evolves_with_improved_foundry_processes.aspx

18:02 <whitequark> of course, ORCA (now Lattice) was Lucent at that time

18:16 <kc8apf> FPGA/CPLD family trees are just as confusing as human family trees

18:16 IanMalcolm has quit [Read error: Connection reset by peer]

18:17 IanMalcolm has joined ##openfpga

18:19 IanMalcolm has quit [Client Quit]

18:20 IanMalcolm has joined ##openfpga

18:45 <TD-Linux> is the switch from PLAs to LUTs related to nvram vs sram for configuration? or did that just happen at about the same time

18:54 <kc8apf> TD-Linux: oddly, looking back at PLAs brought up the whole Lattice family tree from earlier

18:54 <kc8apf> MMI was bought by AMD then spun out as part of Vartis which was acquired by Lattice

18:56 <kc8apf> MMI was 2nd source licensee for xc2000 and I believe had some exposure to xc3000 and xc4000

18:56 <kc8apf> I actually have qty 2 of the MMI2064

20:03 ZombieChicken has joined ##openfpga

20:19 ZombieChicken has quit [Remote host closed the connection]

20:24 Asu has joined ##openfpga

20:24 Asu` has quit [Ping timeout: 268 seconds]

20:41 mumptai has joined ##openfpga

21:05 emeb has joined ##openfpga

21:06 nrossi has quit [Quit: Connection closed for inactivity]

23:04 Asu has quit [Remote host closed the connection]

23:04 rohitksingh has joined ##openfpga

23:11 mumptai has quit [Quit: Verlassend]

23:33 Bike has joined ##openfpga

23:33 genii has quit [Quit: Time for beer and hockey.]

23:51 <mithro> https://twitter.com/ApertusOSCinema/status/1202280446408773632?s=20

23:56 <TD-Linux> mithro, that would be a cool platform to port hdmi2usb to

23:59 <mithro> TD-Linux: assuming it is cost effective, yes!