#m-labs on 2018-12-20 — irc logs at freenode.irclog.whitequark.org

2018-12-09 21:21 sb0_ changed the topic of #m-labs to: https://m-labs.hk :: Logs http://irclog.whitequark.org/m-labs

01:46 dlrobertson has joined #m-labs

02:59 Gurty has quit [Ping timeout: 252 seconds]

03:01 Gurty has joined #m-labs

03:44 rohitksingh_work has joined #m-labs

04:44 zng has quit [Quit: ZNC 1.8.x-nightly-20181129-f3eca21b - https://znc.in]

04:46 zng has joined #m-labs

05:25 <attie> whitequark: memory inferrence is really messy, but the way migen describes it is calqued on how xilinx wants you to do it. The tools strongly encourage you to use NO_CHANGE preferentially, to the point of printing a message every time you use something else.

05:26 <attie> but they also straight-up assume that you will never have an address collision between different ports

05:26 <attie> despite the verilog that they are inferred from having very clear behavior for that case, the synthesizer will happily ignore it and implement something that does the complete opposite.

05:27 <attie> there are still unresolved SyncFIFO bugs in the code from this.

05:28 <attie> I spent about a month last year looking into this, but I couldn't find a portable solution.

05:30 <attie> I guess the only way around that bug would be something like Memory() being specialized by platform to instantiate the BRAM with the right settings, with the inferrable verilog as a fallback.

05:38 <whitequark> attie: WTF

05:39 <whitequark> well the Xilinx synthesizer is a piece of shit then.

05:39 <whitequark> but thanks for mentioning.

05:40 <whitequark> one more item in the list titled "however immature open-source FPGA tools are, the vendor tools manage to outmatch them each time"...

05:42 <attie> there's still an old PR hanging around with a partial fix that can't be applied cause it breaks for non-xilinx

05:42 <attie> PR #105

05:43 <whitequark> that is horrifying. but at least in nmigen you could use attr translation to do this.

05:43 <attie> I can sort of understand why they do it that way, once they designed their BRAM the way they did

05:43 <attie> it's either that or never infer BRAM at all

05:43 <whitequark> how did they do it?

05:43 <whitequark> I don't remember their design offhand

05:44 <attie> if the write port is in read-first mode the collision is ok

05:44 <attie> otherwise the read port has undefined behavior

05:44 <whitequark> hrm

05:45 <whitequark> attie: wait.

05:46 <whitequark> this doesn't seem right.

05:46 <whitequark> the default mode is WRITE_FIRST, and nothing in misoc or artiq ever switches it to NO_CHANGE.

05:46 <whitequark> how come artiq works at all?

05:46 <attie> write first and no change have the same behavior mostly?

05:46 <attie> it's only different for r/w on the *same* port iirc

05:47 <whitequark> yeah, NO_CHANGE only considers the same port

05:47 <whitequark> that's my whole problem with it

05:47 <whitequark> it does this weird thing where each write port is paired with a read port whether you want it or not

05:47 <attie> that's how xilinx BRAM is

05:48 <whitequark> yeah, but migen is not called xigen

05:48 <attie> it's... extremely inspired by the Xilinx user guide chapter III, HDL coding styles

05:49 <whitequark> well that's depressing

05:49 <whitequark> i *guess* i'll have to delete enough anime to make space for vivado again.

05:49 <attie> which is frankly how you have to write verilog if you want anything approaching sensible output from the xilinx tools.

05:49 <whitequark> ah no, i have the piece of shit installed here already.

05:50 <whitequark> what frontend does xilinx use anyway?

05:50 <attie> they don't really accept verilog input. they accept a sequence of macros.

05:50 <attie> no idea, their own proprietary mess i assume?

05:50 <whitequark> right, this doesn't sound like synopsys

05:51 <whitequark> synopsys actually seems pretty decent

05:51 <attie> I wonder if you could go from rtlil directly to edif netlist.

05:52 <whitequark> write_edif ?

05:52 <attie> avoid the whole "going through a language that was never made to describe the thing we want to do" rigamarole

05:52 <whitequark> but that won't work because nmigen generates yosys coarse grained cells

05:52 <whitequark> not xilinx

05:53 <attie> well, I guess you'd need write_xilinx_edif then...

05:53 <attie> (I have no idea what goes into that. I briefly flirted with the idea about...6 years ago?)

05:55 <whitequark> I *think* I could coerce Yosys into generating the sort of HDL Xilinx tools will recognize

05:55 <whitequark> (there is a very good russian expression for this kind of thing that is unfortunately too obscene to actually mention)

05:56 <attie> right, this is a Very Professional channel

05:56 <whitequark> attie: any idea if Xilinx will at least accept memory init in form of an initial block and not $readmemh?

05:59 <attie> no idea

06:05 <attie> user guide says yes

06:05 <attie> integer i; initial for (i=0; i<DEPTH; i=i+1) ram[i] = 0; end

06:06 <whitequark> ok cool, so the yosys generated verilog stands a chance of being acceptable as is

06:10 <whitequark> attie: ugh, xilinx block rams are corrupted if you asynchronously reset the FFs driving them

06:10 <whitequark> of course, this is exactly what you want to do to reset an AsyncFIFO

06:10 <whitequark> (reliably, that is)

06:11 <attie> asynchronously to which clock, the one driving the port in question?

06:11 <whitequark> yeah

06:12 <whitequark> I mean it doesn't matter for resetting AsyncFIFO.

06:12 <whitequark> (because if you reset it you don't care what happens to the data)

06:12 <attie> right

06:13 <attie> I gave up and made my design non-resettable at some point because of block ram

06:14 <attie> slowly inching my way back these days to a point where I might not have to reprogram the FPGA every time

06:14 <whitequark> hmm I'm really unhappy with this AsyncFIFO behavior because it screws Glasgow over

06:14 <whitequark> so I fixed it properly in nmigen, nmigen has async reset now

06:16 <attie> is glasgow based on a xilinx fpga?

06:18 <whitequark> thankfully it is not

06:18 <whitequark> and will never be

06:19 <whitequark> the current revision is based on Lattice iCE40, which is a small but very nicely designed deivce

06:19 <whitequark> the next on Lattice ECP5

06:19 <whitequark> which is also refreshingly simple architecture wise

06:20 <whitequark> like you can actually just take the transceiver primitive and get it to work. i had to reverse-engineer it a little bit, but the whole thing took me less than 24 hours from never using a transceiver to getting 8b10b data via PCIe

06:21 <whitequark> reverse-engineer here meaning "look up what port and parameter names their cell uses", mostly

06:21 <whitequark> the main problem with Lattice is that they have several departments capable of producing nice silicon, nice docs and reasonably nice tools, but they do not talk to each other

06:21 <attie> *gasp* an fpga without overengineered bloated hardblocks stuffed with bugs because "never mind we can fix it in post"?

06:22 <whitequark> yes. that's ECP5.

06:22 <whitequark> actually i've yet to stumble on a bug in ECP5 or iCE40

06:23 <whitequark> the worst problem I've had with ECP5 was that the bit names in docs, ip generator, register map, and primitive basically never match each other

06:24 <attie> heh. every employee has their own private naming scheme?

06:24 <whitequark> more like every department?

06:24 <whitequark> they're not like super different. it's that, e.g. the one that was doing the fabric added ff_ (for "fabric feed") to every signal name so the cell has those.

06:25 <whitequark> but the one that was doing SERDES IP and documenting it has only learned about that late in the cycle so it's mentioned once in a footnote

06:25 <whitequark> really, i wish all of my FPGA problems were that bad.

06:26 <attie> yeah, that sounds benign

06:26 <attie> you can probably Ctrl-F that footnote looking for "ff_"

06:28 <whitequark> it's mostly that there is no complete description of the SERDES primitive

06:28 <whitequark> so I had to guess that, and also infer from one of the names that it has an undocumented power down pin, etc

06:29 <whitequark> not *really* an issue

07:09 <cr1901_modern> >xilinx block rams are corrupted if you asynchronously reset the FFs driving them

07:09 <cr1901_modern> This seems... bad if e.g. you're executing a program from block RAM and you use async reset to, say, reset the PC.

07:10 <whitequark> well yes

07:10 <whitequark> their workaround is to "not do that"

07:11 <cr1901_modern> "Doctor, it hurts when I asynchronously reset the BRAM"

07:12 <cr1901_modern> Someone (I'll do it if I ever get my hands on one) should test an Altera part too in case there are fun surprises.

07:18 <cr1901_modern> whitequark: What are you unhappy with when you say _this_ AsyncFIFO behavior (1:14:36 AM)? The existence of NO_CHANGE? The fact that async reset corrupts your data? Or both?

07:22 <whitequark> right now if you reset the read or write asyncfifo domain

07:22 <whitequark> you start reading stale data.

07:27 <cr1901_modern> right now as in old migen/sync reset?

07:33 <whitequark> well, asyncfifo doesn't exactly get reset

07:33 <whitequark> yu can reset one half of an asyncfifo

07:33 <whitequark> which never does what you want

07:34 <whitequark> really, it should be either reset completely or reset_less

07:36 <cr1901_modern> I'll have to take a look when I wake up; you explained that you're using async reset b/c "no guarantee the consumer clock will be present". But without that restriction, why can't you in principle make the produce/consume reset signals both reset synchronously using a pulse stretcher?

07:39 <whitequark> you can

07:46 m4ssi has joined #m-labs

08:03 uberardy has quit [Quit: uberardy]

08:20 cr1901_modern has quit [Read error: Connection reset by peer]

11:18 hartytp has joined #m-labs

11:18 <hartytp> sb0: I'm starting to think about plans for Sayma v2.0

11:19 <hartytp> my initial thought is this

11:19 <hartytp> I won't do any work on it until you get RTM DRTIO up and running

11:19 <hartytp> at that point you can work on fixing any bugs and generally getting support for unsynchronised SAWG rock solid

11:20 <hartytp> in parallel to that, I was thinking of moving the DAC control to a kernel

11:20 <hartytp> then implement the sync in much the same way as Urukul

11:20 <hartytp> it feels like it will be much easier to develop and debug the code from kernels

11:21 <hartytp> I'd move all DAC SPI to kernels and expose a few functions to control the JESD core

11:21 <hartytp> what do you think?

11:43 hartytp has quit [Ping timeout: 256 seconds]

12:05 rohitksingh_work has quit [Read error: Connection reset by peer]

12:06 ElementW has quit [Quit: -]

12:06 rohitksingh_work has joined #m-labs

12:39 rohitksingh_work has quit [Read error: Connection reset by peer]

13:24 dlrobertson has quit [Quit: WeeChat 2.3]

13:53 sb0_ has joined #m-labs

13:53 <sb0_> why does it need rtm drtio?

13:54 <sb0_> not enough pins to drive the FF from AMC?

14:01 <sb0_> we can probably move to kernels, the reason it's done in rust was the potential dependency of drtio on the clock tree. and since the hmc7043 is tightly interfaced with the DAC, it made sense to control the DAC from the runtime as well

14:38 sb0_ has quit [Quit: Leaving]

14:59 cr1901_modern has joined #m-labs

15:00 awygle_ has quit [Quit: No Ping reply in 180 seconds.]

15:01 awygle has joined #m-labs

15:42 <sb0> whitequark: can those transceivers be easily operated in fixed-latency mode?

15:42 <sb0> i.e. align the clock divider to the comma, not keep whatever divided clock is there and barrel-shift data to align the comma

15:43 <sb0> this is very annoying to do with xilinx

16:03 rohitksingh has joined #m-labs

17:09 rohitksingh has quit [Remote host closed the connection]

19:03 <cr1901_modern> >not keep whatever divided clock is there and barrel-shift data to align the comma

19:03 <cr1901_modern> Is this what Xilinx IP does?

21:48 <whitequark> sb0: hmm, good question

21:54 <whitequark> sb0: I can't seem to find many details in the UG, but there is a control register that has two mode

21:54 <whitequark> "bitslip word alignment mode" and "barrel shift word alignment mode"

21:54 <whitequark> would the former be what you want?

21:55 <whitequark> it says in re: en_bitslip signal elsewhere:

21:55 <whitequark> SerDes Word Alignment to shift byte clock by 1 UI

21:55 <whitequark> 1 = Slip Rx byte clock by 1 UI for Word Alignment

21:55 <whitequark> 0 = No slip